Allen AI Releases 4 MolmoPoint Datasets and Models in a Row
Fine-grained human judgment Becomes Fuel for Multimodal Agents
This week scanned 86 HF orgs · 50 GitHub orgs · 71 blogs · 125 X accounts
Allen AI released 4 MolmoPoint-related datasets/models consecutively from 2026-03-15 to 2026-03-17, with video and GUI pointing to data-intensive growth [P0]; NVIDIA simultaneously disclosed RL and SFT training data from 2026-03-18 to 2026-03-19, accelerating the assetization of post-training data [P0]; NVIDIA's robotics and Physical AI datasets continue to lead in downloads, with teleoperation demonstrations becoming the strongest public demand signal [P1]. This week's strongest data demand signal: video understanding/tracking data.
Key Findings
This week's 5 high commercial value findings
Allen AI released allenai/MolmoPoint-TrackSyn on 2026-03-15, with 94 downloads and 2 likes; on the same day, it also released allenai/MolmoPoint-TrackAny, with 108 downloads and 2 likes. On 2026-03-16, it released the model allenai/MolmoPoint-8B, with 289 downloads and 11 likes. On 2026-03-17, it released the models allenai/MolmoPoint-GUI-8B and allenai/MolmoPoint-Vid-4B, with 91 downloads each. Previously, the related dataset allenai/MolmoPoint-GUISyn was released on 2026-02-24, with 265 downloads and 6 likes; allenai/Molmo2-VideoPoint has now reached 440 downloads, up +22 from the previous period.
nvidia/Nemotron-Cascade-2-RL-data was released on 2026-03-18, with 15 downloads and 12 likes; nvidia/Nemotron-Cascade-2-SFT-Data was released on 2026-03-19, with 32 downloads and 10 likes. The corresponding paper, "Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation," was released on 2026-03-19. The dataset description explicitly includes instruction-following RL, multi-domain RL, on-policy distillation, and software engineering RL. During the same period, nvidia/Nemotron-RL-bixbench_hypothesis was released on 2026-03-14, with 2,534 downloads and 4 likes.
nvidia/PhysicalAI-Robotics-Open-H-Embodiment was released on 2026-02-06, with 37,433 downloads and 8 likes; nvidia/PhysicalAI-Robotics-Manipulation-Kitchen-Demos was released on 2026-02-10, with 20,849 downloads and 38 likes, and the dataset includes 600 hours of human teleoperation demonstrations, 316 tasks, and 55k trajectories. The larger-scale nvidia/PhysicalAI-Autonomous-Vehicles has reached 214,152 downloads and 785 likes. On Meta's side, facebook/ego-1k was released on 2026-01-29, with 5,903 downloads, further strengthening egocentric 3D/multiview data.
stepfun-ai/Step-3.5-Flash-SFT was released on 2026-03-14, with 27,044 downloads and 260 likes, making it one of the highest-downloaded new SFT datasets this week. Its tags cover chat, sft, instruction-tuning, reasoning, and code. InternLM released internlm/VC-RewardBench on 2026-03-12, with 1,810 downloads and 6 likes, and simultaneously released the internlm/Visual-ERM model, whose tags directly reference dataset:internlm/VC-RewardBench. internlm/EndoCoT-Data was released on 2026-03-11, with 1,764 downloads and 6 likes, ranking first among this week's Download Movers.
CausalRM, published on 2026-03-19, proposes learning reward models from observational user feedback. MOSAIC, also published on 2026-03-19, discusses multi-objective slice-aware iterative curation. Efficient Exploration at Scale, published on 2026-03-18, emphasizes online updates to choice data. Via Negativa for AI Alignment, published on 2026-03-17, argues that negative-only feedback can approach or surpass standard RLHF. HIPO, also published on 2026-03-17, focuses on hierarchical instruction adherence. During the same period, Anthropic released news on large-scale qualitative user feedback from “81,000 people.”
Demand Signals
Infer training data demands from model releases
Download Movers
Datasets with the largest download changes this week
| Dataset | Downloads | Weekly Growth |
|---|---|---|
| nvidia/HiLiftAeroML | 1,200 | +66.4% |
| laion/majestrino-data | 7,837 | +28.4% |
| allenai/asta-summary-citation-counts | 509 | +11.6% |
| allenai/Molmo2-VideoPoint | 440 | +5.3% |
| internlm/EndoCoT-Data | 1,764 | new |
Want to discuss this issue?
Auto-generated by AI Dataset Radar · Updated weekly
AI Dataset Radar →