NVIDIA video benchmark surges to 2,479 downloads in two weeks
Video scene judgment becomes a new data frontier
This week scanned 86 HF orgs · 50 GitHub orgs · 71 blogs · 125 X accounts
NVIDIA's PhysicalAI-VANTAGE-Bench reached 2,479 downloads within 14 days of its release on 2026-05-04, while the Subset version reached 1,284 downloads after its release on 2026-05-05 [P0]; LAION added 16 rl_environment, 4 reward_model, and 1 rlhf_preference datasets this period, forming a systematic alignment data stack [P0]; Meta and Google simultaneously strengthened multilingual quality datasets, with facebook/bouquet and google/fleurs reaching 1,435 and 57,173 downloads respectively [P1]. The strongest data demand signal this week: fixed-camera video understanding / cross-camera tracking data.
Key Findings
This week's 5 high commercial value findings
nvidia/PhysicalAI-VANTAGE-Bench was released on 2026-05-04 and currently has 2,479 downloads and 9 likes; nvidia/PhysicalAI-VANTAGE-Bench-Subset was released on 2026-05-05 and currently has 1,284 downloads and 1 like. Change tracking shows VANTAGE-Bench increased from 19 in the previous period to 2,479, up by 2,460 downloads or 12,947.4%; the Subset version rose from 6 to 1,284, up by 1,278 downloads or 21,300.0%. Both focus on video understanding tasks from fixed infrastructure cameras, covering real-world scenarios such as warehouses and smart cities.
In the change data, rl_environment increased from 1 to 16 datasets, adding 15 new ones; reward_model rose from 0 to 4; rlhf_preference rose from 0 to 1. Representative datasets include laion/nemotron-gym-safety, laion/nemotron-gym-agent-workplace, laion/nemotron-gym-agent-calendar, laion/nemotron-gym-competitive-coding, laion/scaling-laws-for-comparison-full, as well as laion/mix_h10_reward_binary-v2, laion/mix_h10_reward_proportional-v2, laion/mix_h10_reward_staged-v2, and laion/mix_baseline_uniform-v2, all of which appeared for the first time this period.
facebook/bouquet was released on 2025-06-10 and currently has 1,435 downloads and 36 likes. It is a many-to-many parallel translation quality evaluation set across 8 languages, with underlying text manually created by linguists. google/fleurs was released on 2022-04-19 and currently has 57,173 downloads and 402 likes, covering speech recognition in 102 languages, with labels including expert-generated, crowdsourced, and machine-generated sources. Together, both point to multilingual speech/translation quality evaluation rather than simple corpus expansion.
internlm/WildClawBench was released on 2026-03-24 and currently has 8,250 downloads and 59 likes, up by 567 from 7,683 in the previous period. Change data also shows microsoft/Orchard newly added 166 downloads and 8 likes, and microsoft/WebTailBench newly added 366 downloads and 16 likes; both are categorized as agent_tool. Databricks' databricks/officeqa was released on 2025-12-15 and currently has 131 downloads, focusing on end-to-end reasoning over real documents.
allenai/olmoearth-paper-embeddings was released on 2026-05-15 and currently has 2,876 downloads and 2 likes, providing paper embeddings for 26 Earth observation foundation models across 24 downstream tasks. databricks/officeqa was released on 2025-12-15 and currently has 131 downloads, centered on grounded reasoning over U.S. Treasury bulletin documents dating back to the 1930s. Meanwhile, Microsoft Research published a blog related to SocialReasoning-Bench on 2026-05-14, emphasizing that agents may execute tasks but do not necessarily continue improving users' situations.
Demand Signals
Infer training data demands from model releases
Download Movers
Datasets with the largest download changes this week
| Dataset | Downloads | Weekly Growth |
|---|---|---|
| nvidia/PhysicalAI-VANTAGE-Bench-Subset | 1,284 | +21300.0% |
| nvidia/PhysicalAI-VANTAGE-Bench | 2,479 | +12947.4% |
| laion/Scientific-Summaries | 34,214 | +1241.7% |
| microsoft/delulu-fim-benchmark | 659 | +112.6% |
| internlm/WildClawBench | 8,250 | +7.4% |
Want to discuss this issue?
Auto-generated by AI Dataset Radar · Updated weekly
AI Dataset Radar →