Allen AI Withdraws 29 Video Tracking Datasets
AI Data Intelligence Weekly
This week scanned 86 HF orgs · 50 GitHub orgs · 71 blogs · 125 X accounts
Allen AI withdraws 29 video tracking datasets, signaling video understanding data shortage [P0], coding agent trajectory data becomes scarce resource as TogetherAI withdraws CoderForge-Preview dataset [P0], Chinese embodied intelligence dataset BAAI/ToucHD series withdrawn, tactile data emerges as new frontier [P1]. This week's strongest data demand signal: Video Understanding/Tracking Data.
Key Findings
This week's 5 high commercial value findings
Allen AI suddenly withdrew all 29 video datasets from the Molmo2 series on March 5, including core video understanding benchmarks like VideoLocalizedNarratives, VideoMME, and TVQA. These datasets were originally used to train video tracking and understanding capabilities for their multimodal models. Concurrently, NVIDIA added embodied intelligence repositories like Isaac-GR00T (6,321 stars), showing the industry is competing for video-action alignment data.
TogetherAI withdrew the CoderForge-Preview dataset on March 5, which contained high-quality coding agent execution trajectories. Concurrently, OpenAI released the codex repository (63,080 stars), and Anthropic's claude-code reached 73,813 stars. The paper "A Rubric-Supervised Critic from Sparse Real-World Outcomes" (2026-03-04) proposes learning evaluation models from sparse human interactions.
Beijing Academy of Artificial Intelligence (BAAI) withdrew three robot tactile datasets - ToucHD-Force, ToucHD-Mani, and ToucHD-Sim on March 5, 2026. These datasets originally contained force feedback and tactile information from robot manipulation. NVIDIA concurrently released PhysicalAI-Robotics-NuRec and Arena-GR1-Manipulation datasets, showing tactile modality becoming a key bottleneck in embodied intelligence.
EleutherAI withdrew djinn-problems-v0.9 and rh-misalignment-control-sft datasets. NVIDIA's SPEED-Bench and Microsoft's TestExplora evaluation benchmarks were simultaneously withdrawn. The paper "QEDBENCH: Quantifying the Alignment Gap" (2026-02-24) shows academia is establishing stricter model alignment evaluation standards.
The paper "JANUS: Structured Bidirectional Generation" (2026-03-04) proposes a framework that simultaneously addresses Fidelity, Control (logical constraint control), Reliability (uncertainty estimation), and Efficiency (computational efficiency). SuperAnnotate released MCP Server tools supporting AI agents to directly connect labeling projects.
Demand Signals
Infer training data demands from model releases
Deep Dive — DataRecipe
This week's 3 high-value datasets reverse-analyzed (auto-generated by DataRecipe)
Data Structure
Risk Assessment
Data Structure
Risk Assessment
Data Structure
Risk Assessment
This week analyzed 3 datasets · 99.6% human involvement
Want to discuss this issue?
Auto-generated by AI Dataset Radar · Updated weekly
AI Dataset Radar →