Hiring Shanghai (Remote negotiable)

Data Validation Scientist

数据价值验证研究员 · Prove the value of data through experiments

The Role

You will answer one question: Does our data actually work?

We build evaluation datasets, but data quality can't be self-proclaimed.
You need to prove through experiments that models trained with our data are genuinely better.

What You'll Do

Design Validation Experiments

  • Design controlled experiments to verify the impact of different datasets on model performance
  • Define evaluation metrics to measure "how much the data improved the model"
  • Control variables to ensure credible experimental conclusions

Run Model Training

  • Use 3B/7B small models for rapid validation experiments
  • Proficient in fine-tuning, SFT, DPO, and other methods
  • Experienced with training frameworks like LLaMA-Factory and Axolotl

Analyze Experimental Results

  • Interpret training results and assess data effectiveness
  • Identify data issues (which data is useful, which is noise)
  • Produce visual reports that non-technical stakeholders can understand

Feed Back into Data Iteration

  • Guide data collection and labeling improvements based on experimental results
  • Collaborate with Evaluation Scientists to form a "data -> validation -> improvement" loop

What We're Looking For

Required Skills

  • Model training: Familiar with LLM fine-tuning workflows (SFT, DPO, RLHF concepts)
  • Experiment design: Understands variable control, control groups, and statistical significance
  • Result analysis: Can draw reliable conclusions from experimental data
  • Tool proficiency: PyTorch, Transformers, and common training frameworks

Required Mindset

  • Genuinely curious about how data affects models
  • Pursues experimental rigor — not satisfied with "it ran, so it's done"
  • Willing to communicate with non-technical teams and explain experimental findings

Nice to Have

  • Experience with RLHF or human feedback
  • Published relevant papers
  • Experience in data quality assessment
  • Familiarity with reward model training

Not a Fit If You

  • Only tune hyperparameters without understanding the "why"
  • Only care about models and not about data
  • Cannot explain results to non-technical people

Want to chat before applying?

王瑶" onerror="var d=document.createElement('div');d.innerHTML=this.dataset.fallback;this.replaceWith(d.firstChild)" />
王瑶 VP, People & Culture
赵建军 HR
叶心蕾" onerror="var d=document.createElement('div');d.innerHTML=this.dataset.fallback;this.replaceWith(d.firstChild)" />
叶心蕾 AI HR