Applied Data Scientist, Evaluation & Model Behavior

On Site | Full Time | San Francisco, California, United States | AGI, Inc.

Join AGI, Inc. as an Applied Scientist focused on Evaluation & Model Behavior, designing and implementing systems to measure and improve the performance of Computer Use Agents. As a key member of the team, you will establish the technical definition of model quality and ensure that every metric, dataset, and prompt moves us toward more reliable, trustworthy agents.

Requirements

  • Master’s degree or PhD in Computer Science, Data Science, Statistics, or a related technical field
  • 3+ years of experience in Data Science, Machine Learning, or Applied Science
  • Proficiency in Python, with experience writing production-quality code for data pipelines or evaluation harnesses
  • Experience with experimental design, A/B testing, or statistical analysis

Benefits

  • Competitive company-sponsored medical, dental, and vision insurance
  • Top-tier relocation and immigration support
