We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and jurisdictionally correctly is mission-critical—and evaluation complexity is increasing 10x.
Requirements
- 4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
- Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
- Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data
- Ability to work deeply with legal experts and operationalize complex evaluation methodologies
- Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
Benefits
- Competitive salary ($178,500 – $210,000 USD)
- Company matching 401k
- Relocation assistance
- Flexible work arrangements
To apply for this job please visit jobs.ashbyhq.com.

Follow us on social media