We are seeking an expert in evaluating machine learning and deep learning models, including foundation models and multimodal systems. The ideal candidate combines strong analytical thinking, expertise in Python, and advanced knowledge of statistical methodologies and data quality standards.
Requirements
- BS degree and a minimum of 3 years of relevant industry experience
- Strong experience in evaluating supervised, unsupervised, and deep learning models
- Hands-on experience evaluating LLMs and using them as scoring/judging mechanisms
- Familiarity with multimodal models and related evaluation challenges
- Proficiency in Python and libraries such as NumPy, pandas, scikit-learn, PyTorch, or TensorFlow
- Solid understanding of statistical testing, sampling, confidence intervals, and metrics
Benefits
- Full-time employment
To apply for this job, please visit jobs.apple.com.
