Data Engineer

On Site Full TimeHyderabad, Telangana, IndiaAmgen

Build and operate large-scale healthcare data pipelines across batch workflows, metadata-driven ingestion, and data service publishing. Own end-to-end engineering from source ingestion to conformed data products, with strong focus on reliability, data quality, and operational observability.

Requirements

  • Design and maintain PySpark/SQL pipelines in Databricks for landing, unified, unstitched, and published data layers.
  • Build and support Airflow DAGs for scheduling, dependencies, retries, and production operations.
  • Implement metadata/config-driven frameworks for ingestion, transformation, and rule-based processing.
  • Develop robust data quality controls, DQ summaries, failure handling, and alerting workflows.
  • Manage batch/process audit logs, run status tracking, release flags, and operational reporting.
  • Integrate multi-source data (files, APIs, cloud storage, and relational systems) into governed Delta/Spark tables.
  • Optimize pipeline performance using partitioning, parallelization, and query tuning.
  • Collaborate on schema evolution, business-rule onboarding, and production support.

Tagged as: ,

To apply for this job please visit amgen.wd1.myworkdayjobs.com.


You can apply to this job and others using your online resume. Click the link below to submit your online resume and email your application to this employer.

Tired of manual job applications?

JobCopilot auto-applies to thousands of RevOps and GTM roles on your behalf — so you can focus on interviews, not applications.

Applying for this role?

Tailor your resume to this exact role — hiring managers notice the difference.

Latest articles on the blog

RECRUITERS!

Reduce the risk of your recruitment process (applicant quality, long and inefficient process) by selecting from a relevant pool of candidates.

POST A NEW JOB NOW!