Data Engineer

On Site Full TimeHyderabad, Telangana, IndiaAmgen

Build and operate large-scale healthcare data pipelines across batch workflows, metadata-driven ingestion, and data service publishing. Own end-to-end engineering from source ingestion to conformed data products, with strong focus on reliability, data quality, and operational observability.

Requirements

  • Design and maintain PySpark/SQL pipelines in Databricks for landing, unified, unstitched, and published data layers.
  • Build and support Airflow DAGs for scheduling, dependencies, retries, and production operations.
  • Implement metadata/config-driven frameworks for ingestion, transformation, and rule-based processing.
  • Develop robust data quality controls, DQ summaries, failure handling, and alerting workflows.
  • Manage batch/process audit logs, run status tracking, release flags, and operational reporting.
  • Integrate multi-source data (files, APIs, cloud storage, and relational systems) into governed Delta/Spark tables.
  • Optimize pipeline performance using partitioning, parallelization, and query tuning.
  • Collaborate on schema evolution, business-rule onboarding, and production support.

Tagged as: ,

Before applying for this position you need to submit your online resume. Click the button below to continue.

Tired of manual job applications?

JobCopilot auto-applies to thousands of RevOps and GTM roles on your behalf — so you can focus on interviews, not applications.

Applying for this role?

Tailor your resume to this exact role — hiring managers notice the difference.

Latest articles on the blog

RECRUITERS!

Reduce the risk of your recruitment process (applicant quality, long and inefficient process) by selecting from a relevant pool of candidates.

POST A NEW JOB NOW!