Data Engineer (Python & PySpark)

On Site Full TimeBengaluru, Karnataka, IndiaZensar

Design, develop, and maintain end-to-end ETL/ELT pipelines using Python and PySpark. Build large-scale data processing frameworks to handle structured and unstructured data, ensuring high performance and reliability. Architect and manage data solutions within the GCP ecosystem, focusing on cost-efficiency and security.

Requirements

  • Strong proficiency in Python, including experience with libraries like Pandas, NumPy, and logging frameworks.
  • 3+ years of hands-on experience with Apache Spark (PySpark) for distributed data processing.
  • Practical experience with Google Cloud services, specifically BigQuery, Cloud DataProc or Dataflow, Cloud Storage, Cloud Functions, and Cloud Composer.
  • Solid understanding of relational databases and SQL (PostgreSQL, MySQL) as well as NoSQL environments.
  • Experience with Git, Docker, and CI/CD pipelines. Familiarity with Terraform or other IaC tools is a significant plus.

Tagged as:

To apply for this job please visit fa-etvl-saasfaprod1.fa.ocs.oraclecloud.com.


You can apply to this job and others using your online resume. Click the link below to submit your online resume and email your application to this employer.

Tired of manual job applications?

JobCopilot auto-applies to thousands of RevOps and GTM roles on your behalf — so you can focus on interviews, not applications.

Applying for this role?

Tailor your resume to this exact role — hiring managers notice the difference.

Latest articles on the blog

RECRUITERS!

Reduce the risk of your recruitment process (applicant quality, long and inefficient process) by selecting from a relevant pool of candidates.

POST A NEW JOB NOW!