Data Engineer | 5-8 YoE | Join a Fast-Growing HealthTech Startup in Pune

We’re hiring an experienced Data Engineer to design and build modern data pipelines that power advanced analytics, AI, and healthcare insights.

If you thrive in cloud-native environments and love transforming complex, multi-source data into meaningful intelligence, this role is for you.

⚙️ What You’ll Work On

  • Design and maintain scalable batch and streaming data pipelines using Google Cloud Dataflow, Datastream, and Airbyte
  • Develop and optimize ETL / ELT processes across AWS Postgres, Google FHIR Store, and BigQuery
  • Build unified data models integrating EHR / FHIR, claims, HL7, CRM, and transactional data
  • Implement transformations in dbt / OBT to create curated semantic layers for AI / BI pipelines
  • Ensure data quality, lineage, validation, and HIPAA compliance across all pipelines
  • Collaborate with AI / ML, BI, and product teams to deliver data-driven insights
  • Drive cost optimization and performance tuning for BigQuery and streaming systems
  • Contribute to architectural decisions and mentor junior engineers on best practices

What You Bring

  • 5–8 years of hands-on experience in data engineering
  • Deep proficiency in SQL (joins, window functions, performance tuning) and Python
  • Proven expertise in Google Cloud (Dataflow, Datastream) and BigQuery (partitioning, clustering, optimization)
  • Solid experience with ETL / ELT orchestration tools (Airflow, Prefect, Dagster)
  • Strong understanding of data modeling (star / snowflake) and cloud-native architectures (GCP / AWS)
  • Familiarity with healthcare data standards (FHIR, HL7, X12, ICD, ADT, CCDA)
  • Experience ensuring HIPAA-compliant data governance and security

Nice-to-Have

  • Experience with real-time streaming and feature stores for AI / ML pipelines
  • Hands-on experience with Power BI or other BI tools for analytics enablement
  • Background in Value-Based Care, ACOs, or population health
  • Exposure to healthcare standardization frameworks

Location: Pune (Work from Office)

☁️ Tech Stack: GCP, Dataflow, Datastream, BigQuery, Python, SQL, dbt

Domain: HealthTech / Cloud Data Engineering

Join a growing healthtech product team and help build the data backbone that powers AI, predictive analytics, and better patient outcomes.
