C the Signs

Lead Data Engineer

Reposted 19 Days Ago
Remote
Hiring Remotely in United States
Senior level
The Lead Data Engineer will architect and develop a healthcare data platform on GCP, guide teams in pipeline development and data modeling, and ensure HIPAA compliance. They will collaborate across functions, set technical direction, and mentor junior engineers.
The summary above was generated by AI

We are seeking a Lead Data Engineer to architect, build, and scale our next-generation healthcare data platform. In this role, you will lead the effort to design robust pipelines, modernize data architecture, and ensure high-quality ingestion and transformation of clinical and operational data. You’ll collaborate closely with product, analytics, clinical informatics, machine learning, and engineering teams to deliver trusted, timely, and compliant insights.

This is a hands-on leadership role ideal for someone who enjoys setting technical direction while still contributing code and guiding stakeholders through complex healthcare data challenges.

Responsibilities
Architecture & Strategy
  • Lead design and evolution of our cloud-native data platform built primarily on Google Cloud Platform, including BigQuery, Cloud Storage, Pub/Sub, Cloud Run, Airflow (Cloud Composer), and Healthcare API.
  • Inform strategic decisions around multi-cloud or AWS interoperability when needed.
  • Establish data engineering best practices, coding standards, and architectural patterns.
Pipeline Development
  • Build scalable ETL/ELT pipelines using dbt for transformations and Airflow for orchestration.
  • Develop ingestion pipelines for clinical and administrative data in HL7, FHIR, DICOM, and custom formats.
  • Develop ingestion and transformation pipelines to be used for AI/ML development and model training.
  • Implement streaming and batch dataflows using Pub/Sub, Dataflow, and serverless compute.
  • Support or guide integrations with AWS-based partner systems or AWS-hosted data sources when applicable.
Data Modeling & Warehousing
  • Design and maintain BigQuery datasets, semantic layers, and warehouse structures.
  • Leverage industry standards such as FHIR resources for canonical healthcare models.
  • Provide guidance on data modeling and warehouse best practices across both GCP and AWS ecosystems.
Data Quality, Observability & Governance
  • Implement data quality frameworks, automated testing, and monitoring.
  • Ensure HIPAA compliance and proper handling of PHI/PII across all pipelines and cloud environments.
  • Drive lineage, documentation, metadata governance, and dbt docs adoption.
Leadership & Collaboration
  • Partner with analytics, product, clinical informatics, and security teams to deliver high-quality, trustworthy data products.
  • Provide oversight and technical direction for multi-cloud data integrations with AWS-based systems or partners.
  • Assist in the recruitment and development of junior data engineers.

Requirements
  • 7+ years of data engineering experience; 2–3+ years in a lead or senior technical role.
  • Deep, hands-on expertise in GCP, particularly:
    • BigQuery
    • GCP Healthcare API (FHIR and DICOM stores)
    • Cloud Storage, Pub/Sub, Cloud Run/Functions
  • Strong proficiency with:
    • dbt (Core or Cloud)
    • Airflow (Cloud Composer or self-managed)
    • Python and advanced SQL (BigQuery preferred)
  • Hands-on experience with healthcare standards:
    • FHIR (R4/US Core), HL7 v2/v3, DICOM, C-CDA, X12
  • Strong understanding of PHI handling, HIPAA compliance, and healthcare interoperability.
Preferred
  • AWS experience, especially with:
    • Redshift, Lambda, S3, Glue, Kinesis, Athena, API Gateway, Step Functions
  • Experience building or maintaining multi-cloud pipelines bridging GCP and AWS.
  • Background with Dataflow/Beam or other stream processing frameworks.
  • Experience working with EHR integrations, claims processing, HIEs, or clinical data networks.
  • Familiarity with ML-enabled data pipelines or feature engineering in healthcare contexts.

Benefits

Why Join Us?

Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits:

  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
