The Data Engineer will design and maintain data pipelines on Azure, manage ClickHouse for analytics, and ensure healthcare compliance while supporting ML integration.
Position Summary:
MedReview Innovation and Development team is seeking a data engineer to function as the primary architect and operator of our data infrastructure. Your mission is to evolve our current environment into a rapid-acquisition engine capable of feeding real-time ML models, innovation, and operations while maintaining rigorous healthcare compliance standards.
Responsibilities:
Salary: 105,000 - 115,000
MedReview Innovation and Development team is seeking a data engineer to function as the primary architect and operator of our data infrastructure. Your mission is to evolve our current environment into a rapid-acquisition engine capable of feeding real-time ML models, innovation, and operations while maintaining rigorous healthcare compliance standards.
Responsibilities:
- Pipeline Architecture: Design, implement, and maintain end-to-end data pipelines on Azure, ensuring high availability and low latency for healthcare claim and analytics processing.
- High-Performance Storage: Manage and optimize ClickHouse as our primary analytical engine, focusing on rapid data ingestion and lightning-fast query performance for large-scale datasets.
- ML Data Readiness: Structure data environments to support the full ML lifecycle, from feature engineering and training to real-time model inference.
- MLOps Integration: Collaborate with Data Scientists to implement automated CI/CD pipelines for model deployment, monitoring, and retraining.
- Rapid Acquisition: Develop scalable frameworks to ingest diverse healthcare data sources (EDI, claims, clinical notes) with high velocity.
- Security & Compliance: Ensure all data structures and processes adhere to HITRUST/HIPAA standards, collaborating with IT and the leads for technical efforts for HITRUST certification readiness.
- Cloud Expertise: 5+ years of experience in data engineering, with deep proficiency in Azure Data Factory, Azure Databricks, or Azure Synapse.
- OLAP Mastery: Proven experience managing and tuning ClickHouse (or similar columnar databases like Druid/Pinot) for massive datasets.
- Programming: Expert-level Python and SQL skills.
- ML Engineering: Familiarity with ML frameworks (PyTorch, TensorFlow) and MLOps tools (MLflow, Kubeflow, or Azure Machine Learning).
- Healthcare Domain: Prior experience with healthcare data formats (HL7, FHIR, 835/837) and a strong understanding of HITRUST/HIPAA security requirements.
- Scale-up Mindset: Ability to build "v1" processes while designing for 10x growth.
- Experience with Infrastructure as Code (Terraform, Bicep).
- Knowledge of stream processing (Kafka, Azure Event Hubs).
- Background in financial or payment integrity analytics.
Salary: 105,000 - 115,000
What you need to know about the NYC Tech Scene
As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.
Key Facts About NYC Tech
- Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
- Key Industries: Artificial intelligence, Fintech
- Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
- Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory
