Arine Logo

Arine

Staff Data Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in United States of America
170K-185K Annually
Expert/Leader
Remote
Hiring Remotely in United States of America
170K-185K Annually
Expert/Leader
Lead the design and optimization of scalable data ingestion pipelines, develop reusable ETL components, and mentor junior engineers while leveraging expertise in Python and AWS for the Arine platform.
The summary above was generated by AI

Based in San Francisco, Arine is a rapidly growing healthcare technology and clinical services company with a mission to ensure individuals receive the safest and most effective treatments for their unique and evolving healthcare needs. 

Frequently, medications cause more harm than good. Incorrect drugs and doses costs the US healthcare system over $528 billion in waste, avoidable harm, and hospitalizations each year. Arine is redefining what excellent healthcare looks like by solving these issues through our software platform (SaaS). We combine cutting edge data science, machine learning, AI, and deep clinical expertise to introduce a patient-centric view to medication management, and develop and deliver personalized care plans on a massive scale for patients and their care teams.

Arine is committed to improving the lives and health of complex patients that have an outsized impact on healthcare costs and have traditionally been difficult to identify and address. These patients face numerous challenges including complicated prescribing issues across multiple medications and providers, medication challenges with many chronic diseases, and patient issues with access to care. Backed by leading healthcare investors and collaborating with top healthcare organizations and providers, we deliver recommendations and facilitate clinical interventions that lead to significant, measurable health improvements for patients and cost savings for customers. 

Why is Arine a Great Place to Work?:

Outstanding Team and Culture - Our shared mission unites and motivates us to do our best work. We have a relentless passion and commitment to the innovation required to be the market leader in medication intelligence.

Making a Proven Difference in Healthcare - We are saving patient lives, and enabling individuals to experience improved health outcomes, including significant reductions in hospitalizations and cost of care.

Market Opportunity - Arine is backed by leading healthcare investors and was founded to tackle one of the largest healthcare problems today. Non-optimized medications therapies which cost the US 275,000 lives and $528 billion annually.

Dramatic Growth - Arine is managing more than 18 million lives across prominent health plans after only 4 years in the market, and was ranked 236 on the 2024 Inc. 5000 list and was named the 5th fastest-growing company in the AI category.

The Role:

As a key technical leader and team architect working in a fast-paced environment, you will drive the design, development, and optimization of scalable data ingestion pipelines within the Arine platform. Leveraging expert-level proficiency in Python and AWS, you will architect solutions that handle diverse file types and large-scale healthcare datasets. You will have a direct impact on building reusable, configurable tools for handling data needs for the entire company.

A key part of this role is leading the team’s transformation toward AI-driven software development - shifting engineers from being primary builders of code to skilled directors and reviewers of AI-generated work.

What You'll be Doing:

  • Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services
  • Architecting and implementing scalable data ingestion pipelines, including incremental ingestion strategies for large-scale healthcare datasets
  • Developing reusable, configuration-driven, containerized pipeline components and toolsets that diverse engineering profiles can use and maintain
  • Work collaboratively with cross-functional teams to ensure their data requirements are met through ETL components
  • Designing and maintaining data transformation pipelines using dbt, including utilizing core concepts like macros, incremental models and dbt tests
  • Building monitoring and alerting systems for data ingestion processes and pipeline health
  • Applying software engineering best practices including test-driven development and modular design to data infrastructure, including refactoring existing ingestion processes to improve scalability and operational efficiency
  • Provide technical guidance, mentorship to junior engineers, and promote best practices and coding standards
  • Champion AI-assisted development across the team - establishing norms, workflows, and expectations for using AI coding tools (e.g., Claude Code, Cursor, Copilot) to generate, iterate, and ship production-quality code
  • Model the “builder to reviewer” shift - demonstrating how senior engineers direct AI agents to produce full solutions, then apply rigorous review, testing, and judgment to own the output
  • Identify opportunities to automate repetitive engineering work using LLMs and AI tooling, including pipeline scaffolding, boilerplate generation, data transformation logic, and documentation
  • Author and support high-quality technical documentation, assisting junior engineers in doing the same

Who You Are and What You Bring:

  • 10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure
  • A track record of building automated, production-grade ETL processes using Python and DBT SQL
  • Strong understanding of ETL/ELT frameworks and distributed data processing
  • Demonstrated hands-on experience building software with AI coding tools - not just autocomplete, but directing AI agents to generate complete solutions and applying disciplined review and ownership of the output
  • A genuine conviction that AI-augmented development is the future of software engineering, paired with the judgment to validate, test, and take accountability for AI-generated code
  • Experience or strong interest in integrating LLMs into engineering workflows beyond development assistance - such as automating data quality checks, generating pipeline logic, or surfacing anomalies
  • Proven ability to handle and process varied file types and formats, including healthcare standards such as HL7, 834, 837, and NCPDP
  • Demonstrated success integrating and consolidating data from diverse source systems into a unified repository, including EHR and claims systems, via both file-based and API integrations
  • Comfort working with large-scale datasets (10GB+), with strong capability implementing incremental processing and change data capture (CDC) methodologies
  • Extensive background designing scalable data architectures in AWS environments
  • Solid grounding in software engineering principles, including test-driven development, loose coupling, single responsibility, and modular design
  • Hands-on familiarity with containerization (Docker, Kubernetes) and proven ability to build configuration-driven systems that diverse engineering profiles can operate without code changes
  • A passion for building new data infrastructure and continuously improving existing systems with robustness, maintainability, and operational excellence
  • Familiarity with healthcare data and regulatory environments (HIPAA) as a plus
  • Strong written and verbal communication skills, with comfort partnering across technical and non-technical stakeholders to explain complex infrastructure concepts

Remote Work Requirements:

  • An established private work area that ensures information privacy
  • A stable high-speed internet connection for remote work
  • This role is remote, but you will be required to come to on-site meetings multiple times per year. This may be in the interview process, onboarding, and team meetings

Perks:

Joining Arine offers you a dynamic role and the opportunity to contribute to the company's growth and shape its future. You'll have unparalleled learning and growth prospects, collaborating closely with experienced Clinicians, Engineers, Software Architects, and Digital Health Entrepreneurs.

The posted range represents the expected salary for this position and does not include any other potential components of the compensation package (including bonus and equity), benefits, and perks. Ultimately, the final pay decision will consider factors such as your experience, job level, location, and other relevant job-related criteria. The salary range for this position is: $170,000-185,000/year.

Job Requirements:

  • Ability to pass a background check
  • Must live in and be eligible to work in the United States

Information Security Roles and Responsibilities:

All staff at Arine are expected to be part of its Information Security Management Program and undergo periodic training on Information Security Awareness and HIPAA guidelines. Each user is responsible to maintain a secure working environment and follow all policies and procedures. Upon hire, each person is assigned and must complete trainings before access is granted for their specific role within Arine.

Arine is an equal opportunity employer. We are committed to creating a diverse and inclusive workplace where all employees are treated with fairness and respect. We do not discriminate on the basis of race, ethnicity, color, religion, gender, sexual orientation, age, disability, or any other legally protected status. Our hiring decisions and employment practices are based solely on qualifications, merit, and business needs. We encourage individuals from all backgrounds to apply and join us in our mission.

Check our website at https://www.arine.io. This is a unique opportunity to join a growing start-up revolutionizing the healthcare industry!

Job Offers: Arine uses the arine.io domain and email addresses for all official communications. If you received communication from any other domain, please consider it spam. 

Note to Recruitment Agencies: We appreciate your interest in finding talent for Arine, but please be advised that we do not accept unsolicited resumes from recruitment agencies. All resumes submitted to Arine without a prior written agreement in place will be considered property of Arine, and no fee will be paid in the event of a hire. Thank you for your understanding.

Top Skills

AWS
Dbt
Docker
Kubernetes
Python
SQL

Similar Jobs

9 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
170K-200K Annually
Expert/Leader
170K-200K Annually
Expert/Leader
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Staff Data Engineer will design and implement a unified semantic data layer, integrating heterogeneous data sources and enabling AI interactions while ensuring security and compliance across the organization.
Top Skills: AerospikeAmazon S3Amazon SqsApache KafkaBigQueryDelta LakeDynamoDBGraphQLIcebergMongoDBMySQLPostgresRedshiftRestSnowflake
Yesterday
Remote
United States
180K-220K Annually
Senior level
180K-220K Annually
Senior level
Healthtech • Mobile • Analytics
The Staff Data Engineer will enhance the data ecosystem at WeightWatchers by developing data pipelines, collaborating cross-functionally, and ensuring data quality while leading projects and technical initiatives.
Top Skills: AirflowArgo CdDatadogGithub ActionsLookerMonte CarloPrefectPythonSnowflakeSQL
9 Days Ago
Remote or Hybrid
176K-308K Annually
Senior level
176K-308K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Staff Data Platform Software Engineer will enhance and maintain the Veza Access Graph, focusing on backend services, APIs, and cross-functional collaboration to optimize identity security features.
Top Skills: AWSAzureDockerGoGoogle Cloud PlatformKotlinKubernetesNeo4J

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account