Genentech

Data Engineer

Reposted 8 Days Ago
In-Office
New York City, NY
128K-238K Annually
Mid level
As a Data Engineer, you'll implement and maintain the Therapeutic Molecule Registration platform, focusing on data pipelines, migrations, and collaboration with cross-functional teams.
The summary above was generated by AI

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche.
Advances in AI, data, and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) have demonstrated how these technologies accelerate R&D, leveraging data and novel computational models to drive impact. Seamless data sharing and access to models across gRED and pRED are essential to maximising these opportunities. The new Computational Sciences Center of Excellence (CoE) is a strategic, unified group whose goal is to harness this transformative power of data and Artificial Intelligence (AI) to assist our scientists in both pRED and gRED to deliver more innovative and transformative medicines for patients worldwide.

The Opportunity

At Genentech and Roche, we're at the forefront of a revolutionary transformation in drug discovery powered by AI and machine learning. Our "lab in the loop" strategy processes massive quantities of experimental data to train AI models that accelerate the discovery of new medicines. To enable this vision, we're seeking an exceptional Data Engineer to be part of the team building and maintaining our next-generation Therapeutic Molecule Registration (TMR) platform - a foundational component of our AI-driven drug discovery infrastructure, Lab-in-the-Loop (https://www.youtube.com/watch?v=cN1PxxQWoEc). This platform will serve as the central nervous system for managing and integrating molecular data across our global research organization, handling hundreds of billions of records and enabling unprecedented scale in virtual molecule design and testing. As the volume of AI-generated molecular designs grows exponentially, our TMR platform must evolve to become a high-performance, cloud-native system capable of supporting rapid iteration cycles between computational design and experimental validation. You will be instrumental in consolidating our molecule registration systems into a single, harmonized environment, unlocking the full potential of our data and accelerating the development of life-changing therapies. The ideal candidate will combine data engineering experience with an interest in chemical and biological data management systems.

You will work closely with our machine learning for drug development team, Genentech Research & Early Development (gRED) Drug Discovery teams, including the Antibody Engineering division, and other teams across the Roche family of companies to identify, strategize, and productionize high-impact applications from across the drug discovery and development pipeline. Genentech provides a dynamic and challenging environment for cutting-edge, multidisciplinary research in AI and drug discovery, including access to rich sources of data, close links to top academic institutions around the world, and collaboration with internal Genentech and Roche partners and research units.

In this role, you will:

  • Implement features of our TMR platform

  • Write clean, well-tested code following team standards

  • Build and maintain data pipelines

  • Facilitate data migration to TMR and production deployment

  • Participate in code reviews and technical discussions

  • Contribute to documentation and testing efforts

  • Collaborate with team members on technical solutions

Who you are

  • 3+ years of data engineering experience

  • Strong experience with PostgreSQL and Oracle

  • Skilled with at least one modern data toolkit (e.g., AWS Glue, dbt, Databricks)

  • Cloud platform exposure (preferably AWS)

  • Python programming skills

  • Strong testing practices

  • Bachelor's degree in Computer Science or related field (or equivalent experience)

Preferred

  • Experience with data modelling & schema design

  • Involvement in a database migration

  • Familiarity with scientific software

  • Interest in chemical/biological data systems

  • Experience with AWS services

Relocation benefits are available for this job posting

The expected salary range for this position, based on the primary location of New York, is $128,300 - $238,300. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Benefits

#ComputationCoE

#tech4lifeComputationalScience

Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to discrimination on the basis of protected veteran status or disability status, consistent with all federal, state, and local laws.

If you have a disability and need an accommodation in relation to the online application process, please contact us by completing this form: Accommodations for Applicants.

Top Skills

AWS
Databricks
dbt
Glue
Oracle
PostgreSQL
Python
