Child Mind Institute

Data Engineer

Posted 4 Days Ago
Hybrid
New York, NY, USA
119K-150K Annually
Senior level
Build and maintain scalable, secure data pipelines and storage for multimodal clinical and NLP data; generate and process synthetic data; enable real-time model evaluation and monitoring; ensure data quality, security, compliance; create dashboards and documentation to support AI research.
The summary above was generated by AI

About Child Mind Institute

We're dedicated to transforming the lives of children and families struggling with mental health and learning disorders by giving them the help they need. We've become the leading independent nonprofit in children's mental health by providing gold-standard evidence-based care, delivering educational resources to millions of families each year, training educators in underserved communities, and developing tomorrow's breakthrough treatments.

Position Details:

As part of the Center for Data Analytics, Innovation, and Rigor team, you will report to the Rubric Engineering and Measurement Specialist. You will develop infrastructure to support large-scale AI evaluation frameworks. You will design scalable data pipelines for generating and processing synthetic data, implement secure data storage solutions, and create infrastructure for real-time model evaluation and monitoring. You will use common frameworks, platforms, and languages, such as Python, SQL, GitHub, containerization tools (e.g., Docker, Kubernetes), and cloud computing infrastructures (e.g., AWS, Azure), to build robust and scalable data infrastructure that supports our AI research initiatives.

This is an exempt, full-time, hybrid position located in our NYC headquarters office or other relevant location. This position requires a minimum of four (4) days per week in the office, on a schedule determined by your supervisor. The in-office requirement and schedule are subject to change based on the needs of the program and the organization.

You Will:

  • Create and maintain scalable data pipelines for efficient storage and retrieval of multimodal data, with particular emphasis on clinical, natural language, and multi-turn response data.

  • Create pipelines for data transformation, preprocessing, and management.

  • Ensure data quality, security, and compliance with privacy regulations for handling sensitive data.

  • Perform quality assurance of pipelines/processes to maintain integrity throughout the data lifecycle.

  • Create interactive visualizations and dashboards to communicate data insights and pipeline performance metrics.

  • Write documentation and relevant text for scientific, clinical, or public dissemination of knowledge.

  • Perform additional job-related duties as assigned.

You Have:

  • Master's degree in Neuroscience, Psychology, Engineering, Computer Science or equivalent combination of education and experience is required.

  • 5+ years of experience in data analysis and data science fundamentals (e.g., algorithms, data structures, data visualization, machine learning), preferably in a clinical or research setting.

  • 5+ years of experience in at least one scientific programming language (e.g., Python, R, MATLAB) and related toolboxes or frameworks (e.g., Tidyverse, SciPy, scikit-learn, Polars, PyTorch) is required.

  • 5+ years of experience working in a Linux environment, using version control systems (e.g., GitHub), and software virtualization platforms (e.g., Docker).

  • 5+ years of practical experience in Extract, Transform, Load (ETL) processes and database management languages (SQL, NoSQL), and familiarity with associated cloud computing services and frameworks (AWS, Azure, Terraform).

Our Benefits


Our great compensation package and benefits include medical insurance, 401(k), paid parental leave, dependent care, discounted tickets and entertainment perks programs. For more information about our benefits, please visit our employee benefits website.

Pay Range

The salary range for the position is posted. Factors such as a candidate's work experience, education/training, job-related skills, and internal peer equity, as well as market and business considerations, affect the salary offered within this range. In addition, this salary may be subject to a geographic adjustment (according to a specific city and state and depending on the role) if authorization is granted to work outside of the location listed in this posting.

EEO Disclaimer

Child Mind Institute is committed to fostering an inclusive and equitable workplace where all individuals are treated with respect and dignity. We are proud to be an equal opportunity employer and prohibit discrimination and harassment of any kind.

We provide equal employment opportunities to all employees and applicants for employment, regardless of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), sexual orientation, gender identity, gender expression, age, national origin, ancestry, citizenship status, marital status, military or veteran status, physical or mental disability, genetic information, medical condition, or any other characteristic protected by applicable federal, state, or local laws.

In compliance with California law, we also prohibit discrimination based on reproductive health decision-making, status as a victim of domestic violence, sexual assault, or stalking, or any other category protected by the California Fair Employment and Housing Act (FEHA). In New York, we extend this prohibition to include status as a victim of domestic violence, familial status, or any other characteristic protected by the New York State Human Rights Law (NYSHRL).

Child Mind Institute is dedicated to ensuring accessibility and reasonable accommodations for individuals with disabilities or medical conditions. If you require an accommodation to participate in the application process or perform your job, please contact our HR Department at [email protected]

This policy applies to all aspects of employment, including recruitment, hiring, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, benefits, and training.

HQ

Child Mind Institute New York, New York, USA Office

215 E 50th St, New York, NY, United States, 10022

Child Mind Institute New York, New York, USA Office

286 Malcolm X Blvd., 2nd Floor, New York, NY, United States, 10027

Child Mind Institute New York, New York, USA Office

1110 South Avenue, Suites 79 & 80, New York, NY, United States, 10314
