Aaru Logo

Aaru

Software Engineer, Data Integration

Posted Yesterday
Be an Early Applicant
In-Office
New York, NY, USA
250K-450K Annually
Mid level
In-Office
New York, NY, USA
250K-450K Annually
Mid level
Build and maintain scalable pipelines to ingest, clean, and integrate large multimodal datasets. Own ingestion across APIs, files, cloud storage, and warehouses. Design linkage, entity resolution, deduplication, and schema harmonization. Implement data quality checks, validation, and documentation. Collaborate with engineering, research, and deployment teams to prepare integrated data for simulations and evaluate new data sources.
The summary above was generated by AI
ABOUT AARU

Aaru operates at the bleeding edge of predictive intelligence, using AI to simulate and predict human behavior at scale. By generating and deploying instances of artificial intelligence that mirror humans, called agents, Aaru simulates entire populations with unprecedented accuracy. Our partners work with us for many reasons: they leverage Aaru to refine strategy in volatile geopolitical climates, cut new product innovation timelines from months to minutes, and deploy marketing campaigns that win in an era where consumers have never been harder to understand. We provide organizations with invaluable foresight, empowering them to anticipate outcomes and proactively make the right decisions at the right time, every time.

We’re a small, dedicated, mission-driven team and we intend to stay that way. We believe the best work happens when exceptionally talented people are given ownership, trust, and the space to operate without bureaucratic friction. We work with urgency and intellectual honesty and expect new team members to match our velocity. We seek individuals who thrive at the frontier, who push beyond conventional limits, who bring curiosity and conviction in equal measure, and who want their work to have demonstrable impact in the world. If you’re energized by the idea of a small team doing things that feel impossible, let’s build together.

ABOUT THE ROLE

As a Data Integration Specialist, you will build and maintain the data foundation that powers Aaru’s simulations. You will work across large internal and third-party datasets, designing reliable integration workflows, and ensuring that data can be linked, queried, and trusted at scale. This role sits at the intersection of data engineering and architecture and is critical to how Aaru produces predictive intelligence.

RESPONSIBILITIES
  • Build and maintain scalable pipelines to ingest, clean, and integrate large multimodal datasets

  • Own data ingestion across APIs, flat files, cloud storage, and data warehouses

  • Design workflows for linkage, entity resolution, deduplication, and schema harmonization across imperfect or incongruent datasets

  • Work with engineering, research, and deployment teams to make integrated data usable for simulation ingestion

  • Establish and monitor data quality checks, validation logic, and documentation across datasets and pipelines

  • Help evaluate new data sources and determine how they can be joined with existing data assets

YOU MAY BE A FIT IF
  • You have 3+ years of experience in data integration, data engineering, ETL/ELT, or a similar role involving large-scale datasets

  • You have hands-on experience working with messy, high-volume data (>100 TB) and know how to build systems that remain reliable at scale

  • You are highly fluent in SQL and Python, and comfortable working across modern data infrastructure such as Snowflake, BigQuery, Databricks, or similar tools

  • You have strong judgment around data quality and know how to preemptively identify inconsistencies, edge cases, and integration risks

STRONG CANDIDATES MAY ALSO
  • Have experience with alternative data, (transaction data, clickstream, geospatial, etc) either from a hedge fund or data marketplace lens

  • Have experience building matching or entity-resolution systems across fragmented or noisy identifiers

  • Have familiarity with privacy, compliance, and data licensing considerations when working with sensitive or third-party data

  • Have worked closely with researcher or product teams to turn raw data from disjoint sources into accessible structured database

  • Have a background in statistics and familiarity with sampling biases, bot-detection, imputation, and standard data quality metrics

LOCATION

This role is based in New York City. Aaru is an in-person company, working 5 days a week in office. Candidates are expected to be located within the New York City metropolitan area or open to relocation.

BENEFITS

At Aaru, we take care of our people. In addition to a competitive base salary and equity participation, we offer comprehensive medical, vision, and dental coverage, visa sponsorship and relocation support, and various other benefits and perks.

Similar Jobs

Yesterday
Remote or Hybrid
United States
83K-154K Annually
Mid level
83K-154K Annually
Mid level
Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics
Support design, development, testing, and maintenance of data integration solutions for the OneGL finance environment. Help build and troubleshoot ETL/data pipelines, execute tests, document designs, assist with enhancements and production support, and collaborate with senior engineers and cross-functional teams to support month-end close processes and deployment activities.
Top Skills: AgileAmazon RedshiftAPIsAws GlueAws LambdaAws S3Bi ToolsCi/CdDevOpsEtl ToolsPythonSQLTesting Tools
Yesterday
Remote or Hybrid
United States
106K-197K Annually
Senior level
106K-197K Annually
Senior level
Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics
Design, develop, integrate, test, and optimize scalable data integration solutions and pipelines for the OneGL finance environment. Support production users and month-end close processes, troubleshoot issues, develop test scripts, and document architectures. Collaborate with technical and business stakeholders, perform gap analysis, and support implementation and post-implementation activities.
Top Skills: Amazon RedshiftAPIsAws GlueAws LambdaAws S3Bi ToolsCi/CdDevOpsEtl ToolsPythonSap FioriSap S4/HanaSQL
9 Days Ago
In-Office
New York, NY, USA
156K-214K Annually
Senior level
156K-214K Annually
Senior level
Digital Media • Gaming • News + Entertainment • Sports
The Lead Software Engineer will design, develop, and support scalable cloud-based messaging and integration platforms, provide technical leadership, and ensure performance and reliability across platforms, primarily using Java, Spring Boot, and AWS services.
Top Skills: Apache KafkaAWSCi/CdJavaMicroservicesSpring BootTerraform

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account