Raylu

Senior Data Engineer

Reposted 24 Days Ago
In-Office
New York, NY, USA
165K-250K Annually
Senior level
The Senior Data Engineer will design and manage data pipelines for massive datasets, ensuring quality, cost management, and effective data operations. Responsibilities include overseeing data flows, monitoring data quality, and managing vendor partnerships.
The summary above was generated by AI
Position Summary
We’re hiring a Senior Data Engineer to own data at truly massive scale. You’ll design and run pipelines that clean, enrich, and serve data spanning hundreds of attributes across 80M+ companies and 800M+ people. The role blends classic data engineering with data operations, vendor/BPO orchestration, and data partnerships.

What we’re looking for
  • Core stack: Python, Dagster, DuckDB
  • Pipelines at scale: Build resilient ELT/ETL with strong contracts, idempotency, and lineage.
  • Data operations: Set quality bars, manage BPO workflows, and run SLAs with external data partners.
  • Serving & access: Make data ready for production use through serving infrastructure, documentation, and SLAs for internal consumers.
  • Cost & performance: You tune storage/compute and keep a sharp eye on unit economics.
  • Opinionated: You have a deep understanding of the technology landscape and make both high-level system and granular code design decisions based on understanding rather than preference, diving deep on unfamiliar patterns to build the best product.

Responsibilities
  • Own end-to-end data flows: ingestion, normalization, entity resolution, enrichment, and delivery.
  • Stand up monitoring for freshness, completeness, and accuracy; drive RCA and prevention.
  • Build internal tools that make data discoverable and usable by engineering and product.
  • Recruit, onboard, and manage BPO vendors; negotiate and run data partnerships.

Nice to have
  • Experience with big data, columnar storage formats, vector indexes, and privacy/compliance in data products.

Logistics
  • Compensation: $165K-$250K salary, plus $100K-$200K equivalent in new-hire equity (4-year vest)
  • Location: New York City. We are a fully in-office team working out of Midtown Manhattan Monday through Friday. We allow for WFH days when anyone is traveling, but we do not allow for permanent remote work.
  • Benefits:
    • Generous health, dental, and vision insurance
    • 401k with 3% automatic contribution (no vesting)
    • Paid Lunches
    • Wellness and Citi Bike benefit
