Jellyfish is the backbone for elite engineering organizations, and our data infrastructure needs to be as high-performing and insightful as the teams we serve. We are looking for a Staff/Lead Data Architect to help us design, automate, and scale the next generation of our Jellyfish data platform. You’ll be responsible for maturing our core data models, automating environment boundaries, and driving advanced observability and cost-attribution deeper into our data pipeline architecture. If you view manual data intervention as a technical debt to be solved and want to work in an environment where your architectural decisions directly impact how the world’s best engineering leaders measure their productivity, you’re the perfect fit.
What you’ll actually be doing:Architectural Evolution & Blueprinting – You’ll own the blueprint for the next-generation Jellyfish data platform. You'll tackle our existing data footprint, refactoring pipelines and structures into highly efficient, scalable patterns (like Medallion-style schemas or unified semantic layers).
Automated Data Governance – You’ll design and automate strict, code-driven environment isolation boundaries. You'll ensure dev, staging, and production data catalogs (and their underlying cloud storage) never dangerously cohabitate, eliminating the risk of "fat-finger" data drops or PII leakage.
Orchestration & Compute Scaling – You’ll lead the modernization of our workflow orchestration and distributed compute engines. You’ll focus on slashing engine runtime overhead, eliminating API bottlenecks, and streamlining heavy parallelized or mapped data tasks.
Modern Integration Middleware – You'll partner with application teams to ensure our React frontends and backend services hit highly secure, cached API and Backend-for-Frontend (BFF) layers rather than querying raw data services directly, protecting our warehouses from concurrency spikes.
Proactive Data Observability & FinOps – You’ll build and maintain granular data-quality monitors and cost-allocation frameworks. You won't just track overall warehouse spend; you’ll implement systems to map execution cost and token usage directly down to the tenant, team, or user level.
Data Tooling Fluency – You have deep, production-level experience with Python, advanced SQL, and modern data stack essentials. You are deeply familiar with programmatic orchestrators (like Prefect, Dagster, or Airflow) and modern data validation engines (like Pydantic v2).
Catalog & Warehouse Practitioner – You have hands-on mastery of enterprise-scale data platforms and governance layers (e.g., Snowflake, Databricks Unity Catalog, BigQuery) and know exactly how to map environments to catalogs and data quality to schemas.
Automation Mindset – You look at a manual data backfill or a clicked-together database permission and immediately think about how to automate it via Infrastructure-as-Code (Terraform) or programmatic workflows.
Collaborative Systems Thinker – You don’t design in a vacuum. You are excellent at documenting data lineage, mentoring data engineers, and collaborating across DevOps and Product teams to align infrastructure with business goals.
Pragmatic Problem Solver – You know the difference between data quality stages and software development lifecycles. You know when a "perfect" distributed cluster is required and when a "good enough" cached view keeps the business moving.
You’ve survived (and thrived in) a rapidly scaling B2B SaaS startup handling massive multi-tenant data sets.
You have strong opinions on the future of Git-like data versioning and zero-copy cloning (e.g., Iceberg, Nessie).
You’ve managed complex cloud-billing attributions or scaled heavy LLM/vector-embedding data workloads and lived to tell the tale.
A list of job experiences and qualification requirements is great, but humility, a performance-driven attitude, and a team-player approach are most important to us. We love to have fun and win in the process. We only hire people who have a passion for building great companies in an environment where a sense of humor is a must.
Occasional travel may be required.
Applicants must be authorized to work for any employer in the US. We are unable to sponsor or take over sponsorship of an employment visa at this time.
Let’s talk about us!
This is all about you, but you want to know a little about us. Jellyfish enables leaders to effectively build AI-integrated engineering teams, align engineering decisions with business initiatives and deliver the right software efficiently and on time. AI tools alone won’t transform your org—Jellyfish shows you what’s working, what’s not, and how to build high-performing teams that know how to use AI the right way.
Similar Jobs at Jellyfish
What you need to know about the NYC Tech Scene
Key Facts About NYC Tech
- Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
- Key Industries: Artificial intelligence, Fintech
- Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
- Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

