Kepler  Logo

Kepler

Data Platform Engineer

Posted 3 Hours Ago
Be an Early Applicant
In-Office
New York City, NY
250K-320K Annually
Expert/Leader
In-Office
New York City, NY
250K-320K Annually
Expert/Leader
The Data Platform Engineer will architect and build a foundational data platform for AI, managing ingestion systems, data quality, and mentorship while working with diverse data types and technologies.
The summary above was generated by AI

Kepler: AI you can trust and verify.

Every AI tool has the same flaw: the model touches the data. It guesses numbers, fabricates sources, and gives you a different answer every time you ask. For the people making million-dollar decisions, that's not a feature gap. It's a dealbreaker.

We built the architecture that makes hallucination structurally impossible. AI interprets your intent. Deterministic code retrieves every figure from source documents. The model never produces a number, so it can't get one wrong. Every output traces to a filing, a page, a line item. Every calculation shows its formula. Every answer is defensible.

Live in production. 950K+ SEC filings. 14K+ companies. 40M+ documents. 27 global markets. Trusted by firms that don't get to be wrong.

The architecture is domain-independent. Finance is first because the pain is sharpest. Healthcare, legal, insurance are next. Same system, new data sources. We're not building a finance product. We're building the verification layer for the entire AI stack.

Founded by Vinoo Ganesh (7 yrs Palantir, Head of Business Engineering at Citadel) and Dr. John McRaven (11 yrs Palantir, created the analytics engine behind $100M+ contracts with BP and Airbus, Ph.D. Physics). Backed by the founders of OpenAI, Facebook AI Research, MotherDuck, dbt, and Outerbounds.

The Role

You'll architect the foundational data platform that powers Kepler's AI research experience. Financial data is fragmented, messy, and comes in every format imaginable: SEC filings, earnings transcripts, market data feeds, research reports, live audio, internal documents. You'll own the architecture that ingests, structures, and unifies all of it into a single coherent system where every answer traces back to its source.

This is a greenfield build. You'll define the storage technologies, search and retrieval systems, indexing strategies, and observability tools that become the foundation for everything we do. You'll drive technical direction, mentor engineers, and make architectural decisions that shape the platform for years to come.

This role is for engineers who want to build the data infrastructure for the AI era, not another dashboard or data warehouse.

Within your first 90 days, you will:

  • Own and ship a major data pipeline end-to-end

  • Make foundational technology decisions that shape platform architecture

  • Build ingestion systems that power real financial research workflows

  • Establish data engineering patterns and best practices for the team

What You'll Do
  • Architect the data platform: Define storage technologies, indexing strategies, search and retrieval systems, and observability tools from first principles. Drive technical direction and make high-stakes architecture decisions.

  • Build ingestion pipelines: Design systems that ingest data from dozens of heterogeneous sources: SEC filings, earnings transcripts, market data, research reports, live audio, internal documents. Structured, unstructured, and everything in between.

  • Build semantic layers: Create the mapping between raw data and precise definitions that powers our platform. Normalize entities across sources, resolve ambiguity, and ensure the same concept means the same thing everywhere.

  • Build for AI and analytics: Infrastructure that serves both traditional query performance and AI-native requirements: document processing, embedding pipelines, vector search, retrieval systems that pull the right context from millions of documents in milliseconds.

  • Build provenance systems: Every number traces to a source document, section, and disclosure. Full lineage that satisfies institutional compliance and makes our AI trustworthy.

  • Own data quality: Observability, monitoring, validation, and governance. Set the standard for data reliability across the platform.

  • Mentor and grow the team: Code reviews, architectural guidance, and technical mentorship for engineers.

  • Ship with production excellence: Comprehensive testing, monitoring, deployment pipelines. Set the bar for engineering quality.

What We're Looking For
  • 10+ years of data engineering experience building enterprise data platforms from scratch

  • Data architecture: Proven track record designing and scaling ingestion, storage, transformation, and retrieval systems

  • Diverse data types: Deep experience with structured, unstructured, and semi-structured data. Bonus if you've worked with document processing, audio, or financial data

  • Modern data stack: Strong opinions about storage technologies, indexing strategies, orchestration tools, and observability

  • AI infrastructure: Curiosity about vector databases, embedding pipelines, and retrieval systems. You don't need to be an ML engineer, but you want to work at the intersection

  • Technical leadership: Experience driving architectural decisions and mentoring engineers

  • Practices: Git workflows, CI/CD, automated testing, data quality frameworks

  • Systems thinker who cares about how ingestion affects transformation, how transformation affects governance, how governance affects what's possible downstream

  • Strong communicator who can articulate technical trade-offs to engineering and business stakeholders

  • Thrives in fast-paced environments with high ownership

  • Financial services experience preferred but not required

Don't check every box? Apply anyway. We prioritize speed of learning, problem-solving skills, attention to detail, and drive to build world-class data infrastructure.

Mentorship & Growth

Direct collaboration with founders who built Palantir Foundry and data infrastructure at Citadel:

  • Weekly 1:1s with founders

  • Deep architectural reviews and guidance on data system design

  • Clear growth path toward staff engineering and leading the data platform team

  • Shape the data platform that becomes the ground truth for AI

Our Technical Stack
  • Frontend: React, Typescript, Vite, Tailwind, Radix, TanStack, Zustand

  • Backend: Rust, Node.js, Python, PostgreSQL, Redis

  • AI/ML: OpenAI, Anthropic, MCP SDK,

  • Infrastructure: AWS (S3, RDS), Docker, Temporal, Kubernetes, Dataflow

  • Tools: Git, GitHub, Pulumi, Auth0, SharePoint

Benefits
  • Comprehensive medical, dental, vision, 401k, insurance for employees and dependents

  • Automatic coverage for basic life, AD&D, and disability insurance

  • Daily lunch in office

  • Development environment budget - latest MacBook Pro, multiple monitors, ergonomic setup, and any development tools you need

  • Unlimited PTO policy

  • "Build anything" budget - dedicated funding for whatever tools, libraries, datasets, or infrastructure you need to solve technical challenges, no questions asked

  • Learning budget - attend any conference, course, or program that makes you better at what we're building

Our Operating Principles
  • Forward-Deployed with Product DNA: We own customer outcomes while building a product company. That means embedding, iterating, and deploying where our customers are. We don't win if they don't win.

  • Extreme Ownership: Big vision, shared ownership. If you notice a problem, you own it. Authority comes from initiative, not job titles. Once you step up, you're accountable for the outcome.

  • Production-First Engineering: We design for critical workloads from day one. Durable execution, blue/green deploys, automated rollbacks, continuous delivery with end-to-end observability. Every change lands safely and stays resilient under real-world load.

  • Trust as the Default: People do their best work when confidence is mutual. We show our work, keep our promises, and flag risks before they bite. Trust isn't an aspiration. It's the baseline.

  • Keep Raising the Bar: We block time for training, code-health sprints, and deep-dive tech talks. A sharper team and a cleaner stack pay compounding dividends. Continuous learning isn't a perk. It's part of the job.

Kepler is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind. We are committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.

Top Skills

Anthropic
Auth0
AWS
Dataflow
Docker
Git
Git
Kubernetes
Mcp Sdk
Node.js
Openai
Postgres
Pulumi
Python
Radix
React
Redis
Rust
Sharepoint
Tailwind
Tanstack
Temporal
Typescript
Vite
Zustand

Kepler New York, New York, USA Office

22 Vanderbilt Ave, New York, New York, United States, 10017 4611

Similar Jobs at Kepler

3 Hours Ago
In-Office
New York City, NY, USA
150K-200K Annually
Mid level
150K-200K Annually
Mid level
Fintech • Software
As a Software Engineer at Kepler, you'll develop backend systems, manage data pipelines, and integrate AI for financial applications, ensuring production excellence and scalability.
Top Skills: AWSDataflowDockerGitGitKubernetesNode.jsOpenaiPostgresPythonReactRustTypescript
3 Hours Ago
In-Office
New York City, NY, USA
Expert/Leader
Expert/Leader
Fintech • Software
Lead the AI research agenda at Kepler, focusing on trustworthy AI for enterprise decisions. Oversee research on agentic systems and evaluation frameworks, and manage a research team. Ensure production deployment of innovative AI solutions based on real financial data.
Top Skills: AWSDockerKubernetesNode.jsPostgresPythonRadixReactRedisRustTailwindTanstackTemporalTypescriptViteZustand
3 Hours Ago
In-Office
New York City, NY, USA
200K-250K Annually
Senior level
200K-250K Annually
Senior level
Fintech • Software
You'll architect and build core systems for Kepler's AI platform, manage backend services, mentor engineers, and drive technical decisions.
Top Skills: AnthropicAWSCi/CdDockerGitKubernetesNode.jsOpenaiPostgresPythonRedisRust

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account