Traversal

AI Engineer - AI Platform

Reposted 13 Days Ago
In-Office
New York, NY, USA
150K-300K Annually
Senior level
The AI Platform Engineer designs and builds systems for AI agent infrastructure and performance evaluation, driving AI capabilities for incident management.
The summary above was generated by AI

About Traversal

Traversal is the AI Site Reliability Engineer (SRE) for the enterprise—already trusted by some of the largest companies in the world to troubleshoot, remediate, and even prevent the most complex production incidents. Our mission is to free engineers from endless firefighting and enable them to focus on creative, high-impact work. 

Our roots remain deeply embedded in AI research, and we’re channeling that scientific rigor and creativity into building the premier AI agent lab for the enterprise. What we’re proudest of is assembling the most talented yet nicest group of individuals, from researchers from MIT, Harvard, and Berkeley to world-class engineers from industry (Citadel Securities, Cockroach Labs, Datadog, DE Shaw, Meta, Hebbia, Perplexity, Glean, Pinecone, and more), to take on one of the hardest problems for AI to solve. Without the entire team, none of this would be possible.

The Role

As an AI Platform Engineer at Traversal, you’ll work on the core foundations that make Traversal’s AI possible, spanning both agent infrastructure and evaluation systems.

  • Agent Infrastructure — Build the frameworks, orchestration layers, and developer tooling that power Traversal’s AI agents for root cause analysis, alert triage, and “chat with your infrastructure/telemetry.” This involves designing scalable distributed systems and abstractions (e.g., MCP servers, multi-agent orchestration, toolkits) that balance research flexibility with production reliability.
  • Evaluation — Define what “good” looks like for AI performance in the incident management domain. You’ll build live evaluation pipelines, automated scoring systems, and benchmarks; integrate evaluation into the developer lifecycle; and surface these insights to customers as a value-add.

This work combines research (agentic architectures, benchmarking, calibration, finetuning) with engineering (production-scale infra, APIs, distributed systems) to accelerate the entire AI loop: build → evaluate → improve → ship.

Responsibilities

  • Design and build agent frameworks, orchestration layers, and developer tooling for Traversal’s AI agents.
  • Architect scalable distributed systems to support real-time workloads over petabytes of heterogeneous telemetry data.
  • Build live evaluation pipelines, automated scoring systems, and benchmarks to measure and drive AI performance.
  • Integrate evaluation systems into the developer lifecycle to create a fast research-to-production loop.
  • Surface evaluation signals and benchmarks to customers as a core product capability.
  • Partner with research scientists to prototype and productionize agentic architectures.
  • Own observability, latency, and reliability for agents in production.
  • Evolve and scale the agent + evaluation platform as the backbone of Traversal’s AI systems.

Requirements

  • Strong system design skills for distributed systems.
  • Proven production-scale software engineering experience.
  • Experience with LLM-based applications and/or multi-agent systems.
  • Strong data modeling skills and a track record of writing clean, maintainable code.
  • Collaborative, impact-driven mindset and ability to work across research and engineering teams.

Nice to Have

  • Knowledge of software incidents and production SRE workflows.
  • Prior experience with AI benchmarking or evaluation systems.
  • Experience creating quantitative scoring systems or benchmarks in new problem domains.
  • Familiarity with observability stacks (logs, metrics, traces) and telemetry systems.
  • Background in agentic architectures, orchestration frameworks, or applied AI research.

Compensation

We offer competitive compensation, startup equity, health insurance, and additional benefits. The U.S. base salary range for this full-time, in-person role in New York is $150,000–$300,000, plus equity and benefits. Our salary ranges are based on location, level, and role. Individual compensation is determined by experience, skills, and job-related knowledge.

Why You Should Join Us

We’ll make sure you’re fully supported with health insurance, a great tech setup, flexible time off, and plenty of in-office snacks. We offer competitive salary and equity packages, and we give thoughtful consideration to every hire on our small, high-impact team.

Traversal is fully in-office, 5 days a week, based in New York near Madison Square Park. We have a collaborative, hard-working culture and are energized by building the future of AI-powered software maintenance.

Working here means owning meaningful parts of the product, having the flexibility to move fast, and learning constantly. This is a place to grow your career, make a real impact, and help define a new category of infrastructure software.

Top Skills

APIs
Distributed Systems
LLM-Based Applications
Telemetry Systems
HQ

Traversal New York, New York, USA Office

New York, New York, United States
