Traversal Logo

Traversal

AI Engineer - Site Reliability Researcher

Reposted 10 Days Ago
In-Office
New York, NY
150K-300K Annually
Junior
In-Office
New York, NY
150K-300K Annually
Junior
As an AI Site Reliability Researcher, you will ensure the scalability and reliability of our AI platform, design systems for observability, manage deployments, and develop CI/CD pipelines for hybrid environments.
The summary above was generated by AI
About Traversal

Traversal is the AI Site Reliability Engineer (SRE) for the enterprise—already trusted by some of the largest companies in the world to troubleshoot, remediate, and even prevent the most complex production incidents. Our mission is to free engineers from endless firefighting and enable them to focus on creative, high-impact work. 

Our roots remain deeply embedded in AI research, and we’re channeling that scientific rigor and creativity into building the premier AI agent lab for the enterprise. Hence, what we’re proudest of is assembling the most talented yet nicest group of individuals, including researchers from MIT, Harvard, and Berkeley, to world-class engineers from industry: Citadel Securities, Cockroach Labs, Datadog, DE Shaw, ServiceNow, Glean, Perplexity, Pinecone, and more, to take on one of the hardest problems for AI to solve. Without the entire team, none of this would be possible.

The Role

As an AI Site Reliability Researcher, you’ll play a central role in ensuring the scalability, reliability, and observability of our AI platform. This is a high-impact, cross-functional role where you’ll design systems and processes that keep our AI-driven infrastructure healthy and performant.

We’re entering a phase of rapid growth and scale, driven by the needs of large enterprise customers. That means pressure on everything from deployments to developer workflows. We’re building our own distributed systems, maturing our CI/CD pipelines, and managing complex hybrid environments (SaaS and on-prem). You’ll play a foundational role in establishing the SRE practices that allow us to scale thoughtfully and reliably.

In this role, you’ll define how we do change management across diverse deployment environments, build internal observability from the ground up, and help bring structure to systems that are evolving quickly. You’ll also be a hands-on user of Traversal — your feedback will shape the product directly. And while your focus will be reliability, you’ll collaborate closely with our infra and AI agent teams, with opportunities to influence how AI integrates with real-world production environments.

Responsibilities
  • Brains Of The Product: Distilling SRE Knowledge into Agentic workflows.
  • System Design & Architecture: Build scalable and resilient infrastructure to support AI observability agents in both cloud and on-prem environments.
  • Observability: Built systems to monitor logs, metrics, and traces tied to deployments and developer activity. Power user of observability tools.
  • Incident Management: Define and lead our on-call and incident response processes, including alerting, debugging, and postmortems.
  • CI/CD & Deployment: Design and scale our in-house CI/CD systems to support safe, efficient rollouts across hybrid environments.
  • Infrastructure Automation: Own our infrastructure-as-code stack and improve automation across deployment and provisioning workflows.
Requirements
  • Experience as an SRE, infrastructure engineer or similar role in fast-paced environments.
  • Exceptional debugging skills across complex, distributed systems — proven ability to get to root cause quickly across varied tech stacks.
  • Strong systems design intuition — understands how observability tools fit into architecture and how to leverage them effectively in incident response.
  • Experience with observability tools (e.g., Datadog, Grafana, Prometheus, OpenTelemetry) and incident response.
  • Deep understanding of infrastructure automation and CI/CD systems.
  • Hands-on experience with Terraform, Kubernetes, and cloud environments (AWS or GCP).
  • Ability to debug distributed systems and drive system-level improvements.
  • Experience supporting hybrid cloud/on-prem deployments and complex change management.
Nice to Have
  • Familiarity with AI infrastructure or supporting ML/LLM workloads in production.
  • Background in developer productivity tooling or internal platform teams.
  • Prior experience building systems that connect infra events to developer workflows.
  • Exposure to agentic systems or AI observability platforms.
Compensation

We offer competitive compensation, startup equity, health insurance, and additional benefits. The U.S. base salary range for this full-time, in-person role in New York is $150,000–$300,000, plus equity and benefits. Our salary ranges are based on location, level, and role. Individual compensation is determined by experience, skills, and job-related knowledge.

Why You Should Join Us

We’ll make sure you’re fully supported with health insurance, a great tech setup, flexible time off, and plenty of in-office snacks. We offer competitive salary and equity packages, and take thoughtful consideration with every hire on our small, high-impact team.

Traversal is fully in-office, 5 days a week, based in New York near Madison Square Park. We have a collaborative, hard-working culture and are energized by building the future of AI-powered software maintenance.

Working here means owning meaningful parts of the product, having the flexibility to move fast, and learning constantly. This is a place to grow your career, make a real impact, and help define a new category of infrastructure software.

Top Skills

AI
AWS
Datadog
GCP
Grafana
Kubernetes
Opentelemetry
Prometheus
Sre
Terraform
HQ

Traversal New York, New York, USA Office

New York, New York, United States

Similar Jobs

32 Minutes Ago
Hybrid
Melville, NY, USA
60K-80K Annually
Junior
60K-80K Annually
Junior
Information Technology • Insurance • Software
As a QA Analyst II, you'll design and implement test plans, ensure software meets standards, and work closely with development teams to enhance product quality.
Top Skills: Ca AgileOop Coding ConceptsQtestSafe
35 Minutes Ago
In-Office or Remote
4 Locations
Mid level
Mid level
Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
The Paid Media Manager will drive the strategy and execution of content syndication and media buying, managing vendor relationships and optimizing lead generation efforts.
Top Skills: Marketo
44 Minutes Ago
Remote or Hybrid
69 Locations
124K-280K Annually
Senior level
124K-280K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead the design and implementation of innovative data models for finance and actuarial use cases, mentoring team members and fostering client interactions.
Top Skills: SQL

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account