Medal Logo

Medal

Infra Engineer - API

Reposted 2 Days Ago
Be an Early Applicant
In-Office
New York City, NY, USA
250K-400K Annually
Mid level
In-Office
New York City, NY, USA
250K-400K Annually
Mid level
As Infra Engineer, you will build and manage low-latency APIs for high-traffic applications, focusing on GPU and Kubernetes infrastructure, ensuring reliability and scalability from hundreds to thousands of users.
The summary above was generated by AI
About General Intuition

We are the frontier research lab dedicated to building foundation models for environments that require deep spatial and temporal reasoning. For the past year, we've been pushing the forefront of AI across agents capable of navigating space and time, world models that provide training environments for those agents, and video understanding models with a focus on transfer to the real world.

We raised a seed round of $133M from General Catalyst and Khosla to discover the next generation of intelligence.

The Role

We're hiring an Infra Engineer to own General Intuition's API.

Our research team builds frontier models — agents that reason about space and time, world models, video understanding. Your job is to turn those models into a production API that developers love: low-latency, highly available, billing-grade reliable, and able to scale from our first hundred users to tens of thousands of concurrent ones.

You'll work directly with the founding team. You'll own the API end to end: the client libraries developers integrate with, how we receive frames from clients and stream actions back, how requests route to the right GPU, how sessions spin up and tear down, how k8s clusters get stood up in new regions, and how our GPU fleet scales.

This is a true generalist infrastructure role. We are not looking for a pure API person or a pure GPU person — we are looking for someone who is exceptional at both, and who wants to own the entire surface end-to-end.

Key Responsibilities
  • Own the video streaming protocol. Orchestrating how we receive frames from clients and route them to servers as efficiently as possible.

  • Own the runtime layer of our API. Stateful request routing, GPU session lifecycle, inference orchestration — the whole runtime stack.

  • Scale our k8s footprint across regions. Lead new regional deployments.

  • Own the GPU hosting strategy. Move us from dozens of GPUs today to potentially thousands (and beyond) without breaking the bank or the latency budget.

  • Drive latency and throughput. Own the inference-performance backlog

  • Partner with product engineering. Work closely on developer-facing reliability, observability, metering, and billing-grade uptime.

Qualifications

You almost certainly have:

  • A track record of personally scaling a high-traffic, low-latency API in production, whether at a gaming company, a video streaming company, a payments company, or a hyperscaler.

  • Deep k8s experience, including multi-region deployments.

  • Comfort with SLOs and capacity planning.

  • Strong ownership instinct — you've taken systems end-to-end, not just contributed to them.

Bonus points for any of:

  • Experience deploying streaming video or audio inference models (the dream hire).

  • Experience with low-latency game streaming or video streaming infra.

  • Experience scaling GPU fleets across providers (GCP, Coreweave, Lambda, etc.).

  • Experience with frontier model inference (LLMs, world models, multimodal).

  • Experience with on-device / edge inference (ExecuTorch, Core ML, etc.).

Our stack

  • GPUs: GCP today, Coreweave as we scale.

  • Orchestration: Kubernetes, multi-region.

  • Models: In-house frontier research — agents, world models, video understanding.

  • API surface: Client libraries in TypeScript, Python, Rust, and C

  • In-office, 5 days/week. NYC, Stockholm, London, Paris, or Geneva.

Benefits
  • Competitive salary and meaningful equity

  • Comprehensive medical, dental, and vision coverage

  • 401(k)

  • Wellhub membership for fitness and wellness

  • Mental health support through Spring Health and Headspace

  • Fertility and maternal health benefits

  • Paid parental leave

  • Generous PTO, 11 paid company holidays, and paid sick time

  • Daily meals and commuter benefits at our NYC HQ

  • Learning and development stipend

Benefits vary by country and employment type.

HQ

Medal New York, New York, USA Office

Upper West Side, New York, New York, United States, 10024

Similar Jobs

10 Minutes Ago
Remote or Hybrid
New York, NY, USA
162K-268K Annually
Expert/Leader
162K-268K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead global go-to-market and co-innovation with a strategic partner for Autonomous Workforce. Own the global partner account plan, drive cross-geo alignment, enable product/solution education, coordinate demand generation and field activation, govern performance via QBRs and scorecards, and expand partner capabilities to grow pipeline and revenue across routes-to-market.
Top Skills: AICloudCRMItomItsmSaaSSamServicenowServicenow AiSpm
11 Minutes Ago
Remote or Hybrid
New York, NY, USA
174K-258K Annually
Expert/Leader
174K-258K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Drive and scale ServiceNow's AWS business across Banking, Insurance, and Wealth Management in the Americas. Own quota, pipeline forecasting, and executive relationships with customers and AWS. Lead virtual, cross-functional teams to develop joint solutions, negotiate complex contracts, build AWS field mindshare, ensure customer outcomes, and manage pipeline reporting and go-to-market activities. Up to 50% travel.
Top Skills: AWSAws Cloud PractitionerNowsellServicenowValue Prompter
11 Minutes Ago
Remote or Hybrid
New York, NY, USA
140K-231K Annually
Expert/Leader
140K-231K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead global partner GTM and co-innovation for the Autonomous Workforce practice. Own partner account plans, drive cross-geo alignment, enablement, demand generation, operational playbooks, and governance to grow pipeline, revenue, and partner maturity across ServiceNow-aligned products and AI-driven solutions.
Top Skills: AICloudCRMItomItsmSaaSSamServicenowServicenow AiSpm

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account