Baseten Logo

Baseten

Cloud Platform Engineer

Reposted 9 Days Ago
Remote or Hybrid
Hiring Remotely in New York, NY, USA
165K-330K Annually
Mid level
Remote or Hybrid
Hiring Remotely in New York, NY, USA
165K-330K Annually
Mid level
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
The summary above was generated by AI

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

As a Cloud Platform Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and monitoring systems to optimizing performance and managing incidents.

We all work closely with our users, learning from their past struggles in operationalizing ML, onboarding them onto our platform, and turning our learnings into ideas for improving Baseten.

EXAMPLE INITIATIVES

You'll get to work on these types of projects as part of our Infrastructure team:

  • Multi-cloud capacity management

  • Inference on B200 GPUs

  • Multi-node inference

  • Fractional H100 GPUs for efficient model serving

RESPONSIBILITIES

  • Build and maintain scalable infrastructure to support the deployment and operation of machine learning models.

  • Establish standards and best practices for reliability and performance across the infrastructure.

  • Automate processes when relevant, particularly for managing CI/CD pipelines.

  • Own products and projects end-to-end, functioning as both an engineer and a project manager, with a focus on user empathy, project specification, and end-to-end execution.

  • Collaborate with cross-functional teams to understand project requirements and translate them into technical solutions.

  • Mentor junior team members and contribute to knowledge sharing within the organization.

  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity.

  • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates.

REQUIREMENTS

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.

  • Extensive experience with Kubernetes.

  • Experience in building and maintaining scalable infrastructure.

  • Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation, Pulumi) and CI/CD tooling (e.g., GitHub Actions, GitLab CI, Circle CI, Jenkins).

  • Relevant OSS observability experience (Prometheus, ELK stack, Grafana stack, Opentelemetry) is a plus.

  • Ability to own projects end-to-end, from project specification to execution.

  • No prior machine learning experience required, but should be open to learning about it.

BENEFITS

  • Competitive compensation, including meaningful equity.

  • 100% coverage of medical, dental, and vision insurance for employee and dependents

  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

  • Paid parental leave

  • Fertility and family-building stipend through Carrot

  • Company-facilitated 401(k)

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (by example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Baseten New York, New York, USA Office

New York, NY, United States

Similar Jobs

9 Days Ago
Remote or Hybrid
US
100K-160K Annually
Senior level
100K-160K Annually
Senior level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Design, build, and operate enterprise-scale multi-cloud infrastructure (Azure primary, GCP, AWS exposure). Own landing zones, Terraform modules, production AKS/GKE Kubernetes, Vault secrets, hybrid networking, CI/CD pipelines, monitoring, DR, and automation (Ansible, Python/Bash). Mentor engineers, document runbooks, and collaborate with security, application teams, and leadership to ensure secure, reliable, cost-optimized cloud platforms.
Top Skills: AksAnsibleApp GatewayArtifact RegistryAWSAwxAzureAzure DevopsAzure MonitorAzure StorageBashBgpBigQueryCloud BuildCloud LoggingCloud RunCloud SqlCloudboltDatadogDnsEc2EksGitlab CiGkeGoogle Cloud MonitoringGoogle Cloud Platform (Gcp)Hashicorp VaultHelmIamJenkinsKubernetesLoad BalancingManaged IdentityNsgPowershellPrivate EndpointsPythonS3SignozTerraformVertex AiVpcVpc Service ControlsVpnWorkload Identity
2 Days Ago
In-Office or Remote
United States
118K-147K Annually
Junior
118K-147K Annually
Junior
Food
Build and maintain Back of House data delivery and related BOH projects using AWS serverless technologies. Write code and unit tests, collaborate with SRE/QE/security/product teams across Yum! Brands, document data flows, and improve tooling and processes to increase restaurant operational efficiency and data integrations.
Top Skills: AWSAws CdkBigfixCortexCursorGitlabPostmanServerless FrameworkTypescriptWindows 11Wiz
6 Days Ago
Remote or Hybrid
United States
45K-60K Annually
Mid level
45K-60K Annually
Mid level
Greentech • Professional Services • Consulting • Hospitality
Design, build and maintain Microsoft Cloud applications using Power Apps/Pages and Azure services. Implement Entra ID authentication, Azure Functions/App Services, Logic Apps and Blob storage. Create Azure Data Factory pipelines and APIs (REST/GraphQL), integrate external services and LLM data pipelines, and ensure performance, security, and scalability while collaborating with customers to turn requirements into streamlined apps.
Top Skills: Azure App ServicesAzure Blob StorageAzure Data FactoryAzure DevopsAzure FunctionsAzure Logic AppsGithub ActionsGraphQLJavaScriptMicrosoft Entra Id (Azure Ad)Microsoft Power AppsMicrosoft Power PagesNotification HubPower BIPythonRestSQL

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account