Job Title, Company or Keyword

Maximum of 25 job preferences reached.

Top Senior Site Reliability Engineer Jobs in NYC, NY

Mattermost

Lead Site Reliability Engineer

Reposted 2 Days AgoSaved

Remote

New York, NY

170K-200K Annually

Senior level

170K-200K Annually

Senior level

Software

Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.

Top Skills: AWSAws MarketplaceAzureAzure MarketplaceGCPGoogle Cloud MarketplaceGrafanaKubernetesPrometheusTerraform

Assured

Staff Site Reliability Engineer

Reposted 2 Days AgoSaved

Remote

New York, NY

180K-210K Annually

Senior level

180K-210K Annually

Senior level

Artificial Intelligence • Insurance • Software • Automation

The Staff Site Reliability Engineer will build and scale infrastructure for Assured's platform, automate delivery, enhance observability, and lead mentoring initiatives.

Top Skills: AWSKubernetesPostgresTerraform

Cresta

Senior Infrastructure Engineer/SRE

Reposted 2 Days AgoSaved

Remote

New York, NY

205K-270K Annually

Senior level

205K-270K Annually

Senior level

Artificial Intelligence • Other • Sales • Software

The role involves designing and advancing infrastructure for the engineering team, ensuring the reliability of Kubernetes clusters, automating operations, and building machine learning infrastructure.

Top Skills: ArgoAWSAzureCloudFormationFluxGithub ActionsGoGCPKubernetesPostgresPythonTerraform

Alteryx

Lead Site Reliability Engineer

Reposted 2 Days AgoSaved

Remote

New York, NY

136K-177K Annually

Senior level

136K-177K Annually

Senior level

Big Data • Machine Learning • Software • Analytics

As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.

Top Skills: ArgocdC++Ci/CdCloud PlatformsDatadogGitopsGrafanaInfrastructure As CodeJavaJavaScriptKubernetesPython

Photon

SRE Architect | Onsite

3 Days AgoSaved

Remote

New York, NY

Senior level

Agency • Information Technology

Lead SRE role designing and maintaining CI/CD pipelines (GitHub Actions), containerized deployments (Docker, Kubernetes, AKS, Helm), web/mobile app releases, observability, automated testing, and DevOps best practices across cloud environments with cross-functional collaboration and regulatory compliance.

Top Skills: AksAndroidAzure Application InsightsAzure Log AnalyticsAzure MonitorBashBranchingDockerDocker ComposeGitGit HooksGithub ActionsGoogle PlayHelmHerokuiOSIos App StoreJavaKubernetesNpmPowershellPull RequestsPythonSonarqubeVeracodeVercel

Refine Technology Inc

Site Reliability Engineer (In-Person)

12 Days AgoSaved

Hybrid

New York, NY

95K-125K Annually

Mid level

95K-125K Annually

Mid level

Artificial Intelligence • eCommerce • Retail • Software

Build and maintain CI/CD pipelines, manage and automate cloud infrastructure and configurations, implement monitoring/logging and alerting for reliability, enforce security and compliance practices, and collaborate with development teams to support scaling and operations.

Top Skills: Soc ISoc Ii

Solidus Labs

DevOps/SRE

13 Days AgoSaved

Hybrid

New York, NY

Mid level

Cryptocurrency

Own production reliability, availability, and performance for cloud-native systems. Operate and scale Kubernetes (EKS) clusters, manage AWS infrastructure, implement IaC with Terraform and Helm, improve CI/CD, build observability with Prometheus/Grafana/EFK, lead incident response and RCA, participate in on-call rotations, and support security and compliance.

Top Skills: AirflowAws BatchAws Ec2Aws LambdaAws OrganizationsBashClickhouseCloudwatchDatabricksDockerDynamoDBEfk (ElasticsearchEksElasticacheEmrFluentdGitlab Ci/CdGitopsGrafanaHelmHpaKafkaKarpenterKedaKibana)KubernetesLoad BalancingNatPostgresPrometheusPythonRdsRedisS3SnowflakeSparkSqsTerraformTlsVpcVpn

Cohere AI

Site Reliability Engineer, Inference Infrastructure

Reposted 13 Days AgoSaved

In-Office or Remote

New York, NY

Senior level

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Generative AI

The Site Reliability Engineer will develop, deploy, and operate AI infrastructure, focusing on high-performance and scalable machine learning systems using Kubernetes and cloud platforms.

Top Skills: AWSAzureC++GCPGoKubernetesOci

FloSports

Staff Site Reliability Engineer

Reposted 4 Days AgoSaved

Remote

New York, NY

Senior level

Digital Media • Social Media • Software • Sports

Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.

Top Skills: Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform

Ditto

Site Reliability Engineer

Reposted 4 Days AgoSaved

Remote

New York, NY

156K-288K Annually

Mid level

156K-288K Annually

Mid level

Computer Vision • Machine Learning • Software

As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.

Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform

AlphaSense

Staff Site Reliability Engineer

5 Days AgoSaved

Remote or Hybrid

New York, NY

150K-225K Annually

Senior level

150K-225K Annually

Senior level

Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence

Lead architecture and implementation of reliability platforms and SRE practices for a production SaaS. Build self-service reliability tooling, drive AIOps automation, advance observability (monitoring, tracing, profiling), lead incident response and postmortems, mentor engineers, and embed production readiness across teams to achieve 99.99% uptime.

Top Skills: AWSAzureContinuous ProfilingDatadogDnsElkGCPGoGrafanaHttp/SKubernetesLoad BalancingOpentelemetryPrometheusPythonTcp/Ip

OXIO

Site Reliability Engineer

Reposted 6 Days AgoSaved

Remote

New York, NY

Mid level

Other

As a Site Reliability Engineer, you will design cloud platforms, automate operations, maintain infrastructure, and support engineering teams in delivering reliable services.

Top Skills: AnsibleAWSAzureBashCircleCICloudFormationDatadogDnsDockerGitlab CiGoGCPGrafanaHTTPHttpsJenkinsKubernetesKvmLinuxPerlPrometheusPythonRubyTcp/IpTerraformUnixVMware

New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free

TherapyNotes, LLC

Senior Database Site Reliability Engineer

Reposted 6 Days AgoSaved

Remote

New York, NY

120K-160K Annually

Senior level

120K-160K Annually

Senior level

Healthtech • Other • Software

As a Senior Database Site Reliability Engineer, you'll design, implement, and maintain PostgreSQL systems, ensure reliability, automate maintenance tasks, and participate in incident response.

Top Skills: AnsibleBashDatadogGrafanaNew RelicPostgresPowershellPrometheusPythonTerraform

OneStream Software

Site Reliability Engineer

Reposted 6 Days AgoSaved

Remote

New York, NY

114K-148K Annually

Senior level

114K-148K Annually

Senior level

Software • Financial Services

Ensure platform reliability, performance, and availability by implementing observability, automating infrastructure, participating in on-call rotations and post-mortems, partnering with Product and Engineering, designing scalable architectures, mentoring teammates, and integrating Dynatrace with Azure DevOps and Jira while supporting compliance (SOC/FedRAMP).

Top Skills: .NetAksAlpineAnsibleAppinsightsArm TemplatesAWSAzure DevopsBashBicepC#ChefCloudFormationDatadogDebianDynatraceEksGCPGitGitGksGrafanaHelmJIRAKubernetesLog AnalyticsAzureNew RelicOnestream SoftwareOpenshiftPowershellPowershell DscPrometheusPuppetPythonRest ApisSQLTerraformUbuntu

Alpaca

Staff Site Reliability Engineer, Database

Reposted 6 Days AgoSaved

Remote

New York, NY

Senior level

Fintech • Information Technology

As a Site Reliability Engineer at Alpaca, you will ensure system reliability and performance, troubleshoot issues, and collaborate with teams to design scalable features.

Top Skills: GoGormLinuxPgxPostgresPrometheusSqlc

Chess.com

Site Reliability Engineer

Reposted 6 Days AgoSaved

Remote

New York, NY

Senior level

Gaming • Software

The Site Reliability Engineer will manage infrastructure stability and scalability, lead cloud migrations, and optimize performance across systems while mentoring team members.

Top Skills: AnsibleAWSAzureBashChefCloudFormationDatadogDockerElk StackGCPGoGrafanaKubernetesPrometheusPuppetPythonTerraformUnix/Linux

Kong

Staff Site Reliability Engineer - Volcano

6 Days AgoSaved

Remote

New York, NY

150K-210K Annually

Senior level

150K-210K Annually

Senior level

Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics

Founding Staff SRE for Volcano: define SLOs/error budgets, architect multi-region Kubernetes infrastructure, build GitOps/CI-CD with ArgoCD/Helm/Terraform, scale managed Postgres/Redis/object storage, implement observability with Datadog/Prometheus/Grafana, lead incident response and SRE culture, and mentor cross-functional teams.

Top Skills: ArgocdCanary DeploymentsCi/CdCniDatadogGitopsGrafanaHelmIngressKubernetesObject StoragePostgresPrometheusRedisService MeshTerraformTerragrunt

WorkOS

Site Reliability Engineer

Reposted 6 Days AgoSaved

Remote

New York, NY

175K-275K Annually

Mid level

175K-275K Annually

Mid level

Software

As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.

Top Skills: AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript

RunSybil

SRE/Infrastructure Engineer

Reposted 16 Days AgoSaved

Hybrid

New York, NY

30K-120K Annually

Senior level

30K-120K Annually

Senior level

Information Technology • Automation

The SRE/Infrastructure Engineer will architect and manage secure, scalable systems for automated penetration testing, optimizing reliability, and enhancing infrastructure based on customer demand. Responsibilities include maintaining production environments, leading technical discussions, and promoting high coding standards.

Top Skills: AWSAzureCloudFormationElkGCPNew RelicOpentelemetryPostgresPrometheusTerraform

Phantom (phantom.com)

Staff Software Engineer (SRE)

Reposted 8 Days AgoSaved

Remote

New York, NY

200K-250K Annually

Senior level

200K-250K Annually

Senior level

Software • Cryptocurrency

Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.

Top Skills: Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform

Supabase

Site Reliability Engineer

9 Days AgoSaved

Remote

New York, NY

Senior level

Database

Embed with service teams to define SLIs/SLOs and error budgets, run Operational Readiness Reviews, improve incident-to-improvement pipelines, advise on resilience and architecture, reduce operational toil through automation, and shape org-wide on-call practices and operational maturity.

Top Skills: AWSCdkGrafanaKubernetesOpentelemetryPostgresPulumiTerraformVictoriametrics

GE Vernova

SRE Platform Engineer

9 Days AgoSaved

Remote

New York, NY

Senior level

Energy • Manufacturing • Solar • Renewable Energy

Operate and harden production EKS Kubernetes clusters across multiple AWS regions. Build IaC (Terraform, Ansible), implement policy-as-code, ensure security and compliance, manage observability (Prometheus/Grafana), perform L3 support and incident RCA, run platform-level testing and DR, automate toil, and partner with application teams for sizing and cost optimization to achieve high availability for critical cloud infrastructure.

Top Skills: AlbAnsibleArgocdAws Ec2Certificate ManagementDatadogDynatraceEksFluxGoGrafanaKubernetesMskPod PriorityPrometheusPythonRdsS3Service MeshSplunkTerraformVpc

HHAeXchange

SRE Technical Project Manager

Reposted 9 Days AgoSaved

Remote

New York, NY

100K-110K Annually

Mid level

100K-110K Annually

Mid level

Healthtech • Software

The SRE Technical Project Manager will lead project delivery, incident management, automation processes, and uptime communication, partnering with SRE and development teams to ensure system stability and scalability.

Top Skills: Ai BotsDatadogJIRAJira Service ManagementMs TeamsOpsgeniePagerduty

SitusAMC

Site Reliability Engineer - AWS - Remote

Reposted 10 Days AgoSaved

Remote

New York, NY

110K-140K Annually

Senior level

110K-140K Annually

Senior level

Real Estate • Financial Services • PropTech

Support and optimize products migrated to AWS, implement cloud best practices, maintain operational coverage, enhance automation, observability, CI/CD/GitOps, and security. Collaborate with development and platform teams to scale, troubleshoot, and ensure reliable SaaS operations.

Top Skills: AmisArgocdAWSAws Elastic BeanstalkAws Transfer FamilyAzure DevopsBashCloudwatchCurlDockerEc2EksFluxcdGitGitopsHTTPIstioKubernetesLinkerdLoad BalancerPowershellPythonRdsSQLTerraformWget

Okta

Staff Site Reliability Engineer - Kubernetes

Reposted 19 Days AgoSaved

In-Office

New York, NY

194K-267K Annually

Senior level

194K-267K Annually

Senior level

Cloud

The Site Reliability Engineer will manage Kubernetes platforms, optimize AWS cloud infrastructure, ensure high availability, and automate deployment while handling troubleshooting and security compliance.

Top Skills: AWSBashCi/CdCloudwatchElk StackGoGrafanaHelmIstioKubernetesPrometheusPythonTerraform

Let Your Resume Do The Work

Upload your resume to be matched with jobs you're a great fit for.

All Filters

Early Applicant

JobType

New Jobs

Job Category

Experience

Industry

Company Name

Find Company

Company Size

Sign up now Access later

Create Free Account

Already have an account? Log In

Top Senior Site Reliability Engineer Jobs in NYC, NY

Lead Site Reliability Engineer

Staff Site Reliability Engineer

Senior Infrastructure Engineer/SRE

Lead Site Reliability Engineer

SRE Architect | Onsite

Site Reliability Engineer (In-Person)

DevOps/SRE

Site Reliability Engineer, Inference Infrastructure

Staff Site Reliability Engineer

Site Reliability Engineer

Staff Site Reliability Engineer

Site Reliability Engineer

Cut your apply time in half.

Senior Database Site Reliability Engineer

Site Reliability Engineer

Staff Site Reliability Engineer, Database

Site Reliability Engineer

Staff Site Reliability Engineer - Volcano

Site Reliability Engineer

SRE/Infrastructure Engineer

Staff Software Engineer (SRE)

Site Reliability Engineer

SRE Platform Engineer

SRE Technical Project Manager

Site Reliability Engineer - AWS - Remote

Staff Site Reliability Engineer - Kubernetes

Top NYC Companies Hiring Senior Site Reliability Engineers

Photon

Assured

Popular Job Searches

Total selected ()