Get the job you really want.

Top Reliability Engineer Jobs in NYC, NY

2 Days AgoSaved
In-Office
New York, NY
139K-204K Annually
Senior level
139K-204K Annually
Senior level
Cloud • Information Technology • Machine Learning
As a Storage Reliability Engineer, you will manage mission-critical storage systems, troubleshoot complex incidents, and improve infrastructure reliability through automation and tooling.
Top Skills: Csi DriversGoKubernetesNfsS3
Reposted 5 Days AgoSaved
Easy Apply
Hybrid
New York, NY
Easy Apply
129K-232K Annually
Senior level
129K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will maintain service uptime, improve automation, and ensure infrastructure reliability while collaborating with engineering teams at Braze.
Top Skills: ChefDockerKafkaKubernetesLinuxMongoDBRedisRuby On RailsTerraformUnix Shell
Reposted 6 Days AgoSaved
Hybrid
New York, NY
205K-225K Annually
Senior level
205K-225K Annually
Senior level
Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation
As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.
Top Skills: AWSCloudFormationDatadogElkPrometheusTerraform
Reposted 7 Days AgoSaved
Easy Apply
In-Office
New York, NY
Easy Apply
100K-250K Annually
Mid level
100K-250K Annually
Mid level
Fintech • Payments • Financial Services
The role involves improving system reliability, building automation, debugging issues, collaborating across teams, and mentoring engineers, focusing on creating a reliable financial ecosystem.
Top Skills: AWSAzureDatadogDockerEc2GCPGoKubernetesRustTerraform
Reposted 7 Days AgoSaved
In-Office
New York, NY
125K-350K Annually
Mid level
125K-350K Annually
Mid level
Information Technology • Software • Financial Services • Quantitative Trading
The Site Reliability Engineer will provide support and diagnose issues within a real-time, distributed environment, focusing on large-scale application and infrastructure management, with basic required skills in UNIX/Linux, networking, SQL, and scripting languages.
Top Skills: BashPythonSQLTcp/IpUdpUnix/Linux
Reposted 8 Days AgoSaved
Easy Apply
Hybrid
New York, NY
Easy Apply
130K-232K Annually
Senior level
130K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and improve the data export system, focusing on observability, reliability, and scalability while guiding junior engineers and adhering to best practices.
Top Skills: BuildkiteDocker SwarmGitGitlabJavaJenkinsKafkaKotlinKubernetesMongoDBPostgresRubySidekiqSnsSqs
Reposted 9 Days AgoSaved
Easy Apply
Hybrid
New York, NY
Easy Apply
179K-212K Annually
Senior level
179K-212K Annually
Senior level
Healthtech • Pharmaceutical • Telehealth
As a Senior Site Reliability Engineer, you'll ensure production system reliability, design resilient infrastructures, and improve operational excellence while collaborating with cross-functional teams.
Top Skills: AWSDatadogEksElasticacheGoPulumiPythonRdsRoute53S3Terraform
Reposted 9 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
Manage continuous delivery infrastructure for reliable code deployment. Collaborate with teams to streamline onboarding, support deployment systems, and participate in on-call rotations.
Top Skills: Argo WorkflowsArgocdAWSAzureGoGoogle Cloud PlatformKubernetesPython
Reposted 9 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
147K-289K Annually
Senior level
147K-289K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills: AnsibleAWSAzureCloudFormationGCPGoTerraform
YesterdaySaved
Remote or Hybrid
New York, NY
Senior level
Senior level
Fintech • Software
The Principal Site Reliability Engineer - Cloud is responsible for managing and optimizing SaaS cloud infrastructure, ensuring performance, reliability, and security, while automating operations and collaborating within teams.
Top Skills: .NetAnsibleAppdynamicsAWSAzureAzure DevopsC#DatadogDynatraceHarnessIderaJavaJenkinsKubernetesNew RelicRedgateSolarwindsSQLTerraform
Reposted 11 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills: AWSGCPAzureMongoDB
Reposted 2 Days AgoSaved
Easy Apply
Remote
New York, NY
Easy Apply
195K-270K Annually
Expert/Leader
195K-270K Annually
Expert/Leader
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Software Engineer on the SRE team, lead best practices adoption, mentor engineers, and improve system reliability and user experience through automation and collaboration.
Top Skills: CdkCloudFormationDatadogGoJavaScriptPrometheusPythonTerraformTypescript
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
3 Days AgoSaved
Remote
New York, NY
150K-220K Annually
Senior level
150K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills: AWSBashGoKubernetesPythonSlurmTerraform
Reposted 12 Days AgoSaved
In-Office
New York, NY
175K-275K Annually
Expert/Leader
175K-275K Annually
Expert/Leader
Artificial Intelligence • Cloud • Enterprise Web • Natural Language Processing • Software • App development • Automation
Design and implement large-scale distributed systems that integrate AI safely and reliably, focusing on infrastructure, observability, and security.
Top Skills: Cloud NetworkingContainersDistributed SystemsEvent Driven RuntimesKedaKnativeKubernetesMulti Cloud ArchitectureOperating SystemsScalability
Reposted 12 Days AgoSaved
Hybrid
New York, NY
194K-260K Annually
Senior level
194K-260K Annually
Senior level
Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
As a Senior Reliability Engineer at Celonis, you will lead reliability efforts for cloud-based microservices, enhance performance, automate processes, and collaborate with engineering teams to improve system reliability.
Top Skills: ArgocdAWSAzureCi/CdDatadogGCPGithub ActionsJavaKubernetesKustomizeLinuxPythonSpringTerraform
Reposted 12 Days AgoSaved
In-Office or Remote
New York, NY
105K-300K Annually
Entry level
105K-300K Annually
Entry level
Information Technology • Software • Financial Services • Big Data Analytics
SREs at Citadel focus on optimizing and maintaining system reliability, performance, and automation for investment applications, collaborating closely with teams.
Top Skills: Ci/CdCSSJavaScriptPythonReactSQL
Reposted 3 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
187K-224K Annually
Senior level
187K-224K Annually
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Seeking a Senior Software Engineer, Site Reliability to ensure system stability, scalability, and reliability, while optimizing AWS infrastructure using modern DevOps practices and tools like Terraform, Docker, and Kubernetes.
Top Skills: AWSCircleCICronitorDatadogDockerGithub ActionsJenkinsKubernetesMySQLPagerdutyReactRedisRuby On RailsSentrySidekiqTerraform
14 Days AgoSaved
Hybrid
New York, NY
209K-286K Annually
Senior level
209K-286K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
This role involves driving the product management strategy for Cyber SRE by embedding reliability in products, creating automated solutions, and enhancing cybersecurity practices while collaborating with engineering leaders.
Top Skills: AIOpentelemetry
8 Days AgoSaved
In-Office
New York, NY
182K-249K Annually
Senior level
182K-249K Annually
Senior level
Cloud
As a Database Reliability Engineer, oversee MySQL database services, ensure performance and availability, coordinate infrastructure tuning, and enhance operational processes.
Top Skills: ChefCloudsqlDockerGrafanaKubernetesLinuxMySQLPostgresRds AuroraTerraform
14 Days AgoSaved
Easy Apply
In-Office
New York, NY
Easy Apply
160K-300K Annually
Senior level
160K-300K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
Own and improve critical production services end-to-end by writing production-quality code: instrumenting services, eliminating performance bottlenecks, building deployment and observability platforms, defining SLOs, running incident response and post-mortems, capacity planning and cost optimization, maintaining CI/CD, and embedding with product teams to design reliable systems.
Top Skills: AWSC++Ci/CdContainer OrchestrationGoObservability StacksPythonRust
15 Days AgoSaved
Easy Apply
Hybrid
New York, NY
Easy Apply
151K-297K Annually
Expert/Leader
151K-297K Annually
Expert/Leader
Big Data • Cloud • Software • Database
Lead a 6–8 person team managing the Kubernetes fleet and core runtime components (CoreDNS, cert-manager, Gatekeeper). Define technical vision and roadmap, guide migration from Terraform to Operator-driven lifecycle management, perform hands-on architectural reviews and PR reviews, resolve operational incidents, and collaborate with engineering leaders and stakeholders.
Top Skills: AlertingAWSAzureCert-ManagerContainerizationCorednsCrossplaneGatekeeperGCPKubernetesLoad BalancingObservabilityOperatorsService MeshTerraform
Reposted 5 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
140K-170K Annually
Senior level
140K-170K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills: ArgocdCi/CdDockerGitopsGoGrafanaHoneycombJenkinsKubernetesOpentelemetryPrometheusPythonTerraform
Reposted 9 Days AgoSaved
Easy Apply
In-Office
New York, NY
Easy Apply
Mid level
Mid level
Information Technology • Consulting
The Systems Reliability Engineer supports operations platforms by ensuring system reliability, automation improvements, and troubleshooting across diverse technologies.
Top Skills: AnsibleApicaAzureAzure Virtual MachinesAzuresqlBashDb2GitGrafanaLinuxPerlPrometheusPythonRubyServicebusSplunkSQLSybaseUnix
Reposted 15 Days AgoSaved
Easy Apply
Remote or Hybrid
New York, NY
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills: AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
Reposted 7 Days AgoSaved
Remote or Hybrid
New York, NY
148K-185K Annually
Senior level
148K-185K Annually
Senior level
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Lead SRE responsible for architecting and automating fault-tolerant, scalable infrastructure across cloud and on-prem, driving deployment, monitoring, and performance tuning while mentoring engineers to improve reliability and SLAs.
Top Skills: .NetAnsibleAWSAws GreengrassC#ChefDockerElixirGCPGitopsGoJavaKubernetesLinuxNutanixPythonRubyTerraformVsphere
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account