Get the job you really want.

Top Reliability Engineer Jobs in NYC, NY

Reposted 2 Days AgoSaved
In-Office
New York, NY
90K-122K Annually
Mid level
90K-122K Annually
Mid level
Fintech • Analytics
The Site Reliability Engineer will manage production monitoring, incident response, and enhance automation using various tools. They will ensure observability and participate in SRE process improvements.
Top Skills: AWSCucumberDatadog ApmDatadog DbmDynamoDBEc2EcsElkJavaJenkinsPagerdutyPlaywrightRdsS3Secrets ManagerSeleniumServicenowSplunkSpring Boot
Reposted 2 Days AgoSaved
Hybrid
New York, NY
178K-240K Annually
Mid level
178K-240K Annually
Mid level
Edtech • Machine Learning • Mobile • Other • Software
As a Senior Site Reliability Engineer at Duolingo, you'll improve system reliability and scalability, collaborate with teams, and consult on infrastructure design.
Top Skills: DockerGoJavaKotlinKubernetesMesosNomadPython
Reposted 25 Days AgoSaved
In-Office
New York, NY
Senior level
Senior level
Artificial Intelligence • Software
As a Senior/Staff Network Reliability Engineer, you'll optimize and maintain Fluidstack's network platform, ensuring performance and reliability for AI and HPC workloads. Responsibilities include tuning networking protocols, deploying and validating switches, automating telemetry, conducting root-cause analyses, and collaborating with vendors.
Top Skills: BgpDpdkEbpfEvpnGeneveGoPythonRdmaRustTcp/IpVxlanXdp
Reposted 3 Days AgoSaved
In-Office or Remote
New York, NY
150K-250K Annually
Mid level
150K-250K Annually
Mid level
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills: Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
4 Days AgoSaved
In-Office
New York, NY
140K-205K Annually
Senior level
140K-205K Annually
Senior level
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills: AWSChefDatadogGoGrafanaJavaPrometheusPuppetPythonSaltTerraform
17 Days AgoSaved
Easy Apply
Remote
New York, NY
Easy Apply
130K-150K Annually
Mid level
130K-150K Annually
Mid level
Marketing Tech
The Cloud Reliability Engineer develops, configures, and deploys cloud tools, enhances applications, ensures observability, and participates in on-call rotations.
Top Skills: AWSCi/CdDockerGithub ActionsGoGoogle BigqueryGCPKubernetesLinuxPythonSQLTerraform
4 Days AgoSaved
Hybrid
New York, NY
149K-222K Annually
Senior level
149K-222K Annually
Senior level
Fintech • Mobile • Software
The Staff Site Reliability Engineer will design and manage AWS infrastructure, optimize Kubernetes operations, automate workflows, and troubleshoot systems for improved reliability and performance.
Top Skills: AWSCi/CdDatadogDockerEksGithub ActionsGoKafkaKubernetesNginxPrivatelinkPythonTerraformTransit GatewayVpc
Reposted 5 Days AgoSaved
In-Office
New York, NY
110K-130K Annually
Senior level
110K-130K Annually
Senior level
Financial Services
As a Site Reliability Engineer, you'll optimize and manage cloud infrastructure, implement automation, and maintain system reliability for a global financial platform.
Top Skills: AWSGCPGoHelmKubernetesLinuxPythonTerraform
Reposted 5 Days AgoSaved
In-Office
New York, NY
Senior level
Senior level
Artificial Intelligence
As an Applied AI Engineer, you will onboard customers, deploy AI solutions, work on complex projects, and provide technical guidance. You'll contribute to open-source projects and communicate effectively with stakeholders.
Top Skills: AnsibleAWSAzureDockerGCPKubernetesPythonTerraform
Reposted 25 Days AgoSaved
Remote
New York, NY
Expert/Leader
Expert/Leader
Artificial Intelligence • Productivity • Software • Automation
As a Staff Site Reliability Engineer at Zapier, you will lead reliability strategies to enhance observability, mentor engineers, and drive adoption of reliability practices. You will design for scale, influence organizational culture, and integrate AI tools into workflows for improved performance.
Top Skills: ArgocdAWSDatadogGitlabGoGrafanaKafkaKubernetesOpensearchPrometheusPythonRedisSentryTerraformTypescript
Reposted 25 Days AgoSaved
Easy Apply
Remote
New York, NY
Easy Apply
124K-266K Annually
Senior level
124K-266K Annually
Senior level
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer at GitLab, you will automate and manage the lifecycle of GitLab environments, ensuring reliability and scalability while leading incident responses and architectural decisions.
Top Skills: AnsibleAWSElkGCPGoGrafanaKubernetesPrometheusRubyTerraform
Reposted 21 Days AgoSaved
Remote
New York, NY
Senior level
Senior level
Artificial Intelligence • Cybersecurity
The Database Reliability Engineer will ensure database availability, performance, scalability, and security across AWS, collaborating with application and security teams.
Top Skills: AWSCrossplaneDatadogGitlab Ci/CdKubernetesNoSQLOpensearchPostgresTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 9 Days AgoSaved
Easy Apply
In-Office
New York, NY
Easy Apply
150K-300K Annually
Junior
150K-300K Annually
Junior
Software
As an AI Site Reliability Researcher, you will ensure the scalability and reliability of our AI platform, design systems for observability, manage deployments, and develop CI/CD pipelines for hybrid environments.
Top Skills: AIAWSDatadogGCPGrafanaKubernetesOpentelemetryPrometheusSreTerraform
Reposted 9 Days AgoSaved
Easy Apply
In-Office
New York, NY
Easy Apply
180K-210K Annually
Senior level
180K-210K Annually
Senior level
AdTech • Marketing Tech • Analytics
The Staff SRE DevOps Engineer will manage customer applications, improve system reliability, collaborate on architecture discussions, and support infrastructure needs across teams.
Top Skills: AWSBashDatadogDockerKafkaKibanaKubernetesLinuxPostgresPythonRedshiftSparkTerraform
Reposted 38 Minutes AgoSaved
In-Office or Remote
New York, NY
144K-270K Annually
Senior level
144K-270K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Senior Site Reliability Engineer (SRE) at NVIDIA is responsible for designing, building, and maintaining large-scale production systems, focusing on reliability and efficiency, automation, and continuous improvement.
Top Skills: ContainersGoKubernetesLinuxNetworkingOpenstackPerlPythonRuby
Reposted 39 Minutes AgoSaved
In-Office or Remote
New York, NY
144K-270K Annually
Senior level
144K-270K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Senior Site Reliability Engineer will design, implement, and maintain an observability platform, ensuring reliability and performance while supporting production systems and optimizing operational practices.
Top Skills: DockerGoGrafanaKubernetesLinuxNetworkingOpenstackOpentelemetryPerlPrometheusPythonRuby
Reposted 40 Minutes AgoSaved
In-Office or Remote
New York, NY
168K-334K Annually
Senior level
168K-334K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design and maintain large scale Kubernetes clusters, ensuring reliability through monitoring, automation and incident response.
Top Skills: DockerGoKubernetesLinuxNetworkingOpenstackPerlPythonRuby
Reposted 2 Hours AgoSaved
In-Office or Remote
New York, NY
248K-391K Annually
Expert/Leader
248K-391K Annually
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Principal Staff SRE will lead initiatives in building and optimizing core infrastructure services on-prem and cloud, deploying and managing services at scale, and improving performance with automation and monitoring tools.
Top Skills: DhcpDnsEbpfGoLdapLinuxNtpPythonTerraformXdp
Reposted 2 Hours AgoSaved
In-Office or Remote
New York, NY
168K-334K Annually
Senior level
168K-334K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Senior Site Reliability Engineer will manage deployments, operations, and incident handling for large-scale AI GPU platforms while ensuring high performance and resilience in configurations.
Top Skills: C++KubernetesLinuxPython
Reposted 4 Hours AgoSaved
Remote
New York, NY
150K-190K Annually
Senior level
150K-190K Annually
Senior level
Hardware • Machine Learning • Security • Software
The Site Reliability Engineer will manage software deployment for IoT devices, improve observability, maintain dashboards, automate processes, and collaborate on incident responses.
Top Skills: AnsibleAWSBashC/C++DatadogGrafanaGroovyJavaJavaScriptNoSQLPostgresPrometheusPythonRSigmaSQLTerraform
4 Hours AgoSaved
Remote or Hybrid
New York, NY
183K-245K Annually
Senior level
183K-245K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Machine Learning • Mobile • Software
The Staff Site Reliability Engineer will design, implement, and optimize infrastructure for AI services, ensure reliability and performance, and drive automation and observability excellence across engineering teams.
Top Skills: AzureAzure DevopsDockerElk StackGithub ActionsGrafanaKubernetesMimirPostgresPrometheusSQL ServerTeamcityTerraform
Reposted 10 Days AgoSaved
In-Office
New York, NY
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Security • Cybersecurity
Lead DevOps initiatives to enhance microservice infrastructure, mentor engineering teams, and manage production issues, with a focus on security and automation.
Top Skills: AWSCircleCIGithub ActionsGoJenkinsKubernetesPythonTerraform
YesterdaySaved
In-Office or Remote
New York, NY
184K-357K Annually
Senior level
184K-357K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves architecting and operating large-scale observability systems, designing resilient telemetry pipelines, automating operations, and leading incident responses while collaborating with various teams.
Top Skills: ElasticsearchFlinkGoJaegerKafkaLokiMimirOpensearchOpentelemetryPrometheusPythonSparkTempoThanos
11 Days AgoSaved
In-Office
New York, NY
Senior level
Senior level
Consulting
The SRE Engineer will manage application support, automate solutions, improve system stability, and partner with various teams for application deployment, focusing on infrastructure management and SRE principles.
Top Skills: AirflowApacheClouderaDatabasesGrafanaHadoopKafkaKubernetesLinuxNginxOpenshiftPrometheusPythonRedisShellUnix
Reposted YesterdaySaved
In-Office or Remote
New York, NY
Senior level
Senior level
Software
The role involves managing compute infrastructure for decentralized applications, requiring critical thinking, documentation skills, and experience in Kubernetes and blockchain management.
Top Skills: BlockchainGitopsInfrastructure-As-CodeKubernetesProgramming Languages
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account