Get the job you really want.

Top Senior Site Reliability Engineer Jobs in NYC, NY

Reposted 2 Days Ago
Remote
New York, NY
Mid level
Software • Analytics
This SRE role involves deep ownership of production systems, focusing on improving AWS infrastructure, operational tooling, and automation for scaling ClickHouse installations at petabyte scale.
Top Skills: Ansible, AWS, ClickHouse, EC2, Linux, Terraform
Reposted 2 Days Ago
Remote
New York, NY
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills: AWS, ClickHouse, Go, Java, Kafka, Kubernetes, Pulumi, Terraform
Reposted 2 Days Ago
Easy Apply
Remote
New York, NY
Senior level
Fitness
The Staff Site Reliability Engineer will establish SRE best practices, drive observability strategy, implement software solutions, and mentor engineers. Responsibilities include improving platform resilience, managing risks, and participating in incident response processes.
Top Skills: Ansible, AWS, Azure, Bash, CloudFormation, GCP, Go, Kubernetes, Pulumi, Python, Terraform
3 Days Ago
Easy Apply
Remote or Hybrid
New York, NY
170K-230K Annually
Senior level
Cloud • Security • Software
As a Senior Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, ensuring software deployment through automated CI/CD pipelines, while collaborating with teams to enhance service delivery.
Top Skills: CI/CD, Cloud Platforms, Docker, Go, Kubernetes
Reposted 3 Days Ago
Remote
New York, NY
107K-147K Annually
Senior level
Healthtech
Lead SRE initiatives to ensure reliability, scalability, and performance of cloud and on‑prem systems. Architect IaC and CI/CD, manage incidents and on‑call rotations, implement monitoring/SLOs, and drive operational excellence for AI/ML workloads while mentoring engineers and collaborating across teams.
Top Skills: AWS, Azure, Azure Databricks, Bash, CI/CD (ADO YAML Pipelines), Container Orchestration, File Transfer, GCP, Go, Kafka, Oracle, Postgres, PySpark, Python, REST API, Snowflake, SQL
Reposted 12 Days Ago
In-Office or Remote
New York, NY
140K-205K Annually
Senior level
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills: AWS, Chef, Datadog, Go, Grafana, Java, Prometheus, Puppet, Python, Salt, Terraform
Reposted 12 Days Ago
Hybrid
New York, NY
180K-275K Annually
Senior level
Financial Services
Design, develop, and deploy robust platform solutions while ensuring reliability, scalability, and security of the system. Collaborate with teams to enhance tooling and automation.
Top Skills: GCP, Kubernetes, Terraform
3 Days Ago
Easy Apply
Remote
New York, NY
50K-120K Annually
Senior level
Artificial Intelligence • Other • Sales • Software
Design and advance core infrastructure for engineering teams, ensuring reliability of multi-cloud Kubernetes clusters, and build dev tools to empower deployment pipelines.
Top Skills: AWS, Azure, CI/CD, CloudFormation, GitOps, Go, GCP, Grafana, Kubernetes, Postgres, Python, Terraform
Reposted 4 Days Ago
Easy Apply
Remote
New York, NY
Senior level
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills: ArgoCD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Reposted 4 Days Ago
Remote
New York, NY
140K-180K Annually
Senior level
Fintech
As a Site Reliability Engineer, you will enhance system reliability through scalable infrastructure, observability practices, automation, and collaboration with engineering teams.
Top Skills: AWS, Datadog, Go, Grafana, Java, Kubernetes, Node.js, Prometheus, Pulumi, Python, Terraform
Reposted 4 Days Ago
Easy Apply
Remote
New York, NY
89K-287K Annually
Mid level
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills: AWS Fargate, CoreWeave, Grafana, Kubernetes, Prometheus, Terraform
Reposted 4 Days Ago
In-Office or Remote
New York, NY
124K-206K Annually
Senior level
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills: Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere
Reposted 14 Days Ago
Hybrid
New York, NY
150K-225K Annually
Mid level
Software
As an AI Support Engineer, you'll manage support requests, resolve user issues, optimize ML models, and contribute to product development.
Top Skills: TensorRT
Reposted 5 Days Ago
Easy Apply
In-Office or Remote
New York, NY
124K-206K Annually
Senior level
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills: Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere
6 Days Ago
Remote
New York, NY
116K-255K Annually
Senior level
Big Data • Information Technology • Security • Software
The Senior Developer will drive observability roadmaps using SRE Golden Signals, establish monitoring strategies, enhance system reliability, and act as an expert in New Relic technology for performance management.
Top Skills: Bash, CRI-O, Csh, Kubernetes, New Relic, Perl, Windows PowerShell
16 Days Ago
In-Office
New York, NY
129K-168K Annually
Senior level
Healthtech
Build and harden AWS cloud environments and CI/CD pipelines, manage IaC and container platforms, own observability and incident response, enforce security and HA/DR, and automate operational tasks to support a regulated medical-imaging platform.
Top Skills: AWS, VPC, IAM, KMS, Terraform, CDK, CloudFormation, Docker, Kubernetes, EKS, ECS, Prometheus, Grafana, AWS CloudWatch Insights, GitHub, Apache Airflow, Python, Bash, VPN, PrivateLink, Direct Connect, SBOM, DICOM, HL7
Reposted 6 Days Ago
Remote
New York, NY
72K-190K Annually
Mid level
Cloud • Software
The Site Reliability Engineer (SRE) will manage reliable, scalable systems, focusing on software development, infrastructure automation, and incident response. Responsibilities include monitoring, CI/CD pipeline management, security compliance, and cost optimization while collaborating with various teams.
Top Skills: AWS, Azure, Docker, ELK Stack, GCP, Git, Grafana, Java, Kubernetes, PHP, Prometheus, Python, Shell, Terraform
7 Days Ago
Easy Apply
Remote
New York, NY
170K-215K Annually
Senior level
Blockchain • Fintech • Social Media • Cryptocurrency • NFT • Web3
Design, build, and operate scalable, highly available infrastructure and platform software for Zora's blockchain services (indexer, APIs, data pipelines). Automate workflows, maintain core systems, improve developer experience, participate in on-call rotation, and contribute strategic technical direction.
Top Skills: Asyncio, Base, Bridges, Ceph, Cloudflare Pages Functions, Datadog, Docker, Ethereum, Go, IPFS, Kubernetes, MongoDB, OpenTelemetry, Optimism, Optimistic Rollups, Plasma, Polygon, Postgres, Python, RPC Nodes, Sidechains, Vercel, ZK-Rollups
7 Days Ago
Remote
New York, NY
Mid level
Security • Software • Analytics
Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.
Top Skills: AWS, Docker, Kubernetes, Amazon EKS, Terraform, Pulumi, Linux, GitHub Actions, GitLab, CircleCI, LLMs, Golang, Monitoring and Observability Tools
Reposted 16 Days Ago
In-Office
New York, NY
110K-130K Annually
Senior level
Financial Services
As a Site Reliability Engineer, you'll optimize and manage cloud infrastructure, implement automation, and maintain system reliability for a global financial platform.
Top Skills: AWS, GCP, Go, Helm, Kubernetes, Linux, Python, Terraform
8 Days Ago
Easy Apply
Remote
New York, NY
215K-250K Annually
Senior level
Security • Cybersecurity
Lead the design and implementation of observability, SLO/SLA frameworks, and AI-enabled infrastructure automation. Architect scalable AWS infrastructure, improve incident management and on-call practices, and drive organization-wide adoption of telemetry and reliability standards.
Top Skills: AI-Assisted Tooling, AWS, CI/CD, Claude, Codex, Cursor, Grafana, Honeycomb, Infrastructure-as-Code, Observability, Pulumi, Supabase, Telemetry, Terraform, Vercel
Reposted 13 Days Ago
Easy Apply
Remote or Hybrid
New York, NY
140K-170K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills: ArgoCD, CI/CD, Docker, GitOps, Go, Grafana, Honeycomb, Jenkins, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
Reposted 8 Days Ago
Easy Apply
Remote
New York, NY
200K-300K Annually
Expert/Leader
Payments
As a Principal Site Reliability Engineer, you'll architect scalable infrastructure, drive reliability, mentor engineers, and lead AI enablement efforts, ensuring high performance across systems.
Top Skills: AWS, CI/CD, Datadog, Elasticsearch, Go, Grafana, Kubernetes, New Relic, Prometheus, Python, RDS (MySQL/Postgres), SQL-Based RDBMS, TypeScript
Reposted 8 Days Ago
Remote
New York, NY
115K-135K Annually
Mid level
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills: ArgoCD, AWS, ELK, GCP, Go, Grafana, Istio, Jaeger, Kubernetes, Linkerd, Loki, OpenTelemetry, Prometheus, Python, Tempo, Terraform
9 Days Ago
Remote
New York, NY
Senior level
Software • Consulting
Lead production support for external web applications: manage incidents, perform root cause analysis, expand observability (Splunk/OpenTelemetry), build dashboards, collaborate with dev and platform teams, and participate in 24x7 on-call rotations to improve availability and reliability.
Top Skills: Splunk, OpenTelemetry, AppDynamics, Datadog, AWS, Kubernetes, Python, ServiceNow, MuleSoft, Postman, Linux, Shell Scripting, OpenShift, Azure, GCP, API Testing