Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs in NYC, NY
Artificial Intelligence • Machine Learning
Lead development of AI-assisted reliability tooling, own incident response end-to-end, improve observability and SLO/SLI frameworks, scale single-tenant SaaS operations, mentor engineers, and reduce recurring operational toil through engineering and automation.
Top Skills:
Cloud PlatformsGoKubernetesLinuxLlm/Ai ToolingLogs And TracingObservability ToolingPythonSlo/Sli Frameworks
Reposted 21 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Senior Site Reliability Engineer, you'll design and build complex systems, support Atlas platform operations, automate processes, and ensure high availability of services.
Top Skills:
AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
Reposted 14 Days AgoSaved
Easy Apply
Easy Apply
Cloud • Information Technology • Security • Software • Cybersecurity
This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.
Top Skills:
AnsibleAws EcsKubernetesLinuxPythonTerraform
Reposted 14 Days AgoSaved
Easy Apply
Easy Apply
Cloud • Security • Software • Cybersecurity • Automation
As a Cloud Cost Utilization SRE at GitLab, you'll manage cloud spending, improve tracking and optimization of cloud usage, and collaborate with finance and engineering teams to enhance cost efficiency across AWS and GCP.
Top Skills:
AnsibleAWSElkGCPGrafanaLokiMimirPrometheusTempoTerraform
Cloud • Information Technology • Machine Learning
As a Senior Site Reliability Engineer, you'll ensure the reliability and performance of a Kubernetes-based data platform, focusing on scaling infrastructure, enhancing security, and optimizing deployment processes.
Top Skills:
AirflowArgo CdFlinkGithub ActionsGrafanaHelmIstioKafkaKubernetesLinkerdOpentelemetryPrometheusPulumiSparkTerraform
Reposted 3 Days AgoSaved
Cloud • Software
Responsible for maintaining FedRAMP-compliant infrastructure, collaborating with software engineers, and ensuring system availability and security. Duties include infrastructure design, automation, monitoring, and incident response.
Top Skills:
AWSGoKubernetesPuppetPythonTerraform
Healthtech • Pharmaceutical • Telehealth
As a Senior Site Reliability Engineer, you will ensure the reliability and scalability of production systems, drive incident response, and collaborate with cross-functional teams on best practices for resilience and observability.
Top Skills:
AWSDatadogEksElasticacheGoPulumiPythonRdsRoute53S3Terraform
19 Days AgoSaved
Easy Apply
Easy Apply
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Own reliability, monitoring, and incident response for AI infrastructure; build automation and CI/CD tooling; manage Kubernetes/Docker production workloads; partner with infrastructure, security, and compliance; improve observability and documentation; develop internal full‑stack tooling in Go or Python.
Top Skills:
AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxLog AggregationNetwork SecurityPuppetPythonRubySaltTerraform
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
As a Senior Site Reliability Engineer, you will enhance platform reliability, lead incident management, and drive AI-driven improvements in operational workflows.
Top Skills:
Amazon Web ServicesDatadogDynamoDBEnvoyEvent Driven ArchitecturesGrpcHTTPIstioJSONKotlinKubernetesLaunchdarklyModern JavaMySQLProtocol BuffersTerraformVitess
Information Technology • Consulting
As a Senior Staff Site Reliability Engineer, you will lead the SRE team, advocate best practices, ensure resilience in cloud architecture, and mentor team members.
Top Skills:
ArgocdCircleCIGoogle Cloud PlatformKubernetesPulumiTerraformTypescript
Artificial Intelligence
Seeking an experienced Site Reliability Engineer to enhance platform reliability, scalability, and performance by balancing operations with long-term software engineering improvements.
Top Skills:
AIBashDatadogDockerElk StackFluxGoGrafanaKubernetesPrometheusPythonTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Fintech • Financial Services
The SRE Application Support Engineer is responsible for ensuring operational reliability, stability, and optimizing performance of production systems, managing outages, troubleshooting issues, and developing documentation and standards for production applications.
Top Skills:
AuroraAWSEc2EcsFargateGrafanaJavaKibanaLambdaPostgresPrometheusPythonS3Splunk
Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation
As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.
Top Skills:
AWSCloudFormationDatadogElkPrometheusTerraform
Fintech • Financial Services
The Director of Splunk Platform Engineering & SRE owns the enterprise Splunk platform, drives incident resolution, optimizes systems, and mentors engineers, focusing on automation and performance.
Top Skills:
AnsibleGitGoJavaKubernetesLinux/UnixMoogPrometheusPythonSplunk
eCommerce • Fintech • Payments • Software
The role involves ensuring software reliability and performance, managing incidents, developing infrastructure automation, and mentoring junior engineers within a platform team.
Top Skills:
AWSCloudFormationDatadogKubernetesOpentelemetryRubyRuby On RailsTerraform
Software
The role involves managing compute infrastructure for decentralized applications, requiring critical thinking, documentation skills, and experience in Kubernetes and blockchain management.
Top Skills:
BlockchainGitopsInfrastructure-As-CodeKubernetesProgramming Languages
Cloud • Information Technology • Consulting • Cybersecurity
Design, templatize and deploy scalable infrastructure in public clouds (AWS, GCP) using IaC (CloudFormation). Support architects, troubleshoot developer escalations, ensure compliance, and build stable platform services; work within agile teams to create configuration templates and automated deployments.
Top Skills:
AWSAws CloudformationAws EfsEc2GCPPythonRdsRuby
Software
Design, deploy, and automate VMware-based private cloud infrastructure across global datacenters. Administer Linux and Windows Server platforms, integrate Active Directory, manage storage, networking, ADCs (F5/AVI), and ensure availability, security, and compliance. Build automation (PowerCLI/Ansible/Python), participate in on-call rotations, document systems, and mentor junior engineers while driving infrastructure modernization and reliability improvements.
Top Skills:
Active DirectoryAnsibleAvi (Nsx Advanced Load Balancer)CentosCi/CdDnsF5 Big-IpGitNasPowercliPowershellPythonRhelSanTcp/IpUbuntuVcenter)Vmware Vsphere (EsxiVpnWindows Server
Cloud
The role involves building and managing observability infrastructure in GCP, automating deployments, and optimizing data processes for high reliability.
Top Skills:
GkeGoGCPGrafanaKubernetesOpentelemetryPythonRubySplunkTerraform
Cloud • Security • Software
As a Site Reliability Engineer, you will design, deliver, and maintain cloud-based infrastructure, ensuring resilient and secure enterprise software solutions through optimized CI/CD processes.
Top Skills:
Ci/CdDockerGCPGitGoKubernetes
Fitness • Healthtech • Information Technology • Payments • Software
The Site Reliability Engineer will enhance system reliability, manage cloud infrastructure, automate processes, support CI/CD pipelines, and troubleshoot production issues.
Top Skills:
AnsibleAWSBashChefDockerGitGitlabJenkinsKubernetesMySQLPostgresPythonSQL ServerTerraformVMware
Digital Media • Gaming • News + Entertainment • Sports
The Site Reliability Engineer II contributes to system stability and scalability by implementing automation, enhancing observability, and participating in incident response and root cause analysis.
Top Skills:
Argo CdAWSAzureBashCi/CdCloudFormationDatadogDockerEfkElkFluxGCPGithub ActionsGitlab CiGoGrafanaJavaScriptJenkinsKubernetesLinuxNew RelicPrometheusPythonSplunkTerraform
Fintech
The SRE/DevOps Engineer will enhance observability and monitoring tools, improve system reliability, conduct post-incident reviews, and collaborate with developers to optimize workflows and CI/CD processes.
Top Skills:
AWSAzureAzure BicepAzure DevopsChaos MeshCloud FormationDatadogDockerElasticsearchGCPGithub ActionsGitlab Ci/CdGrafanaGremlinJenkinsKafkaKubernetesTerraform
Let Your Resume Do The Work
Upload your resume to be matched with jobs you're a great fit for.
Success! We'll use this to further personalize your experience.
Top NYC Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs in NYC
.NET Developer Jobs in NYC
Android Developer Jobs in NYC
C# Jobs in NYC
C++ Jobs in NYC
DevOps Jobs in NYC
Engineering Manager Jobs in NYC
Front End Developer Jobs in NYC
Golang Jobs in NYC
Hardware Engineer Jobs in NYC
iOS Developer Jobs in NYC
Java Developer Jobs in NYC
Javascript Jobs in NYC
Linux Jobs in NYC
Perl Jobs in NYC
PHP Developer Jobs in NYC
Python Jobs in NYC
QA Jobs in NYC
Ruby Jobs in NYC
Sales Engineer Jobs in NYC
Salesforce Developer Jobs in NYC
Scala Jobs in NYC
Artificial Intelligence Jobs in NYC
Artificial Intelligence Engineer Jobs in NYC
AWS Engineer Jobs in NYC
Backend Engineer Jobs in NYC
DevOps Engineer Jobs in NYC
Director of Engineering Jobs in NYC
Engineering Jobs in NYC
Full Stack Engineer Jobs in NYC
Infrastructure Engineer Jobs in NYC
Lead Software Engineer Jobs in NYC
Network Engineer Jobs in NYC
Platform Engineer Jobs in NYC
Principal Architect Jobs in NYC
Principal Engineer Jobs in NYC
Principal Software Engineer Jobs in NYC
Quality Assurance Automation Engineer Jobs in NYC
Reliability Engineer Jobs in NYC
Senior Backend Engineer Jobs in NYC
Senior Cloud Engineer Jobs in NYC
Senior Full-Stack Engineer Jobs in NYC
Senior Platform Engineer Jobs in NYC
Senior Python Engineer Jobs in NYC
Senior Site Reliability Engineer Jobs in NYC
Solutions Architect Jobs in NYC
Solutions Engineer Jobs in NYC
Staff Engineer Jobs in NYC
Staff Software Engineer Jobs in NYC
Systems Engineer Jobs in NYC
Vice President of Engineering Jobs in NYC
All Filters
Total selected ()
No Results
No Results






.png)
.png)







.png)


.png)












