Get the job you really want.
Top Reliability Engineer Jobs in NYC, NY
Cloud • Information Technology • Machine Learning
As a Storage Reliability Engineer, you will manage mission-critical storage systems, troubleshoot complex incidents, and improve infrastructure reliability through automation and tooling.
Top Skills:
CSI Drivers, Go, Kubernetes, NFS, S3
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will maintain service uptime, improve automation, and ensure infrastructure reliability while collaborating with engineering teams at Braze.
Top Skills:
Chef, Docker, Kafka, Kubernetes, Linux, MongoDB, Redis, Ruby on Rails, Terraform, Unix Shell
Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation
As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.
Top Skills:
AWS, CloudFormation, Datadog, ELK, Prometheus, Terraform
Fintech • Payments • Financial Services
The role involves improving system reliability, building automation, debugging issues, collaborating across teams, and mentoring engineers, focusing on creating a reliable financial ecosystem.
Top Skills:
AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kubernetes, Rust, Terraform
Information Technology • Software • Financial Services • Quantitative Trading
The Site Reliability Engineer will provide support and diagnose issues within a real-time, distributed environment, focusing on large-scale application and infrastructure management, with basic required skills in UNIX/Linux, networking, SQL, and scripting languages.
Top Skills:
Bash, Python, SQL, TCP/IP, UDP, Unix/Linux
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and improve the data export system, focusing on observability, reliability, and scalability while guiding junior engineers and adhering to best practices.
Top Skills:
Buildkite, Docker Swarm, Git, GitLab, Java, Jenkins, Kafka, Kotlin, Kubernetes, MongoDB, Postgres, Ruby, Sidekiq, SNS, SQS
Healthtech • Pharmaceutical • Telehealth
As a Senior Site Reliability Engineer, you'll ensure production system reliability, design resilient infrastructures, and improve operational excellence while collaborating with cross-functional teams.
Top Skills:
AWS, Datadog, EKS, ElastiCache, Go, Pulumi, Python, RDS, Route53, S3, Terraform
Reposted 9 Days Ago
Big Data • Cloud • Software • Database
Manage continuous delivery infrastructure for reliable code deployment. Collaborate with teams to streamline onboarding, support deployment systems, and participate in on-call rotations.
Top Skills:
Argo Workflows, Argo CD, AWS, Azure, Go, Google Cloud Platform, Kubernetes, Python
Reposted 9 Days Ago
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills:
Ansible, AWS, Azure, CloudFormation, GCP, Go, Terraform
Fintech • Software
The Principal Site Reliability Engineer - Cloud is responsible for managing and optimizing SaaS cloud infrastructure, ensuring performance, reliability, and security, while automating operations and collaborating within teams.
Top Skills:
.NET, Ansible, AppDynamics, AWS, Azure, Azure DevOps, C#, Datadog, Dynatrace, Harness, Idera, Java, Jenkins, Kubernetes, New Relic, Redgate, SolarWinds, SQL, Terraform
Reposted 11 Days Ago
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWS, GCP, Azure, MongoDB
Reposted 2 Days Ago
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Software Engineer on the SRE team, lead best practices adoption, mentor engineers, and improve system reliability and user experience through automation and collaboration.
Top Skills:
CDK, CloudFormation, Datadog, Go, JavaScript, Prometheus, Python, Terraform, TypeScript
3 Days Ago
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills:
AWS, Bash, Go, Kubernetes, Python, Slurm, Terraform
Artificial Intelligence • Cloud • Enterprise Web • Natural Language Processing • Software • App development • Automation
Design and implement large-scale distributed systems that integrate AI safely and reliably, focusing on infrastructure, observability, and security.
Top Skills:
Cloud Networking, Containers, Distributed Systems, Event Driven Runtimes, KEDA, Knative, Kubernetes, Multi Cloud Architecture, Operating Systems, Scalability
Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
As a Senior Reliability Engineer at Celonis, you will lead reliability efforts for cloud-based microservices, enhance performance, automate processes, and collaborate with engineering teams to improve system reliability.
Top Skills:
Argo CD, AWS, Azure, CI/CD, Datadog, GCP, GitHub Actions, Java, Kubernetes, Kustomize, Linux, Python, Spring, Terraform
Information Technology • Software • Financial Services • Big Data Analytics
SREs at Citadel focus on optimizing and maintaining system reliability, performance, and automation for investment applications, collaborating closely with teams.
Top Skills:
CI/CD, CSS, JavaScript, Python, React, SQL
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Seeking a Senior Software Engineer, Site Reliability to ensure system stability, scalability, and reliability, while optimizing AWS infrastructure using modern DevOps practices and tools like Terraform, Docker, and Kubernetes.
Top Skills:
AWS, CircleCI, Cronitor, Datadog, Docker, GitHub Actions, Jenkins, Kubernetes, MySQL, PagerDuty, React, Redis, Ruby on Rails, Sentry, Sidekiq, Terraform
Fintech • Machine Learning • Payments • Software • Financial Services
This role involves driving the product management strategy for Cyber SRE by embedding reliability in products, creating automated solutions, and enhancing cybersecurity practices while collaborating with engineering leaders.
Top Skills:
AI, OpenTelemetry
Cloud
As a Database Reliability Engineer, oversee MySQL database services, ensure performance and availability, coordinate infrastructure tuning, and enhance operational processes.
Top Skills:
Chef, Cloud SQL, Docker, Grafana, Kubernetes, Linux, MySQL, Postgres, RDS Aurora, Terraform
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
Own and improve critical production services end-to-end by writing production-quality code: instrumenting services, eliminating performance bottlenecks, building deployment and observability platforms, defining SLOs, running incident response and post-mortems, capacity planning and cost optimization, maintaining CI/CD, and embedding with product teams to design reliable systems.
Top Skills:
AWS, C++, CI/CD, Container Orchestration, Go, Observability Stacks, Python, Rust
Big Data • Cloud • Software • Database
Lead a 6–8 person team managing the Kubernetes fleet and core runtime components (CoreDNS, cert-manager, Gatekeeper). Define technical vision and roadmap, guide migration from Terraform to Operator-driven lifecycle management, perform hands-on architectural reviews and PR reviews, resolve operational incidents, and collaborate with engineering leaders and stakeholders.
Top Skills:
Alerting, AWS, Azure, cert-manager, Containerization, CoreDNS, Crossplane, Gatekeeper, GCP, Kubernetes, Load Balancing, Observability, Operators, Service Mesh, Terraform
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills:
Argo CD, CI/CD, Docker, GitOps, Go, Grafana, Honeycomb, Jenkins, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
Information Technology • Consulting
The Systems Reliability Engineer supports operations platforms by ensuring system reliability, automation improvements, and troubleshooting across diverse technologies.
Top Skills:
Ansible, Apica, Azure, Azure Virtual Machines, Azure SQL, Bash, Db2, Git, Grafana, Linux, Perl, Prometheus, Python, Ruby, Service Bus, Splunk, SQL, Sybase, Unix
Reposted 15 Days Ago
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWS, Fluent Bit, GCP, Jaeger, Kubernetes, Azure, Quickwit, Splunk, Vector, VictoriaMetrics
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Lead SRE responsible for architecting and automating fault-tolerant, scalable infrastructure across cloud and on-prem, driving deployment, monitoring, and performance tuning while mentoring engineers to improve reliability and SLAs.
Top Skills:
.NET, Ansible, AWS, AWS Greengrass, C#, Chef, Docker, Elixir, GCP, GitOps, Go, Java, Kubernetes, Linux, Nutanix, Python, Ruby, Terraform, vSphere