Get the job you really want.
Maximum of 25 job preferences reached.
Top Reliability Engineer Jobs in NYC, NY
Fintech • Payments • Financial Services
The Site Reliability Engineer will assist clients with Redline products, manage production environments, troubleshoot issues, and ensure automation and customer satisfaction.
Top Skills:
C/C++JavaLinuxPython
Software
The Staff Systems Engineer is responsible for architecting and maintaining VMware-based infrastructure, automating operations, and collaborating with cross-functional teams to enhance system performance and reliability.
Top Skills:
Active DirectoryAnsibleAutomation FrameworksAviAzure DevopsF5 Big-IpGitJenkinsLinuxPowercliPythonTcp/IpTerraformVMwareWindows Server
Financial Services
Design, develop, and deploy robust platform solutions while ensuring reliability, scalability, and security of the system. Collaborate with teams to enhance tooling and automation.
Top Skills:
GCPKubernetesTerraform
Software
The Infrastructure Reliability Engineer will ensure uptime for payment systems, evolve data infrastructure, reduce latency, and enhance reliability across the organization.
Top Skills:
AWSCachingDatabasesLoad BalancingNode.jsObservabilityRuby on RailsRust
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWSDockerGCPKubernetes
Artificial Intelligence • Big Data • Cloud • Software • Analytics • Infrastructure as a Service (IaaS) • Big Data Analytics
As an Airflow Reliability Engineer, you'll provide expertise in Apache Airflow, solve challenges for customers, and contribute to open-source projects, while enhancing your technical and customer-facing skills.
Top Skills:
Apache AirflowAWSAzureDockerGCPKubernetesPostgresPythonSQL
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills:
AWSChefDatadogGoGrafanaJavaPrometheusPuppetPythonSaltTerraform
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills:
AWSElasticsearchIstioKubernetesNatsNode.jsPostgresPythonReactTerraformTypescript
Reposted 20 Days AgoSaved
Easy Apply
Easy Apply
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills:
Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython
Software
As an AI Support Engineer, you'll manage support requests, resolve user issues, optimize ML models, and contribute to product development.
Top Skills:
Tensorrt
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills:
Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Marketing Tech
The Cloud Reliability Engineer develops, configures, and deploys cloud tools, enhances applications, ensures observability, and participates in on-call rotations.
Top Skills:
AWSCi/CdDockerGithub ActionsGoGoogle BigqueryGCPKubernetesLinuxPythonSQLTerraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills:
AWSAzureDatadogDockerEc2GCPGoKibanaKubernetesRubyTerraform
Healthtech
Build and harden AWS cloud environments and CI/CD pipelines, manage IaC and container platforms, own observability and incident response, enforce security and HA/DR, and automate operational tasks to support a regulated medical-imaging platform.
Top Skills:
Apache AirflowAWSAws Cloudwatch InsightsBashCdkCloudFormationDicomDirect ConnectDockerEcsEksGitGrafanaHl7IamKmsKubernetesPrivatelinkPrometheusPythonSbomTerraformVpcVpn
Music
The Senior Site Reliability Engineer at Spotify will manage cloud infrastructure, implement reliability strategies, mentor engineers, and enhance developer experience through AI-driven tooling.
Top Skills:
AWSGCPKubernetesPythonReactTerraformTypescript
Database • Analytics
Drive reliability, availability, scalability, and performance of ClickHouse Core. Build alerts, run incident response and blameless postmortems, debug production issues, submit fixes, lead chaos engineering and on-call/escalation processes.
Top Skills:
Clickhouse,Clickhouse Cloud,Sql,Shell,Python,C++,Aws,Azure,Google Cloud Platform
Fintech
Lead adoption of SRE practices to improve reliability, observability, automation, and incident response. Implement and maintain observability tooling, instrumentation, CI/CD, and infrastructure-as-code. Partner with developers, participate in on-call rotations, drive postmortems, and reduce operational overhead through automation.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Reposted 8 Days AgoSaved
Fintech
Support and evolve SRE practices: implement and maintain observability, monitoring, alerting, automation, and resilience for services. Participate in on-call rotations, incident response, postmortems, and collaborate with engineering teams to improve reliability and operational efficiency.
Top Skills:
AnthropicAWSAws EcsAws EksAzureC#DockerGitlab CiGrafanaLinuxOpenaiPrometheusPuppetPythonSplunkTerraformTypescriptWindows
Music
As a Site Reliability Engineer, you'll build and maintain cloud infrastructure for Spotify's AI-native developer platform, ensuring reliability and performance, while collaborating with senior engineers.
Top Skills:
AWSGCPPythonReactTerraformTypescript
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Site Reliability Engineer at Replit, you'll enhance system reliability through observability, automation, incident management, and performance optimization, serving millions globally.
Top Skills:
AnsibleDatadogGoGoogle Cloud PlatformGrafanaKubernetesPrometheusPulumiPythonTerraform
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Staff Site Reliability Engineer at Replit, you will ensure infrastructure reliability, drive automation, lead incident management, and mentor the engineering team while enhancing system performance and observability.
Top Skills:
DatadogGoGoogle Cloud PlatformGrafanaKubernetesOpentelemetryPrometheusPythonTerraform
Software
The Site Reliability Engineer will ensure the reliability and performance of Cloaked's services, contributing to the company's mission of protecting consumer data privacy.
Fintech • Software • Financial Services
The Site Reliability Engineer will manage AWS infrastructure, automate processes, and ensure system reliability while collaborating with engineering teams.
Top Skills:
AWSBashGoJavaKubernetesPythonTerraform
Other • Social Impact
The Senior Site Reliability Engineer is responsible for maintaining Wikimedia's infrastructure, improving reliability, automating tasks, and mentoring peers while participating in incident management.
Top Skills:
Apache Traffic ServerBashDebianEnvoyGoGrafanaHaproxyKubernetesNginxPrometheusPuppetPythonRubyVarnish
Top NYC Companies Hiring Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs in NYC
.NET Developer Jobs in NYC
Android Developer Jobs in NYC
C# Jobs in NYC
C++ Jobs in NYC
DevOps Jobs in NYC
Engineering Manager Jobs in NYC
Front End Developer Jobs in NYC
Golang Jobs in NYC
Hardware Engineer Jobs in NYC
iOS Developer Jobs in NYC
Java Developer Jobs in NYC
Javascript Jobs in NYC
Linux Jobs in NYC
Perl Jobs in NYC
PHP Developer Jobs in NYC
Python Jobs in NYC
QA Jobs in NYC
Ruby Jobs in NYC
Sales Engineer Jobs in NYC
Salesforce Developer Jobs in NYC
Scala Jobs in NYC
Artificial Intelligence Jobs in NYC
Artificial Intelligence Engineer Jobs in NYC
AWS Engineer Jobs in NYC
Backend Engineer Jobs in NYC
DevOps Engineer Jobs in NYC
Director of Engineering Jobs in NYC
Engineering Jobs in NYC
Full Stack Engineer Jobs in NYC
Infrastructure Engineer Jobs in NYC
Lead Software Engineer Jobs in NYC
Network Engineer Jobs in NYC
Platform Engineer Jobs in NYC
Principal Architect Jobs in NYC
Principal Engineer Jobs in NYC
Principal Software Engineer Jobs in NYC
Quality Assurance Automation Engineer Jobs in NYC
Reliability Engineer Jobs in NYC
Senior Backend Engineer Jobs in NYC
Senior Cloud Engineer Jobs in NYC
Senior Full-Stack Engineer Jobs in NYC
Senior Platform Engineer Jobs in NYC
Senior Python Engineer Jobs in NYC
Senior Site Reliability Engineer Jobs in NYC
Solutions Architect Jobs in NYC
Solutions Engineer Jobs in NYC
Staff Engineer Jobs in NYC
Staff Software Engineer Jobs in NYC
Systems Engineer Jobs in NYC
Vice President of Engineering Jobs in NYC
All Filters
Total selected ()
No Results
No Results
.png)


.png)
.png)
.png)



.png)















