hatch I.T.

Site Reliability Engineer (SRE)

Reposted 21 Days Ago
Remote
Hiring Remotely in USA
Mid level
The Site Reliability Engineer at CardioOne will enhance the reliability and performance of production systems, implement automation, and lead incident response efforts while collaborating with development teams.
The summary above was generated by AI
hatch I.T. is partnering with CardioOne to find a Site Reliability Engineer (SRE) to join their team. See details below:

About the Role:
CardioOne is seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, security, and performance of their production systems and services. The SRE will bridge the gap between software development and operations, implementing automation, monitoring, and best practices to enable rapid, reliable delivery of applications. You will report directly to the Senior Director of Engineering.

About the Company:
CardioOne partners with independent cardiologists to provide innovative solutions that improve patient outcomes and reduce costs. Their platform helps their physician partners thrive in today’s fee-for-service environment and prepare for success in value-based care. In February 2024, they partnered with WindRose Health Investors as well as top physician services and payor executives to grow their team and invest in their next phase of growth.

CardioOne offers a magnificent work environment, good working conditions, and competitive pay. They offer medical, dental, vision, and a 401k plan with a match to benefit eligible employees. They offer PTO (Personal Time Off) and sick time to full-time employees. They take pride in creating a culture of employee engagement that translates into an exemplary patient experience. Join them in their mission to positively impact US cardiology.

Responsibilities:

  • Ensure high availability, scalability, and performance of production systems.
  • Implement and maintain SLIs, SLOs, and SLAs for critical services.
  • Conduct capacity planning and performance tuning.
  • Automate infrastructure provisioning using IaC tools such as Terraform, Terragrunt, and Ansible.
  • Develop automation to minimize manual operations and improve deployment workflows.
  • Build CI/CD pipelines to support rapid and reliable deployments.
  • Design and maintain monitoring, logging, and alerting systems (Datadog).
  • Participate in on-call rotations and lead incident response efforts.
  • Perform root-cause analysis and develop postmortems to prevent recurring issues.
  • Manage cloud infrastructure (AWS, Azure) and container orchestration platforms (Kubernetes, ECS).
  • Optimize system architecture for reliability and fault tolerance.
  • Implement best practices for security, networking, and service resilience.
  • Work closely with development teams to design reliable microservices and distributed systems.
  • Advocate for SRE principles and drive operational excellence across engineering teams.
  • Mentor engineers on reliability practices, tooling, and automation strategies.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
  • 3–7 years of experience in SRE, DevOps, or Systems Engineering roles.
  • Strong proficiency with Linux systems and shell scripting.
  • Experience with cloud platforms (AWS, Azure).
  • Hands-on experience with Kubernetes/ECS and container technologies (Docker).
  • Proficiency in at least one programming language: Python or Java.
  • Experience with CI/CD pipelines and DevOps tooling.
  • Strong understanding of distributed systems, networking, and security fundamentals.
  • Strong analytical and problem-solving skills.
  • Excellent communication and cross-team collaboration.
  • Ability to thrive in fast-paced, high-stakes environments.
  • A mindset focused on continuous improvement and operational excellence.

Preferred Qualifications:

  • Experience with observability stacks (OpenTelemetry).
  • Knowledge of database management (PostgreSQL).
  • Experience with configuration management tools (Ansible, Chef, Puppet).
  • Familiarity with zero-downtime deployments and chaos engineering practices.

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory
