The Site Reliability Architect will lead technical strategy for reliability in a SaaS environment, mentor teams, and ensure system resiliency while defining performance indicators and optimizing cloud spend.
HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states.
The Site Reliability Engineering (SRE) Architect will lead our technical strategy for reliability and resiliency across our enterprise SaaS-based ecosystem. This role will influence the architecture of our environments, ensuring 99.9%+ availability through proactive cloud system design and advanced network implementation in a hybrid cloud/on-premises environment. As SRE Architect, you will provide technical mentorship across all SRE engineering levels as we grow the culture of our SRE practice globally.
To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
This is a fully remote opportunity for candidates located in the EST or CST time zones within the US only.
Essential Job Duties
- Architect self-healing, fault-tolerant systems with a resiliency-by-design intent, focusing on proactive readiness rather than reactive correction.
- Operate within a secure, high-volume, high-volatility application environment, using advanced networking and compute structures in cloud-hosted environments (AWS/GCP).
- Move the organization from "firefighting" to a proactive culture through habits and systems supporting feature flagging, production readiness reviews, architectural decision records, and chaos engineering.
- Support the incident management practice, mentoring SREs and software engineers alike in using our monitoring and observability toolsets for effective troubleshooting.
- Define SLIs, SLOs, and error budgets that balance feature velocity with platform stability, supporting a shift to service ownership.
- Promote an automation-first approach, using Terraform, CDK, and other cloud infrastructure-as-code toolsets to ensure repeatable, audit-ready environments.
Other Job Duties
- Other duties as assigned by supervisor or HHAeXchange leader.
Travel Requirements
- Travel up to 10%, including overnight travel.
Required Education, Experience, Certifications and Skills
- Bachelor's or Master's degree in Computer Science, Information Systems, or related field and applicable experience.
- 10+ years in SRE/DevOps, with 4 of those years in an enterprise SaaS environment.
- 4+ years in software development contributing to a SaaS-based, cloud-hosted product line.
- Proven track record in a distributed SaaS environment managing multi-cloud or multi-region workloads.
- Proficiency in modern cloud networking, including DNS, TCP/IP, Load Balancing, and Zero Trust security models.
- Strong coding skills in Go, Python, Java, C#, or others, to build internal reliability tools and automate complex operational workflows.
- Expert-level knowledge of Kubernetes (EKS/GKE) architecture, including multi-cluster management and stateful workloads.
- Ability to optimize cloud spend while maintaining high performance and reliability.
- Experience operating in a DevSecOps context with compliance guardrails (e.g., GDPR, HIPAA, HITRUST) across varied infrastructures.
- Willingness to explore and adopt AI tools responsibly to enhance productivity and innovation in your role.
The base salary range for this US-based, full-time, exempt position is $170,000-$185,000/yr, not including variable compensation. An employee’s exact starting salary will be based on various factors including, but not limited to, experience, education, training, merit, location, and the ability to exemplify the HHAeXchange core values.
This is a benefits-eligible position. HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401(k) retirement program with a Company-elected match, and other company-sponsored programs.
HHAeXchange is an equal-opportunity employer. The Company offers employment opportunities to all applicants and employees without regard to race, color, religion, national origin, sex, sexual orientation, gender identity or expression, age, disability, medical condition, marital status, veteran status, citizenship, genetic information, hairstyles, or any other status protected by local or federal law.
Top Skills
AWS
C#
CDK
GCP
Go
Java
Kubernetes
Python
Terraform
HHAeXchange New York, New York, USA Office
130 W 42nd St, 2nd Floor, New York, NY, United States, 10036