Site Reliability and DevOps Engineering Lead

Merative

Posted Yesterday

Remote

Hiring Remotely in United States

131K-197K Annually

Senior level

Lead and grow a Platform/DevOps team to ensure a highly available, performant, secure clinical SaaS platform. Own SRE practices (SLIs/SLOs, error budgets), CI/CD and release automation, incident leadership, observability, capacity planning, vendor governance, and platform strategy. Drive automation, reliability engineering, and AI-enabled pipeline optimization while participating in on-call rotation and cross-team collaboration.

The summary above was generated by AI

Micromedex by Merative is a trusted clinical decision support solution used by clinicians in thousands of hospitals, health systems, payers, and government agencies worldwide. For over 50 years, we’ve delivered evidence-based drug, toxicology, and disease information to help clinicians make confident, timely decisions and educate patients at the point of care. Today, Micromedex is evolving. With a modernized homepage and AI-powered search, clinicians can now find precise answers faster—supported by rigorously validated, evidence-based content. Our portfolio includes drug reference, IV compatibility, pediatric dosing, toxicology databases, and integrated calculators, all accessible via web and mobile. By combining authoritative content with intuitive, AI-enhanced tools, Micromedex empowers healthcare organizations to improve medication safety, reduce adverse events, and deliver better patient outcomes.
Micromedex is seeking a highly skilled Platform Reliability & DevOps Engineering Lead who combines deep hands-on expertise in cloud services, infrastructure, and automation with a strong architectural understanding of distributed, high-availability systems.
You will lead the platform team, ensuring our mission-critical clinical platform is highly available (24×7), performant, scalable, and secure.
This role is both strategic and hands-on: you will define and drive the platform reliability and DevOps strategy, continuously improving system resilience and CI/CD capability, while partnering closely with engineering teams and vendors to embed operational excellence across the software lifecycle.
You will be accountable for the end-to-end reliability, operability, and delivery capability of the Micromedex platform, unifying Site Reliability Engineering, DevOps, and CI/CD ownership into a single platform function. This includes owning platform reliability outcomes, DevOps enablement, and delivery pipelines to support scalable, high-availability systems and faster, safer releases.
You are passionate about automation, proactive in addressing reliability and performance challenges, and committed to maintaining the trust of clinicians worldwide through resilient system design, strong operational discipline, and rapid incident response.

Responsibilities:

People & Team Leadership

Lead, mentor, and grow Platform / DevOps engineers
Build a high-performing Platform team
Drive accountability for platform reliability and delivery outcomes
Lead vendors to deliver capabilities in production

Production Engineering & Platform Operations

Ensure platform capabilities accelerate product delivery and remove bottlenecks
Define and enforce platform engineering standards and DevOps practices across all teams and vendors
Lead capacity planning, performance optimization, and cost efficiency
Define operational standards, runbooks, and reliability practices
Be accountable for platform reliability outcomes at the enterprise/product level

Platform Strategy and Leadership

Act as technical authority across platform, reliability, and delivery
Define platform strategy and roadmap
Govern delivery across internal teams and vendors

Platform Reliability Ownership

Own SLIs, SLOs, and error budgets
Lead resilience engineering, observability, and failure design
Drive proactive risk reduction and continuous improvement
Own incident management frameworks and continuous improvement

CI/CD and Release Engineering

Own end-to-end pipeline architecture and release automation
Standardize, secure, and fully automate pipelines
Drive continuous integration, delivery, and validation practices

Incident Leadership

Lead Sev1 response, escalation, and recovery
Own RCA and drive systemic fixes (not point fixes)

Introduce AI-enabled pipeline optimization and quality gates

Embed AI into monitoring, risk prediction, and CI/CD optimization
Drive automation to reduce operational toil and improve decision-making

Required Skills:

Bachelor’s degree in Computer Science, Engineering, or a related field.
6-10 years of hands-on experience in software operations, DevOps, and Site Reliability Engineering, including managing large-scale, mission-critical systems.
Clear and confident communication skills with ability to lead teams and collaborate effectively across engineering, product, and architecture teams.
Proven track record ensuring high availability and performance in production environments, with expertise in fault-tolerant, distributed system design.
Excellent understanding of modern software delivery pipelines and DevOps practices, including CI/CD, configuration management, and version control (Git).
Exceptional problem-solving skills, with experience diagnosing complex system issues under pressure and driving them to resolution.
Strong proficiency in at least one programming or scripting language (e.g., Python, Bash, or Java) for automation and tool integration.
Self-driven and proactive, with a passion for automating manual processes and continuously improving systems to enhance reliability and team productivity.

Key Skills and Experience:

Proven experience:

Releasing into and running mission-critical, high-availability SaaS platforms
Technically leading a Platform team and influencing stakeholders and vendors
Stakeholder engagement across Product, Architecture, and Operations

Deep expertise in:

Site Reliability Engineering (SLI/SLO, error budgets, incident management)
DevOps operating models and platform engineering (engineering transformation)
CI/CD architecture and release automation
Cloud, Systems & Infrastructure (DB2, Oracle, Infinispan, OpenLiberty)
Automation-first engineering with proven usage of AI (self-healing, triage)
Java application platforms and runtimes (performance tuning, troubleshooting, production operations)

Strong experience with:

Cloud platforms (Azure preferred)
Distributed systems and fault-tolerant architectures
Performance Tuning and Scaling
Database optimisation (DB2, Oracle, PostgreSQL)
Multi-region / active-active environments
Monitoring, logging, tracing frameworks
Experience embedding reliability practices into the SDLC

Hands-on with:

DB2, Oracle, Infinispan, OpenLiberty, Azure
Infrastructure as Code (Terraform or similar)
Containerisation and orchestration (Docker/Kubernetes)

Work Environment

This is a remote-first role, collaborating daily with global teams across engineering, product, architecture, and DevOps. The SRE/DevOps Lead Engineer will interact with colleagues across multiple time zones and must occasionally flex working hours to ensure smooth handoffs and incident coverage. Participation in an on-call rotation is expected as part of our commitment to 24×7 support of a clinical-grade platform. We are a fast-paced, collaborative environment that values continuous learning, proactive problem-solving, and the sharing of ideas. Minimal travel may be required for periodic team on-sites or company engineering summits.

Compensation

The salary range provided in this job posting is intended to reflect the general market value for the position. The actual salary offered may vary based on factors such as the candidate’s experience, qualifications, skills, and the specific requirements of the role. This range may also be subject to change as market conditions evolve. We encourage open communication throughout the interview process to discuss compensation expectations. For base-salary + commission sales roles, the range represents On-Target Earnings.

Min – Max: $131,381.86 - $197,072.78 (USD)

Benefits

The benefits described represent the current offerings at our organization; however, benefits are subject to change and may vary by location and employment status. We strive to provide a comprehensive benefits package that supports our employees’ health, wellness, and financial goals. Please note that benefits may be discussed in more detail during the hiring process.

Remote first / work from home culture
Flexible vacation to help you rest, recharge, and connect with loved ones
Paid leave benefits
Health, dental, and vision insurance
401k retirement savings plan
Infertility benefits
Tuition reimbursement, life insurance, EAP – and more!

It is the policy of Merative to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, Merative will provide reasonable accommodations for qualified individuals with disabilities.

Merative participates in the federal E-Verify program to confirm the identity and employment authorization of all newly hired employees. For further information about the E-Verify program, please visit http://www.uscis.gov/e-verify/employees
