Standard Template Labs is an AI-native startup reimagining the future of IT Service and Configuration Management. Backed by leading investors, we're leveraging AI to transform how enterprises manage and engage with their IT ecosystems.
About the Role
We’re looking for a Senior Site Reliability Engineer (SRE) to own the reliability, performance, and scalability of our AI-native platform. You’ll operate at the intersection of software engineering and infrastructure, building systems that keep our platform highly available, observable, and resilient in production.
This is a hands-on engineering role where you’ll write production code (primarily in Python) while also owning on-call operations and incident response.
Responsibilities
Reliability & Production Ownership
Own the availability, latency, and performance of critical production systems
Participate in and improve a 24/7 on-call rotation, responding to incidents and driving resolution
Lead incident response, root cause analysis (RCA), and postmortems
Design systems that fail gracefully and recover automatically
Write production-grade Python code to:
Automate infrastructure workflows
Build internal reliability tools
Improve deployment, rollback, and recovery systems
Eliminate manual operational work through automation and self-healing systems
Design and implement:
Metrics, logging, tracing
Alerting systems (reduce noise, improve signal)
Build dashboards and tooling to give real-time visibility into system health
Operate and improve systems running on:
Cloud platforms (AWS/GCP/Azure)
Containers (Docker, Kubernetes)
Scale systems to handle enterprise workloads and high-throughput traffic
Improve deployment pipelines, CI/CD, and infrastructure-as-code
Define and enforce:
SLAs / SLOs / error budgets
Conduct:
Load testing
Chaos testing
Build resilient systems that can tolerate failure
Partner with product and backend engineers to:
Improve system reliability
Embed observability into services
Help teams design production-ready systems from day one
Strong software engineering background (not just ops)
Proficiency in Python (required) for building tools and services
Experience operating production systems at scale
Experience with:
Kubernetes / Docker
Cloud platforms (AWS/GCP/Azure)
Distributed systems
Experience with:
On-call rotations and incident response
Monitoring tools (Grafana, Prometheus, etc.)
Debugging production issues under pressure
Experience with:
AI/ML systems or data pipelines
Event-driven architectures
High-availability systems
Build foundational product features for an AI-first enterprise platform
The opportunity to take ownership of critical systems that scale to millions of users
A culture that values craftsmanship, autonomy, and technical excellence
Competitive compensation, equity, and benefits package
Work from our Flatiron District, Manhattan office, where you’ll be side-by-side with the founding team in a supportive, collaborative setting. Our team works on-site five days a week, growing and building together, and the location is easy to reach with plenty of public transportation options.
As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind, whether based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status, or any other protected characteristic as outlined by federal, state, or local laws.
The reasonably estimated yearly salary for this role is $160,000 to $250,000 USD.

