Benchmark Education Company Logo

Benchmark Education Company

Lead Engineer, Site Reliability

Sorry, this job was removed at 08:08 p.m. (EST) on Wednesday, Aug 13, 2025
Be an Early Applicant
Remote
Hiring Remotely in USA
Remote
Hiring Remotely in USA

Similar Jobs

13 Days Ago
Remote
US
Senior level
Senior level
Big Data • Healthtech • Information Technology • Analytics
As a Lead Site Reliability Engineer, you'll design and manage scalable cloud infrastructure on GCP, optimize CI/CD processes, and ensure system reliability through observability and incident response, while mentoring others in a cross-product SRE group.
Top Skills: BashGitlab Ci/CdGkeGoogle Cloud PlatformJenkinsPythonSentrySumo LogicTerraform
10 Days Ago
Easy Apply
Remote
United States
Easy Apply
170K-200K Annually
Senior level
170K-200K Annually
Senior level
Software
The Lead Site Reliability Engineer will oversee the architecture and operational excellence of Mattermost's infrastructure, mentoring teams and driving strategic initiatives for performance and reliability in regulated sectors.
Top Skills: AWSGrafanaKubernetesPrometheusTerraform
2 Hours Ago
Easy Apply
Remote
United States
Easy Apply
142K-210K Annually
Junior
142K-210K Annually
Junior
Big Data • Fintech • Mobile • Payments • Financial Services
As a Software Engineer II in Backend, you will develop and launch backend systems, collaborate on tech projects, and support operational availability.
Top Skills: AWSKotlinKubernetesMySQLPython

Position Purpose:

We are seeking a Lead Software Engineer, Site Reliability to drive reliability, scalability, and operational excellence for our cloud-based platforms. This role will lead the design and development of reliability-focused features, mentor engineers, and champion best practices in Site Reliability Engineering.

As a technical leader, you will work closely with cross-functional teams to ensure our services are performant, reliable, and resilient, while continuously improving operational processes and tooling.

Responsibilities:

  • Lead the design and development of features that improve reliability, scalability, and observability.
  • Drive and foster a strong SRE culture across the organization, advocating for reliability as a shared responsibility.
  • Define and maintain SLIs, SLOs, and error budgets; embed reliability considerations early in the software development lifecycle.
  • Lead the evolution of observability practices and tooling to provide actionable insights into system health and performance.
  • Architect and implement solutions leveraging AWS services (e.g., CloudWatch, Lambda, EC2, S3, RDS) for operational efficiency and resiliency.
  • Own and improve the organization’s incident management processes, including incident response, postmortem analysis, and continuous learning to reduce incident recurrence.
  • Mentor and guide engineers on reliability engineering, cloud-native architecture, and operational excellence.
  • Automate infrastructure, deployments, and operational tasks to reduce toil and improve efficiency.
  • Collaborate with security and compliance teams to ensure secure, compliant, and resilient cloud operations.
  • Partner with cross-functional teams to align technical solutions with business objectives and ensure production readiness.

Qualifications:

  • 8+ experience in Site Reliability Engineering, DevOps, or Software Engineering roles with a reliability focus.
  • Strong expertise in AWS cloud services and architecture.
  • Proficiency in automation and scripting (e.g., Python, Bash).
  • Hands-on experience with observability tools (e.g., CloudWatch, Datadog, Prometheus, Grafana), networking, and scaling strategies.
  • Strong understanding of incident management, operational readiness, and performance tuning for large-scale distributed systems.
  • Experience mentoring engineers and leading technical initiatives.
  • Knowledge of infrastructure-as-code tools (e.g., Terraform, CloudFormation) and CI/CD pipelines.
  • Excellent collaboration and communication skills.

Preferred Qualifications:

  • AWS certifications (e.g., AWS Certified DevOps Engineer – Professional, AWS Certified Solutions Architect – Professional).
  • Experience leading on-call rotations and operational process improvements.
  • Familiarity with microservices architectures and container orchestration (e.g., Docker, Kubernetes/EKS).
  • Knowledge of PHP or Java a plus.

Key Attributes:

  • Leadership mindset with a strong focus on reliability and operational excellence.
  • Passion for mentoring and technical growth of team members.
  • Strong problem-solving and analytical skills.
  • Ability to work cross-functionally and influence best practices adoption.

ABOUT BENCHMARK EDUCATION COMPANY

Benchmark Education Company (BEC) is a leading publisher of core, supplemental, and intervention literacy and language resources in English and Spanish, with valid and reliable digital assessments that inform instruction. BEC is also a provider of exceptional professional development to educators.

BEC is recognized as a responsive publisher that offers equally rigorous and engaging digital, print, and hybrid learning materials grounded in the Science of Reading research. BEC monitors research outcomes carefully to create effective foundational resources that include strong decoding materials with systematic and explicit instruction and high-quality resources focused on language development and comprehension. BEC’s content-rich and authentic texts offer instruction in close reading and analysis, multiple perspectives, and authentic literature while building world knowledge and reflecting the individuality of every student in each diverse classroom.

Family owned and operated for more than 25 years, BEC is committed to partnering with educators to provide the best for all students through resources of exceptional quality, world-class professional learning, and effective and dedicated customer support.

Benchmark Education Publishing (BEC) and its affiliates are proud to be an Equal Opportunity Employer.

  

For further information, visit us at: https://www.benchmarkeducation.co

HQ

Benchmark Education Company New Rochelle, New York, USA Office

145 Huguenet St, New Rochelle, NY, United States, 10801

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account