FloSports

Staff Site Reliability Engineer

Reposted 8 Days Ago
Remote
Hiring Remotely in United States
Senior level
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
The summary above was generated by AI

FloSports is a world-class sports media company strategically positioned to be the essential destination for passionate sports fans, delighting them with live event coverage, breaking news, highlights, stats, rankings, and team and player profiles. We are growing Our Sports every day by continuing to invest in our ever-expanding ecosystem, which consists of over a dozen sport verticals and hundreds of streaming partners. FloSports is creating the home for college conferences/leagues and sports like grappling, hockey, track & field, racing, cheer, wrestling, and more, and we are looking for innovative and passionate people like you to help us!

THE ROLE: 

At FloSports, SRE is the team that acts as a force multiplier for our engineering organization. Our mission is to be the "wind in the sails" for our developers, enabling them to ship features faster, safer, and with more confidence. We are a "code-first" group that believes in automating away toil and solving problems with software. We don't click buttons in a console; we write code, build tools, and manage our infrastructure through GitOps.

As a Staff SRE, you will be a technical leader on a highly skilled and senior team. You will be a key driver of our architecture, reliability, and developer enablement strategy. This role requires a balance of high-impact individual contribution, technical leadership, and close collaboration with other Staff and Senior engineers to set the technical direction for the entire organization.

Our culture is built on shared responsibility for stability and on pragmatism. We are guided by a philosophy of simplicity (if you've read grugbrain.dev, you'll fit right in). We believe it's more fun to be competent, and we're looking for another expert to join our team.

RESPONSIBILITIES: 

  • Lead the technical architecture and execution of our landmark migration from a legacy GCP environment to a modern, scalable infrastructure on AWS EKS.

  • Architect, design, and drive our core infrastructure, defining the patterns for Terraform and GitOps that the rest of the organization will follow.

  • Champion and drive our SLO-driven culture, setting the strategy for how we define, measure, and implement SLOs for critical user journeys, guided by the four Golden Signals (Latency, Traffic, Errors, and Saturation).

  • Lead the design and development of critical tooling and automation in Node.js and Go to solve entire classes of problems for our developers.

  • Lead the architectural evolution of our in-house, K6-based load testing platform, ensuring it can scale to meet future product demands.

  • Act as a primary subject matter expert for our Istio service mesh, driving its architecture, adoption, and optimization.

  • Spearhead and own high-priority initiatives, including the development of agentic workflows and intelligent automation for SRE domains like proactive scaling and automated remediation.

  • Act as a technical leader by participating in our blameless on-call rotation, mentoring other engineers through complex incidents and ensuring all post-mortems lead to systemic, long-term improvements.

KNOWLEDGE, SKILLS AND ABILITIES:  

  • Extensive Experience: 8-10+ years in SRE, DevOps, or Software Engineering, with a proven track record of operating at a Staff level.

  • Proven Technical Leadership: You have a history of mentoring other senior engineers, influencing technical direction across multiple teams, and leading large-scale projects to completion.

  • Expert Coder: You are a polyglot with deep expertise in Node.js or Go and a history of building and maintaining critical automation and services.

  • Kubernetes Architect: You have an expert-level, architectural understanding of Kubernetes (EKS preferred), including networking, custom controllers, and control plane optimization.

  • Infrastructure as Code Expert: You are a Terraform expert who has designed and implemented large-scale, reusable, and secure IaC frameworks, not just consumed them.

  • Observability Architect: You have designed and implemented observability strategies from the ground up, leveraging platforms like Datadog to create actionable SLOs and provide deep system insight.

  • CI/CD Architect: You have designed, built, and scaled complex CI/CD systems (ideally with GitHub Actions and self-hosted runners) that are used by an entire engineering organization.

  • Expert Systems Thinker: You can decompose highly ambiguous, complex, cross-functional problems into solvable parts and lead the technical solution from concept to production.

BONUS:

  • Agentic Systems & Intelligent Automation: You have successfully designed and deployed agentic systems or other forms of intelligent automation to solve SRE problems and can speak to the tangible results.

  • Architectural Leadership in a large-scale cloud migration (e.g., GCP to AWS).

  • Performance Testing: Deep experience building or scaling custom load testing frameworks, especially with K6.

  • Istio Expertise: Deep, practical experience managing Istio in a large, multi-tenant production environment.

  • Familiarity with serverless architectures, especially SST.

  • Experience orchestrating the deprecation and removal of legacy configuration management systems.

OUR COMMITMENT TO DIVERSITY:

At FloSports, we are bonded by our passion for sports and our purpose to unite communities around experiences that finally give underserved sports the love they deserve. We recognize the need to build a company that seeks out, embraces, and celebrates our individual differences, ideas, and talent. FloSports is committed to the pursuit of a fair, equal and inclusive workplace where everyone is given the opportunity to grow to their fullest potential. 

 

OUR BENEFITS:

  • Recognized three years in a row as a Top Workplace by the Austin American-Statesman

  • Flexibility at work: you can take control of your professional and personal schedule

  • All-hands events hosted in beautiful Austin, Texas 

  • Annual equity awards for all top performers

  • Competitive and comprehensive medical, dental and vision plans

  • Peace of mind through company-paid short-term disability, long-term disability and life insurance

  • Generous 401(k) company match, vested immediately

  • Progressive parental leave policies

  • Flexible paid time off

  • Hackathons and a full calendar of team-building and social events

  • Company donation to youth teams and leagues that our employees coach

  • Stocked snack bar, catered lunch and breakfast tacos every week

Top Skills

AWS EKS
Datadog
GitHub Actions
Go
Istio
K6
Kubernetes
Node.js
Terraform


