Realtor.com Logo

Realtor.com

Sr Site Reliability Engineer

Posted 2 Hours Ago
Be an Early Applicant
Hybrid
Austin, TX
Senior level
Hybrid
Austin, TX
Senior level
Senior SRE responsible for reliability, observability, and operational excellence of a large AWS/Kubernetes platform. Duties include maintaining EKS/Fargate infrastructure, monitoring SLIs/SLOs, implementing observability with NewRelic, driving cost optimization and FinOps practices, executing chaos engineering and incident response, contributing automation and IaC, and supporting security/compliance and developer experience.
The summary above was generated by AI

Recognized as the No. 1 site trusted by real estate professionals, Realtor.com® has been at the forefront of online real estate for over 25 years, connecting buyers, sellers, and renters with trusted insights and expert guidance to find their perfect home. Through its robust suite of tools, Realtor.com® not only makes a significant impact on the real estate industry at large, but for consumers, navigating the biggest purchase they will make in their life, by providing a user experience that is easy to use, easy to understand, and most of all, easy to make decisions.

Join us on our mission to empower more people to find their way home by breaking barriers to entry, making the right connections, and building confidence through expert guidance.

We are seeking a Senior Site Reliability Engineer to join our newly formed Operations Excellence organization, reporting to the Director, Operations Excellence. This role will contribute to the reliability, observability, and operational excellence of our platform infrastructure serving millions of users. As a Senior SRE, you will be a strong technical contributor who implements best practices, solves complex problems, and enables our 600+ engineers to deliver exceptional customer experiences. You will work on critical platform systems including EKS infrastructure, Skyway (CI/CD), Frontdoor (Tyk API Gateway), Pantheon (Apollo GraphQL Federation), and our observability stack, while contributing to chaos engineering practices and cost optimization initiatives with measurable ROI.

What You'll Do:

Platform Reliability & Infrastructure

  • Implement and maintain highly available AWS infrastructure including EKS clusters, Fargate (ECS), and multi-region architectures
  • Support reliability of critical services: Skyway (CI/CD), Frontdoor (Tyk), Pantheon (Apollo GraphQL), and supporting infrastructure
  • Monitor SLIs, SLOs, and error budgets for Tier 1/2/3 systems; participate in architectural reviews for reliability and cost-efficiency
  • Implement reliability patterns including circuit breakers, graceful degradation, and automated failover

Observability & Cost Optimization

  • Implement observability solutions using NewRelic for APM, distributed tracing, metrics, and logging for rapid troubleshooting
  • Build dashboards and alerts that reduce MTTD and MTTR; contribute to observability standards across teams
  • Identify infrastructure cost optimization opportunities and implement FinOps practices including rightsizing and resource lifecycle management
  • Support cost-conscious architecture decisions and CI/CD spend optimization (CircleCI, Argo CD)

Chaos Engineering & Incident Response

  • Execute chaos engineering experiments to identify system weaknesses; contribute to frameworks for safe production testing
  • Participate in game day exercises and disaster recovery simulations; create runbooks and automation for resilience
  • Participate in on-call rotation for critical systems; conduct post-incident reviews and implement improvements
  • Support incident response processes and contribute to System Health Scorecard

Technical Contribution

  • Contribute as a strong technical individual contributor to the Operations Excellence team
  • Collaborate with Platform Engineering, Quality Engineering, and product teams on reliability initiatives
  • Support security initiatives including AWS Secrets Manager migration and compliance requirements (SOC 2, PCI, GDPR)
  • Contribute to Developer Experience metrics and platform adoption goals
  • May provide technical guidance to junior team members

What You'll Bring:

  • 5+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering with demonstrated success improving system reliability
  • Bachelor’s degree or equivalent experience
  • 3+ years hands-on experience with AWS (EKS, EC2, RDS, S3, CloudWatch, IAM) and Kubernetes including cluster management
  • Proficient programming skills (Python, Go, or Java) with infrastructure automation and Infrastructure as Code experience (Terraform, CloudFormation)
  • Production experience with observability tools (NewRelic, Datadog, Prometheus, Grafana, Splunk) and distributed systems
  • Experience with CI/CD platforms and GitOps workflows (CircleCI, Argo CD, Jenkins); on-call rotation and incident response
  • Preferred: Exposure to chaos engineering tools, API Gateway technologies (Tyk/Kong), GraphQL federation (Apollo), cost optimization initiatives, FinOps principles

Technical Skills

  • Cloud & Infrastructure: AWS (EKS, Fargate, Lambda, VPC, Route53, CloudFront), Kubernetes, Docker, Istio Service Mesh
  • CI/CD & GitOps: Argo CD, CircleCI, Jenkins, GitHub Actions
  • Observability: NewRelic - APM, distributed tracing, metrics & logging; Splunk - logging
  • IaC & Automation: Terraform, CloudFormation, Helm, Kustomize, Python/Go/Bash
  • Platform Services: Tyk Gateway, Apollo GraphQL, AWS Secrets Manager, Vault
  • Incident Management: OpsGenie, PagerDuty, ServiceNow

Professional Qualities

  • Strong communication skills with ability to explain technical concepts to diverse audiences
  • Collaborative approach working across engineering, product, and business teams
  • Self-motivated with ability to solve complex problems within established practices and policies
  • Data-driven decision making with customer-centric approach and empathy for developer experience

How We Work:

We balance creativity and innovation on a foundation of in-person collaboration. For most roles, our employees work three or more days in our offices, where they have the opportunity to collaborate in-person, adding richness to our culture and knitting us closer together.

How We Reward You:

Realtor.com is committed to investing in the health and wellbeing of our employees and their families. Our benefits programs include, but are not limited to:

  • Inclusive and Competitive medical, Rx, dental, and vision coverage
  • Family forming benefits
  • 13 Paid Holidays
  • Flexible Time Off
  • 8 hours of paid Volunteer Time off
  • Immediate eligibility into Company 401(k) plan with 3.5% company match
  • Tuition Reimbursement program for degreed and non-degreed programs
  • 1:1 personalized Financial Planning Sessions
  • Student Debt Retirement Savings Match program
  • Free snacks and refreshments in each office location

Do the best work of your life at Realtor.com®

Here, you’ll partner with a diverse team of experts as you use leading-edge tech to empower everyone to meet a crucial goal: finding their way home. And you’ll find your way home too. At Realtor.com®, you’ll bring your full self to work as you innovate with speed, serve our consumers, and champion your teammates. In return, we’ll provide you with a warm, welcoming, and inclusive culture; intellectual challenges; and the development opportunities you need to grow.

Diversity is important to us, therefore, Realtor.com® is an Equal Opportunity Employer regardless of age, color, national origin, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, marital status, status as a disabled veteran and/or veteran of the Vietnam Era or any other characteristic protected by federal, state or local law. In addition, Realtor.com® will provide reasonable accommodations for otherwise qualified disabled individuals.

Realtor.com New York, New York, USA Office

New York, NY, United States

Similar Jobs at Realtor.com

5 Hours Ago
Hybrid
Senior level
Senior level
Big Data • Real Estate • Software
Lead analytics for lead intelligence and revenue optimization: develop forecasting, modeling, and simulation to prioritize lead inventory, recommend allocation rules, build KPIs, and partner cross-functionally to implement rules and drive revenue outcomes.
Top Skills: AWSPower BIPythonRShinySnowflakeSQLStreamlitTableau
17 Hours Ago
Hybrid
Senior level
Senior level
Big Data • Real Estate • Software
Lead design and delivery of scalable, reliable backend systems; provide architecture guidance across teams; mentor engineers; collaborate with product and design; prototype new technologies; use AI/LLMs to accelerate development while validating outputs; ensure high-quality code and engineering practices.
Top Skills: Ai Coding AssistantsAWSDatabase SystemsFastapiLlmsNumpyPandasPydanticPythonReact
Yesterday
Hybrid
Mid level
Mid level
Big Data • Real Estate • Software
As a Software Test Engineer at Realtor.com, you will enhance mobile client-side quality, execute testing strategies, automate tests, and collaborate with teams to ensure seamless user interfaces.
Top Skills: Android StudioAppiumCi/CdDetoxFlipperGraphQLJavaScriptReact NativeTypescriptXcode

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account