DoubleVerify

Sr. Site Reliability Engineer I

Posted 20 Days Ago
In-Office
New York, NY, USA
89K-178K Annually
Senior level
The role involves enhancing the reliability and performance of media measurement platforms, managing incidents, implementing observability practices, automating processes, and ensuring high availability of cloud and on-premises infrastructures.
The summary above was generated by AI

Hybrid (3 days per week in office) 

Who We Are 

DV is the leader in digital performance solutions, helping our advertiser and agency partners Verify the quality of their digital campaigns, Optimise to improve performance and Prove that they’re achieving their business outcomes, through unbiased third-party data and analytics. DV’s mission is to be the definitive source of transparency and data-driven insights into the quality and effectiveness of digital advertising for the world’s largest brands, agencies, publishers, and digital ad platforms. Since 2008, DV has helped hundreds of Fortune 500 companies gain the most from their media spend by delivering best-in-class solutions across the digital advertising ecosystem, helping to build a better industry. Learn more at www.doubleverify.com.

What You’ll Do

  • Build and maintain the reliability, scalability, and performance of our digital media measurement platforms
  • Implement observability best practices, including metrics collection, dashboarding, and alerting strategies that support proactive reliability improvements
  • Reduce MTTR for critical incidents through automation, improved observability, and proactive monitoring
  • Respond to incidents and drive them to resolution, managing Sev1/Sev2 situations
  • Monitor and maintain high availability infrastructure and services across GCP, AWS, OCI, and on-premises environments
  • Lead technical projects from planning through deployment, ensuring proper stakeholder communication and team enablement
  • Build and deploy automations to eliminate operational toil and improve efficiency across deployment workflows, validation scripts, and self-service capabilities
  • Leverage AI-assisted development tools to accelerate automation development and problem resolution
  • Build custom integrations and MCP servers for monitoring platforms to enable programmatic access and AI-driven analysis
  • Implement Infrastructure-as-Code using Terraform, Helm charts, Python scripts, and configuration management tools to ensure repeatable, version-controlled infrastructure deployments
  • Develop production automations for routine operational tasks, reducing manual intervention and accelerating task completion
  • Create and maintain documentation, runbooks, and SOPs in Confluence to ensure consistent incident response across the team
  • Participate in on-call rotations and post-incident reviews to minimize downtime and prevent recurrence

Required Experience & Skills

  • 4+ years in Site Reliability Engineering, DevOps, or related operational roles with proven experience in Linux/Unix systems administration
  • Proficiency in scripting and programming languages such as Python, Bash, or Go for automation and tool development
  • Strong experience with cloud infrastructure and services across GCP, AWS, and OCI, as well as container orchestration tools like Kubernetes
  • Expertise in monitoring and observability tools such as Prometheus, Grafana, Splunk, and Nagios
  • Hands-on experience with Infrastructure-as-Code tools like Terraform, Ansible, or Helm
  • Proven ability to develop and track SLIs, SLOs, and SLAs to drive reliability improvements

Technical Knowledge

  • Deep understanding of networking, DNS, load balancing, and CDN technologies
  • Familiarity with databases (SQL, NoSQL, Vertica, MongoDB, Snowflake) and data pipeline technologies
  • Knowledge of CI/CD pipelines, GitLab, and deployment automation
  • Experience with workflow automation platforms is a strong plus

Soft Skills & Mindset

  • Exceptional communication skills with the ability to collaborate across teams and explain technical concepts clearly
  • Proactive problem-solving approach with a focus on automation and continuous improvement
  • Ownership mentality — you take full responsibility for complex challenges and reliably deliver outcomes
  • Trailblazing spirit — innovative use of AI, automation, and new technologies to solve problems and drive improvements
  • Passion for mentorship and knowledge sharing, elevating the capabilities of the entire team

Preferred Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field
  • Industry certifications such as:
    • AWS Certified DevOps Engineer
    • Google Professional Cloud DevOps Engineer
    • Certified Kubernetes Administrator (CKA)
    • Terraform or Grafana certifications
  • Experience with AI-assisted development using tools like ChatGPT, Cursor, Glean, or Copilot
  • Familiarity with security best practices in cloud and containerized environments

The successful candidate’s starting salary will be determined by a number of non-discriminatory factors, including qualifications for the role, level, skills, experience, location, and internal equity relative to peers at DV. The estimated salary range for this role, based on the qualifications set forth in the job description, is between $89,000.00 and $178,000.00. This role will also be eligible for bonus/commission (as applicable), equity, and benefits.

The range above reflects the expectations laid out in the job description; however, we are often open to a wide variety of profiles and recognize that the person we hire may be more or less experienced than the posted job description suggests.

Not-so-fun fact: Research shows that while men apply to jobs when they meet an average of 60% of job criteria, women and other marginalized groups tend to only apply when they check every box. So if you think you have what it takes but you’re not sure that you check every box, apply anyway!

 
