Software Development Engineer IV (Staff)
Overview
Who We Are
The Direct to Consumer Group (DTC) within Discovery is responsible for building the global streaming video platform that supports a broad collection of Discovery's diverse brands around the world including Discovery, TLC, Food Network, Investigation Discovery, Animal Planet, Science Channel, HGTV, Eurosport, MotorTrend, and many more.
DTC's many software engineering teams build applications for the web, mobile, tablets, connected TVs, consoles, and other streaming devices. Within DTC the Observability teams are responsible for the ingestion, processing, retention, metering, analysis, and visualization of operational data such as logs, metrics, and distributed traces. These technologists provide solutions to DTC's broader engineering community which helps them more effectively operate web, mobile, tablet, connected TV, console, and streaming devices and services.
We are a fast-growing, global engineering group crucial to Discovery's future.
What you'll do
We're hiring talented Staff Engineers that are passionate about using software-based approaches to solve complex data-driven challenges and automate those solutions. Within our organization, you'll lead efforts aimed at scaling our existing operational data offerings and establish the technical strategy for how we can better equip engineers and leaders with operational intelligence. You'll build a deep understanding of the Discovery+ digital streaming service and use that knowledge, coupled with your engineering, infrastructure, data and cloud knowledge, to optimize and evolve how we understand our technical ecosystem.
To be successful, you'll need to be deeply technical and capable of holding your own with other strong peers. You possess excellent collaboration and diplomacy skills. You have experience practicing infrastructure-as-code, standing up logging and metrics data pipelines, site reliability engineering, DevOps, and data lake management. In addition, you'll have strong systems knowledge and troubleshooting abilities.
An ideal candidate will be a creative thinker and passionate about technology as well as drive innovation and produce delightful experiences for customers. This person will demonstrate data-driven open minded decision making, have an insatiable curiosity, love to invent and innovate to solve difficult challenges in a fast-paced global environment.
If you love solving problems at scale, prefer to build scalable, reliable, and testable software to automate infrastructure management, are an ace troubleshooter, and are deeply technical, then this is the role for you!
Responsibilities
• Plan, lead, and execute complicated technical projects that interact with a wide variety of teams within the company.
• Work with internal customers and stakeholders to drive the design, development and support of our Observability platform.
• Work on providing a highly automated infrastructure for ingesting, processing, storing, retaining, analyzing and visualizing petabytes of operational data.
• Develop software and tooling to facilitate greater automation and operability of services.
• Make high impact decisions driving how and what software gets built. Your decisions are often right, and you are persuasive in delivering your suggestions and ideas to your team.
• Mentor senior engineers, overseeing their designs, code quality, and integration into a team. Your success is judged as much on your own productivity as on the positive impact you have on engineers around you.
• Provide guidance on design, coding, and operational best practices, and have a track-record of applying these best practices to software that you have worked on. You can propose and create best practices proactively where none exist.
• Utilize your deep experience and problem solving skills to help prevent and investigate production issues as well as participate in a shared on-call rotation.
Qualifications
• At least 9 years of overall experience in software, systems, and infrastructure
• At least 4 years of experience managing public cloud infrastructure, such as AWS, GCP, or Azure, including design, implementation, and maintenance of large scale environments.
• Strong software development skills in languages such as Go/Java/Python. Must have CS fundamentals and a track record of implementing highly reliable software.
• Strong knowledge and implementation history of Terraform, Ansible, Salt, Pulumi, CloudFormation and/or other similar tooling.
• Deep understanding of distributed systems, microservice architecture, logging/metric/tracing platforms (ELK, Prometheus, Zipkin, etc), and security.
• Experience with container orchestration technologies such as Kubernetes, OpenShift, DC/OS (Mesos / Marathon), Titus, AWS EKS, or Google GKE is strongly desired.
• Able to calmly and efficiently debug, troubleshoot, and resolve complex technical issues.
• Solid interpersonal skills conducive to a team environment.
• Self-driven & motivated, with a strong sense of ownership, work ethic, and a passion for problem solving.
• Exceptional written and verbal communication skills. Can effectively communicate vision and plan with the audience.
• Experience in working across product, engineering, and analytics teams to evaluate new ideas, discuss technical concepts, create scalable designs, implement new models, and make tradeoffs to remove roadblocks.
• BS/CS, MS/CS or equivalent.
• Must have the legal right to work in the United States