Team Lead, Engineering - Alerting Platform
About Datadog:
We're on a mission to build the best platform in the world for engineers to understand and scale their systems, applications, and teams. We operate at high scale—trillions of data points per day—providing always-on alerting, metrics visualization, logs, and application tracing for tens of thousands of companies. Our engineering culture values pragmatism, honesty, and simplicity to solve hard problems the right way.
The team:
As Datadog builds more and more products integrated together, they all flow eventually in the Alerting platform, they all have their own failure modes and it’s a challenge to build a reliable platform on top of all of these different pipelines. Everything has to be built with robustness in mind.
#monitor-intake is one of the 4 teams of the platform. They work on one of the most critical systems that allows the alerting platform to minimize the chances of issuing false positive notifications. In order to do so, the team has developed a system that ingests and stores observability data and makes it available for other alerting teams.
Besides, because in Datadog every product we create eventually will implement “Alerting” on top of their product, the team needs to work with many development teams to improve the platform ability to “onboard” a new product.
The opportunity:
As a team lead, the first layer of management at Datadog, you will be both a technical leader and a people manager.
You will empower engineers that work on one of the most critical systems. You'll help drive the architecture of our internal client and the back-end. You’ll join at an ideal time to make a big impact, the system is seeing very high growth, with many new features to build as well as a need for scaling up dramatically. You will be a key part of the success of the platform.
You will:
- Manage a team of 3-5 talented engineers, ensuring they deliver high quality, timely work and that they’re happy, motivated, and growing
- Write a significant amount of code, lead architectural decisions for new and existing services
- Drive the team to reach Operational Excellence with performance testing and release goals
Requirements:
- You have been building applications for 4+ years and know the systems you’ve worked on from top to bottom
- You have managed a team of software engineers
- You have architected, built, and operated distributed systems to solve problems at high scale
Bonus points:
- You have managed teams programming in Go and/or Python
- You’ve worked extensively with (or for) a major cloud provider
- You've worked at high scale with systems like Redis, Cassandra, Kafka
- You have experience working with geographically distributed engineering teams.
- You have deep experience with observability products
Why You Should Apply:
- Generous and competitive benefits
- New hire stock equity (RSUs) and employee stock purchase plan
- Continuous career development and pathing opportunities
- Product training to develop an in-depth understanding of our product and space
- Best in breed onboarding
- Internal mentor and buddy program cross-departmentally
- Friendly and inclusive workplace culture
#LI-Remote
This is a remote position
#LI-Remote This is a remote position
Equal Opportunity at Datadog:
Datadog is an Affirmative Action and Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.
Your Privacy:
Any information you submit to Datadog as part of your application will be processed in accordance with Datadog’s Applicant and Candidate Privacy Notice.