Dynatrace Logo

Dynatrace

Senior/Principal Engineer (m/f/x) for Evaluations Generative AI

Posted 16 Hours Ago
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in Boston, MA
74K-112K Annually
Senior level
Remote or Hybrid
Hiring Remotely in Boston, MA
74K-112K Annually
Senior level
Design and build evaluation and simulation systems for generative AI agents using Dynatrace observability data. Create large-scale simulation pipelines, define metrics/datasets/judging strategies, build developer CLIs, generate adversarial scenarios, and measure agent tool-use and failure modes. Prototype, run feedback cycles, set technical strategy, and mentor engineers.
The summary above was generated by AI
Your role at Dynatrace

Most AI developer tools operate without any knowledge of how software actually behaves in production. Dynatrace is in a unique position to change that.

We're looking for a Senior or Principal Generative AI Engineer to design and build the evaluation and simulation capabilities at the core of our product. You'll work across the stack, from CLI tooling that engineers run locally, to large-scale simulation pipelines, to LLM-as-a-judge evaluation frameworks running against real Dynatrace AI Observability data.

This role sits inside Dynatrace's Engineering organization and works closely with product, design, and the platform teams that power Dynatrace's AI-observability stack.

Your responsibilities:

  • Conduct research in the field of Generative AI
  • Design and build systems that let users replay and stress-test AI Agents at scale. Detect regressions across model versions, prompt changes, and data drift. Define the metrics, datasets, and judging strategies that make results trustworthy.
  • Build infrastructure to simulate multi-turn, tool-using agents in realistic environments. Generate adversarial scenarios, measure task completion, tool-use correctness, and failure modes. Help teams ship agents with confidence.
  • Own developer-facing CLIs that run evaluations on top of Dynatrace AI Observability data, from trace ingestion to judge configuration to reporting. Make it the tool AI engineers reach for first when debugging a production behavior.
  • Prototype quickly, run user feedback cycles, and ship to production
  • Define technical strategy for the team's AI systems, set architectural direction, and mentor other engineers
  • Collaborate with product and design to identify which developer problems are most worth solving
What will help you succeed
  • 5+ years (Senior) or 10+ years (Principal) of professional software engineering experience
  • Demonstrated experience shipping production systems that use LLMs, including prompting, tool calling, evaluation, and iteration
  • Strong foundation in at least one of: developer tooling (IDEs, compilers, static analysis, code intelligence), AI/ML engineering, or large-scale distributed systems
  • Hands-on experience with agentic patterns: planning, tool use, retrieval, memory management
  • Ability to evaluate and critique AI-generated output. You understand when a model is wrong, not just that it is.
  • Clear communication with cross-functional partners across product and engineering
  • Background in observability, APM, or infrastructure monitoring
  • Familiarity with engineering platforms at scale: CI/CD systems, developer portals, internal tooling
  • Hands-on experience with LLMs: prompt engineering, evaluation frameworks (e.g. LLM-as-a-judge, golden datasets, pairwise comparisons), or agent frameworks.
Why you will love being a Dynatracer
  • Dynatrace is a leader in unified observability and security.
  • We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance.
  • Our employees work with the largest cloud providers, including AWS, Microsoft, and Google Cloud, and other leading partners worldwide to create strategic alliances.
  • You'll get to work at the forefront of innovation with Dynatrace Intelligence—the industry's first agentic operations system. Bringing together deterministic and agentic AI, it helps teams understand what's happening, why it matters, and what to do next— automatically.
  • Over 50% of the Fortune 100 companies are current customers of Dynatrace.
Compensation and Rewards
  • We offer attractive compensation packages and stock purchase options with numerous benefits and advantages.
  • Due to legal reasons, we are obliged to list a salary range for this position, which is €74,000 up to €112,000 gross per year based on full-time employment (38.5 h/week). We’ve listed the salary range for transparency, but if your experience and skills bring unique value, we’d still love to hear from you—please apply even if you’re outside the range.
Equal Employment Opportunity

Dynatrace provides equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or any other protected characteristic. We actively foster an inclusive workplace that celebrates differences and promotes accessibility, collaboration, and growth for all.

Similar Jobs at Dynatrace

4 Hours Ago
Remote or Hybrid
United States
146K-220K Annually
Senior level
146K-220K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Design, build, and ship production-grade agentic AI systems that connect code context with runtime observability. Implement end-to-end LLM systems (prompting, tool calling, retrieval, memory, agents), define evaluation metrics and datasets, integrate with development workflows and Dynatrace, set technical strategy, mentor engineers, and own monitored production systems.
Top Skills: Agentic Ai/AgentsAWSDynatraceGoGCPIdesLarge Language Models (Llms)Memory ManagementAzurePromptingPythonRetrieval (Rag)Static AnalysisTool Calling
16 Hours Ago
Remote or Hybrid
61K-92K Annually
Senior level
61K-92K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Lead development of OpenTelemetry ingest for GenAI workloads, validate integrations across AI SDKs and frameworks, align auto-instrumentation with OneAgent, drive OpenTelemetry GenAI semantic conventions, prototype and ship production features, set technical strategy, and mentor engineers.
Top Skills: Agent FrameworksAgentic PatternsAWSAzureDynatrace OneagentGCPLlmsLogs)MetricsOpentelemetryOpentelemetry Genai SigTelemetry (Traces
Yesterday
Remote or Hybrid
74K-74K Annually
Senior level
74K-74K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Build high-performance UI components using React, contribute to product design, collaborate in a team environment, and lead technical initiatives.
Top Skills: JavaScriptReactTypescript

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account