Sumo Logic Logo

Sumo Logic

Staff Machine Learning Engineer - AI Tech Lead

Posted 2 Days Ago
Easy Apply
Remote
Hiring Remotely in United States
221K-260K Annually
Senior level
Easy Apply
Remote
Hiring Remotely in United States
221K-260K Annually
Senior level
As a Staff Machine Learning Engineer - AI Tech Lead, you will lead the design of AI systems for security operations, focusing on scalable architectures, AI safety, and mentoring. Responsibilities include overseeing technical evaluations, production workflows, LLM fine-tuning, and collaboration with teams to enhance AI capabilities.
The summary above was generated by AI
Staff Machine Learning Engineer – AI Tech Lead 

Location: USA

The proliferation of AI and machine log data has the potential to give organizations unprecedented real-time visibility into their infrastructure and security operations. With this opportunity comes significant technical challenges around ingesting, managing, and reasoning over massive, heterogeneous, high-velocity data streams at global scale.

As a Staff Machine Learning Engineer – AI Tech Lead, you will lead the design and delivery of the next generation of Agentic AI systems for Security Operation Center (Agentic SOC). You will evaluate, prototype, and productionize state-of-the-art agentic AI technologies and build scalable multi-agent architectures that reason over large-scale machine data to drive real-time detection, investigation, and response.

This is a highly technical leadership role with deep ownership of AI agent architecture, evaluation, LLM fine-tuning, and production AI infrastructure. You will help define the technical direction for Sumo Logic’s agentic AI platform and play a key role in bringing advanced AI capabilities to customers at global scale.

Responsibilities
  • Lead and partner with fellow leadership members and teams on technical evaluation and adoption of cutting-edge agentic AI platforms, including Anthropic (Claude), LangChain/LangGraph, AWS Bedrock, and other emerging agent frameworks.
  • Architect, prototype, and productionize multi-agent AI systems for Agentic SOC use cases, including detection, triage, investigation, and response workflows.
  • Own the design of core agent architecture components, including planning, execution, tool orchestration, memory, context engineering, and long-running agent workflows.
  • Lead AI agent evaluation systems, including offline and online evaluation pipelines, golden datasets, synthetic data generation, human- and LLM-based judging, and continuous quality monitoring.
  • Drive LLM fine-tuning and alignment efforts to improve domain-specific reasoning, accuracy, and reliability for security and observability use cases.
  • Design scalable LLMOps and AI agent infrastructure, including inference routing, latency optimization, cost control, and production observability for agent systems.
  • Partner with product, security, and data platform leadership and teams to deliver end-to-end AI agent capabilities from prototype to customer-facing production systems.
  • Lead and partner on technical direction and mentorship for AI engineers working on agentic AI and LLM systems.
  • Define and implement best practices for AI safety, reliability, evaluation, and monitoring in production agentic systems.
  • Operate as a senior technical owner in ambiguous problem spaces—setting technical direction, breaking down complex problems, and driving delivery across teams.
Required Qualifications
  • B.Tech, M.Tech, or Ph.D. in Computer Science, Machine Learning, Data Science, or a related technical field.
  • 5+ years of hands-on industry experience building, operating, and leading production ML/AI systems, with demonstrated technical leadership and ownership.
  • Strong foundation in machine learning, distributed systems, data pipelines, and large-scale system design.
  • Deep industry understanding of LLMs, prompt engineering, context engineering, agentic AI design patterns, and reasoning workflows.
  • Strong proficiency in Python and modern ML/AI ecosystems.
  • Experience designing and operating evaluation frameworks for ML/LLM systems (offline + online).
  • Proven ability to lead complex technical initiatives across teams and influence architecture decisions.
  • Excellent communication skills and ability to translate complex AI systems into business impact.
Desired Qualifications
  • Hands-on experience building and scaling agentic AI systems or multi-agent architectures in production.
  • Experience with modern agent frameworks such as LangGraph, LangChain, CrewAI, or similar.
  • Experience with major foundation model platforms such as Anthropic, OpenAI, AWS Bedrock, or Vertex AI.
  • Experience with LLM fine-tuning pipelines (SFT, RLHF/RLAIF, preference learning, domain adaptation).
  • Strong background in LLMOps, including inference optimization, latency/cost management, observability, and production monitoring.
  • Experience with ML infrastructure and tooling such as PyTorch, MLflow, Airflow, Docker, Kubernetes, and cloud platforms (AWS/GCP/Azure).
  • Experience applying AI/ML to security, observability, or large-scale log/telemetry data is a strong plus.
About Us

Sumo Logic, Inc. empowers the people who power modern, digital business. Sumo Logic enables customers to deliver reliable and secure cloud-native applications through its Sumo Logic SaaS Analytics Log Platform, which helps practitioners and developers ensure application reliability, secure and protect against modern security threats, and gain insights into their cloud infrastructures. Customers worldwide rely on Sumo Logic to get powerful real-time analytics and insights across observability and security solutions for their cloud-native applications. For more information, visit www.sumologic.com.

Sumo Logic Privacy Policy. Employees will be responsible for complying with applicable federal privacy laws and regulations, as well as organizational policies related to data protection.

The expected annual base salary range for this position is $221,000 - $260,000. Compensation varies based on a variety of factors, which include (but aren’t limited to) role level, skills and competencies, qualifications, knowledge, location, and experience. In addition to base pay, certain roles are eligible to participate in our bonus or commission plans, as well as our benefits offerings and equity awards. 

Must be authorized to work in the United States at the time of hire and for the duration of employment. At this time, we are not able to offer non-immigrant visa sponsorship for this position.

Top Skills

Airflow
Anthropic
AWS
Aws Bedrock
Azure
Crewai
Docker
GCP
Kubernetes
Langchain
Langgraph
Ml/Ai
Mlflow
Openai
Python
PyTorch
Vertex Ai

Similar Jobs

11 Days Ago
Remote
USA
163K-274K Annually
Senior level
163K-274K Annually
Senior level
Other • Real Estate • PropTech
The Senior Machine Learning Engineer will develop, deploy, and optimize machine learning applications, collaborating with cross-functional teams to enhance AI services.
Top Skills: AWSGenerative AiLarge Language ModelsMachine LearningNatural Language ProcessingPyTorchScikit-LearnTransformersXgboost
4 Days Ago
In-Office or Remote
13 Locations
140K-186K Annually
Senior level
140K-186K Annually
Senior level
Fintech • Payments
The Senior Machine Learning Engineer will design, implement, and maintain machine learning algorithms and systems. Collaborate with teams, manage version control, and integrate AI models into applications, emphasizing speed and regulatory compliance.
Top Skills: AWSAzureGCPGitNumpyPandasPythonPyTorchTensorFlowTerraform
14 Days Ago
Easy Apply
Remote
USA
Easy Apply
170K-185K Annually
Senior level
170K-185K Annually
Senior level
Digital Media • Other • Software • Analytics
As a Machine Learning Engineer, you will design, develop, and deploy machine learning systems, collaborate on AI features, and ensure reliable production performance.
Top Skills: KafkaKubernetesMlflowPythonPyTorchSnowflakeSparkSQLTensorFlow

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account