Cresta

Senior Machine Learning Engineer

Reposted 24 Days Ago
Remote
Hiring Remotely in United States
Senior level
As a Senior Machine Learning Engineer, develop LLM evaluation frameworks and agentic AI workflows to extract insights from conversations and optimize customer experiences.
The summary above was generated by AI

Cresta unlocks the true potential of the customer experience, turning every conversation into a competitive advantage. Cresta’s unified AI platform combines conversational AI agents, real-time human agent augmentation, and comprehensive conversation intelligence to drive revenue and efficiency gains across every channel. The world’s leading companies, including United Airlines, Cox Communications, and Marriott, use Cresta to power world-class customer experiences every day. 

Born from the Stanford AI Lab, Cresta has raised more than $270 million from the world’s leading investors, including a16z, Greylock, and Sequoia. Cresta’s leadership includes some of the leading minds in AI today. Our CEO, Ping Wu, founded and led Google's Contact Center AI and Vertex AI platforms before joining Cresta to build the future of AI-driven customer experiences.

Over the next few years, AI is going to redefine how people all over the world interact with businesses every day. Come build that future at Cresta.

About the role:

Machine Learning Engineers at Cresta work across several high-impact AI initiatives. Final team placement is based on experience, strengths, and business needs.

Current focus areas include:

  • Agentic Assist: Lead and build next-generation agentic AI systems that augment contact center agents in real time. This track requires strong pre-LLM ML foundations, deep expertise in LLMs and modern prompting techniques, a rapid prototyping mindset, and a proven ability to translate cutting-edge research into scalable, production-grade systems.
  • Agent & System Quality: Design evaluation frameworks and improve the reliability, robustness, and performance of LLM-powered agents. This includes diagnosing and mitigating failure modes such as hallucinations, retrieval errors, tool misuse, context drift, prompt brittleness, and multi-step reasoning breakdowns, while defining measurable quality metrics (e.g., accuracy, faithfulness, task completion, latency, and cost) for complex, non-deterministic systems.
  • Insights: Architect and scale LLM and retrieval-augmented generation pipelines that ground models in enterprise data. This track focuses on building high-performance ML systems that process complex data, extract structured insights, and deliver real-time, actionable intelligence at scale.

Responsibilities:

  • Lead the design and development of Cresta’s next-generation AI Agents and Agentic Assist systems, defining system architecture and core modeling approaches.
  • Architect intelligent, multi-step agent workflows that combine real-time guidance, knowledge retrieval, reasoning, summarization, and automated actions into cohesive production systems.
  • Design, deploy, and optimize LLM-powered systems, including Retrieval-Augmented Generation (RAG) pipelines, multi-agent orchestration, and domain-adapted models.
  • Improve reasoning, planning, and tool-use capabilities in real-world AI applications.
  • Develop evaluation strategies for complex, non-deterministic systems, including offline benchmarking, online experimentation, and LLM-as-a-judge methodologies.
  • Diagnose and mitigate real-world failure modes such as hallucinations, retrieval errors, tool misuse, prompt brittleness, and multi-step reasoning breakdowns.
  • Define and measure quality metrics (e.g., accuracy, faithfulness, task completion, latency, cost, robustness) to improve system reliability and performance.
  • Optimize AI systems for scalability, latency, security, and cost efficiency in production environments.
  • Collaborate cross-functionally with product, frontend, and backend teams to integrate AI capabilities seamlessly into Cresta’s platform.
  • Mentor engineers, contribute to technical strategy, and help shape the roadmap for Cresta’s AI systems.

Qualifications We Value:

  • Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. preferred.
  • 5–8+ years of industry experience building and deploying machine learning systems in production, including significant experience working with LLMs.
  • Strong expertise in NLP, Generative AI, transformer architectures, embeddings, and retrieval systems.
  • Proven experience designing and deploying Retrieval-Augmented Generation (RAG) systems in enterprise environments.
  • Experience building and evaluating complex agentic or multi-step LLM workflows.
  • Strong knowledge of modern ML frameworks and tools (e.g., PyTorch, TensorFlow, Hugging Face) and distributed/cloud-based infrastructure.
  • Demonstrated ability to optimize real-time ML systems for performance, scalability, and reliability.
  • Strong technical leadership skills, with the ability to influence cross-functional decisions and raise the engineering bar.

Perks & Benefits:

We offer a comprehensive and people-first benefits package to support you at work and in life:

  • Comprehensive medical, dental, and vision coverage with plans to fit you and your family
  • Flexible PTO to take the time you need, when you need it
  • Paid parental leave for all parents welcoming a new child
  • Retirement savings plan to help you plan for the future
  • Remote work setup budget to help you create a productive home office
  • Monthly wellness and communication stipend to keep you connected and balanced
  • In-office meal program and commuter benefits provided for onsite employees

Compensation at Cresta: 

Cresta’s approach to compensation is simple: recognize impact, reward excellence, and invest in our people. We offer competitive, location-based pay that reflects the market and what each individual brings to the table.

The posted base salary range represents what we expect to pay for this role in a given location. Final offers are shaped by factors like experience, skills, education, and geography. In addition to base pay, total compensation includes equity and a comprehensive benefits package for you and your family.

Salary Range: $205,000–$270,000, plus equity

We have noticed a rise in recruiting impersonations across the industry, where scammers attempt to access candidates' personal and financial information through fake interviews and offers. All Cresta recruiting emails come from the @cresta.ai domain. Ignore any outreach claiming to be from Cresta that arrives from another source. If you are uncertain whether you have been contacted by an official Cresta employee, reach out to [email protected]
