Featherless AI Logo

Featherless AI

Machine Learning Engineer — Distillation

Posted 22 Days Ago
In-Office or Remote
Hiring Remotely in World Golf Village, FL
Mid level
In-Office or Remote
Hiring Remotely in World Golf Village, FL
Mid level
Design and implement knowledge distillation pipelines, optimize training and inference performance, and collaborate with research on production-ready ML models.
The summary above was generated by AI
About the Role

We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale.

This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production.

What You’ll Do
  • Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)

  • Distill large foundation models into smaller, faster, and cheaper models for inference

  • Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs

  • Collaborate with research to translate new distillation ideas into production-ready code

  • Optimize training and inference performance (memory, throughput, latency)

  • Contribute to internal tooling, evaluation frameworks, and experiment tracking

  • (Optional) Contribute back to open-source models, tooling, or research

What We’re Looking For
  • Strong background in machine learning or deep learning

  • Hands-on experience with model distillation (LLMs or other neural networks)

  • Solid understanding of training dynamics, loss functions, and optimization

  • Experience with PyTorch (or JAX) and modern ML tooling

  • Comfort running experiments on multi-GPU or distributed setups

  • Ability to reason about model quality vs. performance tradeoffs

  • Pragmatic mindset: you care about shipping, not just papers

Nice to Have
  • Experience distilling LLMs or large sequence models

  • Experience with inference optimization (quantization, pruning, kernels, etc.)

  • Familiarity with evaluation for language models

  • Open-source contributions or research publications

  • Experience in early-stage or fast-moving startups

Why Join
  • Work on core model quality and cost efficiency—not side projects

  • High ownership and direct impact on product and roadmap

  • Small, senior team with strong research + engineering culture

  • Competitive compensation + meaningful equity

  • Remote-friendly, async-first environment

Top Skills

Deep Learning
Distributed Computing
Jax
Machine Learning
Model Distillation
Multi-Gpu
Pruning
PyTorch
Quantization

Similar Jobs

21 Days Ago
Remote
United States
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The Senior Machine Learning Engineer will build and own production ML systems, manage end-to-end workflow, debug issues, and mentor others.
Top Skills: JaxPythonPyTorch
20 Days Ago
Remote
United States
Senior level
Senior level
Big Data • Analytics • Business Intelligence • Big Data Analytics
The Machine Learning Engineer will develop software solutions, optimize data pipelines, and deploy machine learning models using cloud platforms and best practices.
Top Skills: AzureAzuremlDatabricksDockerMlflowPythonSpark
24 Days Ago
Easy Apply
Remote
United States
Easy Apply
180K-225K Annually
Mid level
180K-225K Annually
Mid level
Information Technology • Software
As a Machine Learning Engineer, you will fine-tune transformer models, deploy machine learning solutions, and work with cross-functional teams on innovative projects.
Top Skills: PythonPyTorchTensorFlow

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account