Akamai Technologies Logo

Akamai Technologies

Principal Performance Engineer Lead

Posted 4 Days Ago
In-Office or Remote
Hiring Remotely in United States
169K-305K Annually
Expert/Leader
In-Office or Remote
Hiring Remotely in United States
169K-305K Annually
Expert/Leader
The Principal Performance Engineer Lead will optimize AI model inference performance, apply techniques for model optimization, and mentor engineers.
The summary above was generated by AI

Do you want to push the boundaries of AI inference speed and accuracy at global scale?

Are you passionate about optimizing how models perform in production serving environments?

Join the Akamai Inference Cloud Team!

The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design and operate AI platforms that enable customers to run models with unmatched performance, compliance, and economics. The Model Intelligence & Lifecycle team owns the end-to-end model lifecyclefrom validation and security scanning through quantization, optimization, and monitoring. We ensure every model meets rigorous standards for quality, safety, and performance.

Partner with the best

As an ML Performance Engineer, you will optimize inference performance across the Akamai Inference Cloud. Your focus will be at the intersection of speed and accuracyapplying techniques like quantization, speculative decoding, and hardware-aware scheduling to maximize throughput and minimize latency.
You will collaborate closely with hardware performance engineers to deliver end-to-end optimization.

As an ML Performance Engineer Principal Lead, you will be responsible for:

  • Applying and evaluating quantization, distillation, and pruning techniques to optimize model performance while preserving accuracy
  • Designing hardware-aware model placement and scheduling strategies to match models with optimal compute resources
  • Implementing and tune speculative decoding, KV-cache optimization, and batching strategies to improve inference throughput and latency
  • Building benchmarking and profiling pipelines to measure model-layer performance across architectures, hardware, and serving configurations
  • Mentoring and guiding engineers on the team through code reviews, design discussions, and technical problem-solving
  • Collaborating with hardware performance engineers to identify and resolve end-to-end performance bottlenecks across the inference stack

Do what you love

To be successful in this role you will:

  • 12+ years of relevant experience with a Bachelor's or Master's degree in Computer Science, Machine Learning, or a related field
  • Possess hands-on experience optimizing LLM inference performance (quantization, speculative decoding, model compression, etc.)
  • Have a solid understanding of transformer architectures and how design choices impact latency, throughput, and accuracy
  • Possess experience with inference serving frameworks such as vLLM, TensorRT-LLM, Triton, or similar systems
  • Be proficient in Python and C++ with experience profiling and optimizing compute-intensive workloads
  • Have familiarity with hardware-aware optimization, including GPU/accelerator scheduling and memory management trade-offs

Work in a way that works for you

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply. 
Learn what makes Akamai a great place to work

#AIC

Connect with us on social and see what life at Akamai is like!      

We power and protect life online, by solving the toughest challenges, together.

At Akamai, we're curious, innovative, collaborative and tenacious. We celebrate diversity of thought and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.

Working for you

At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:

  • Your health
  • Your finances
  • Your family
  • Your time at work
  • Your time pursuing other endeavors

Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.

About us

Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Join us

Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you!
Akamai Technologies is an Affirmative Action, Equal Opportunity Employer that values the strength that diversity brings to the workplace. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of gender, gender identity, sexual orientation, race/ethnicity, protected veteran status, disability, or other protected group status.
If no date is displayed, applications are being accepted on an ongoing basis until the job is filled.

Compensation

Akamai is committed to fair and equitable compensation practices. For US based candidates only - the base salary for this position ranges from $169,300 - $304,700/year; a candidate’s salary is determined by various factors including, but not limited to, relevant work experience, skills, certifications and location. Compensation for candidates outside the US will vary. The compensation package may also include incentive compensation opportunities in the form of annual bonus or incentives, equity awards and an Employee Stock Purchase Plan (ESPP). Akamai provides industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation (in the form of PTO), sick time, family friendly benefits including parental leave and an employee assistance program including a focus on mental and financial wellness; Eligibility requirements apply.

Top Skills

Ai Platforms
Akamai Inference Cloud
C++
Hardware-Aware Scheduling
Llm Inference
Python
Quantization
Speculative Decoding
Tensorrt-Llm
Transformer Architectures
Triton
Vllm

Similar Jobs

An Hour Ago
Remote or Hybrid
106K-160K Annually
Senior level
106K-160K Annually
Senior level
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Lead and manage complex tax compliance engagements, provide client guidance, perform detailed reviews, and mentor staff while seeking new business opportunities.
Top Skills: AdobeAxcessCasewareExcelGo File RoomPowerPointRiaWord
An Hour Ago
Remote or Hybrid
United States
142K-191K Annually
Senior level
142K-191K Annually
Senior level
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Cloud Engineering Manager will lead cloud strategy, implementation, operations, and manage cloud teams to ensure secure and scalable cloud environments.
Top Skills: Ci/Cd PipelinesInfrastructure-As-CodeAzureTerraform
3 Hours Ago
Remote or Hybrid
United States
55K-75K Annually
Entry level
55K-75K Annually
Entry level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
The role involves liaising between clients and data services, overseeing data conversion projects, and managing customer expectations during data migration efforts.
Top Skills: Applied Systems ProductsData Migration

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account