NVIDIA

Senior Product Manager, Local AI and Agents for Enterprise

Reposted 17 Days Ago
In-Office
Santa Clara, CA
168K-328K Annually
Senior level
The Senior Product Manager will lead AI product strategies for local deployment on NVIDIA platforms, focusing on developers and enterprise needs.
The summary above was generated by AI

We are looking for a technical, hands-on Product Manager to lead our local AI product efforts for Linux and developers. Client AI is the technology platform on top of NVIDIA's client hardware (GeForce RTX, RTX PRO, DGX Spark, DGX Station, and N1X) that enables AI and agents, content creation, and developer workflows. This Product Manager will define how developers, researchers, and enterprise teams build, run, and deploy AI on NVIDIA client platforms running Linux, with a strong focus on enterprise.

Generative AI is moving from the cloud to the workstation and the edge. Developers want to prototype, fine-tune, and run frontier models locally. Enterprises want to deploy agents against their private data on-prem. Inference stacks like vLLM, SGLang, TensorRT-LLM, and PyTorch are becoming the default runtime for these workflows. This Product Manager will help NVIDIA win the Linux side of this shift — making our client platforms the best place to build and run modern AI.

What you'll be doing:

  • Define and lead the enterprise agent use case — understand how enterprises deploy agents on-prem, what they need from the platform, and where NVIDIA should invest.

  • Collaborate with Product Managers who are working on cloud inference backends (vLLM, SGLang, TensorRT-LLM, and PyTorch) to drive and prioritize requirements for local AI.

  • Own the product strategy and roadmap for the Linux developer experience on NVIDIA client platforms (DGX Spark, DGX Station, RTX PRO workstations, RTX Spark).

  • Research the developer and enterprise AI ecosystem: interview customers, build personas and user journeys, and map workflows across training, fine-tuning, inference, and agent deployment.

  • Work hands-on with the latest models, frameworks, and agent tooling so you can represent the developer's point of view in every decision.

  • Lead cross-functional teams — engineering, DevRel, marketing, partnerships — to ship features and grow adoption.

  • Influence NVIDIA's GPU, system, and software roadmaps based on what Linux developers and enterprise AI teams actually need.

  • Build product positioning, technical demos, and sales and partner enablement material for a developer audience.

What we need to see:

  • 8+ years of product management experience, with meaningful time on AI/ML, developer tools, or infrastructure products.

  • First-hand experience as a developer or engineer — you have shipped code in production and can debug a CUDA, PyTorch, or Docker issue alongside an engineer, not just manage around it.

  • Deep familiarity with modern AI workflows: training and fine-tuning, inference serving, agent frameworks, RAG pipelines, and evaluation.

  • Working knowledge of at least one major inference backend (vLLM, SGLang, TensorRT-LLM, or PyTorch-based serving).

  • Fluency in Linux as a development and deployment environment.

  • Strong written communication and the ability to translate technical depth for both engineers and executives.

  • Bachelor's degree in Computer Science, Electrical Engineering, or equivalent experience.

Ways to stand out from the crowd:

  • Prior role as an AI/ML engineer, inference systems engineer, or application developer building with LLM APIs and agent frameworks (LangChain, LlamaIndex, MCP).

  • Experience with model optimization — quantization, distillation, speculative decoding, KV-cache strategies.

  • Hands-on experience with CUDA, Triton, or low-level GPU programming.

  • Background in enterprise software, on-prem deployments, or private AI.

  • Open-source contributions to AI/ML, inference, or agent projects.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 258,750 USD for Level 4, and 208,000 USD - 327,750 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 8, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
