SemiAnalysis

Technical Consultant

Posted Yesterday
In-Office or Remote
2 Locations
Entry level
Lead ClusterMAX consulting and technical due diligence on GPU/TPU cloud infrastructure. Translate benchmarking and TCO analysis into actionable recommendations, develop benchmarking methodologies, author technical research, and collaborate with hyperscalers, OEMs, and AI ecosystem partners.
The summary above was generated by AI

Employment Type: Full-Time
Work Setting: Onsite/Remote
Work Location: New York/San Francisco/Remote
Work Hours: Office Hours
Find out more here: https://semianalysis.com

About SemiAnalysis

SemiAnalysis is an independent research and analysis firm specializing in the semiconductor and AI industries. Our in-depth coverage spans the entire supply chain, from semiconductor fabrication processes to cutting-edge AI models, software, and infrastructure. We are recognized as the leading authority on the semiconductor supply chain, with the highest concentration of industry experts within one team and a deep-rooted passion for technical detail.
We’re a global team of over 50 analysts, each with extensive networks across the semiconductor supply chain and AI ecosystem, publishing industry-shaping articles while participating in 40+ conferences annually.
Our newsletter reaches more than 200,000 subscribers worldwide, including senior management and C-suite leaders at the leading semiconductor and AI companies.
We also offer three core products:

  • Industry Models – we develop and publish industry models on accelerator shipments, datacentre demand and supply, GPU total cost of ownership, and more. We work with hyperscalers, neoclouds, many of the world’s largest hedge funds, and government agencies.

  • Core Research – our public equity markets product, geared towards financial investors, distils our deep technical research and knowledge into key insights on technology and product trends.

  • Consulting and Technical Due Diligence – we conduct custom research and project work to guide key strategic and investment decisions for the largest private equity funds, leading venture capital firms, companies across the AI ecosystem, and government agencies.

Position Overview
We are seeking a Technical Consultant to join our team working on ClusterMAX™, the industry-standard GPU cloud rating system. We are hiring at all experience levels with competitive compensation.

Responsibilities

  • Lead ClusterMAX™ consulting engagements, including technical due diligence projects related to neoclouds, AI accelerators, AI infrastructure, AI labs, and adjacent ecosystems.

  • Translate ClusterMAX™ benchmarking and testing insights into actionable recommendations and investment decisions for clients.

  • Contribute to the development of next-generation benchmarking methodologies, TCO analysis frameworks, and future ClusterMAX™ research initiatives.

  • Collaborate with executives, engineers, and technical teams across major hyperscalers and neocloud providers, including Amazon Web Services, Microsoft Azure, Google Cloud, Oracle, CoreWeave, Nebius, Crusoe, Lambda, and Together.

  • Build and maintain relationships with AI accelerator manufacturers, OEMs, and ecosystem partners, including NVIDIA, AMD, Intel, Google, Amazon, Cerebras, Groq, Dell Technologies, Hewlett Packard Enterprise, Lenovo, and Cisco.

  • Strengthen relationships with AI labs, investors, startups, and technical communities to better understand industry requirements and operational challenges.

  • Author detailed technical research reports evaluating architecture design, benchmark performance, reliability, scalability, and operational usability of neocloud and AI infrastructure providers.

  • Stay informed on emerging trends and technologies through participation in major industry conferences such as NeurIPS, MLSys, NVIDIA GTC, OCP, SC, and Hot Chips.

Requirements

  • Strong understanding of ML frameworks such as PyTorch and JAX.

  • Familiarity with GPU and TPU cluster environments running orchestration platforms such as Kubernetes or Slurm.

  • Understanding of distributed storage technologies including Weka, VAST, Lustre, and S3-based storage systems.

  • Knowledge of high-performance networking technologies such as InfiniBand and RoCEv2.

  • Understanding of ML systems benchmarking and performance testing tools, including GEMM benchmarks, nccl-tests, vLLM, SGLang, fio, TorchTitan, Megatron, and related frameworks.

  • Experience at a hyperscaler, neocloud provider, server OEM, or AI accelerator company, or in a large-scale AI infrastructure environment, is preferred.

  • Ability to work proactively and independently within a globally distributed team environment.

  • Strong analytical, technical communication, and problem-solving capabilities.

Growth Areas

  • Develop deep expertise in AI infrastructure, neocloud ecosystems, accelerators, and large-scale ML system benchmarking.

  • Gain exposure to technical due diligence and investment decision-making processes across frontier AI and semiconductor markets.

  • Build relationships with leading hyperscalers, AI labs, infrastructure startups, accelerator vendors, and institutional investors.

  • Contribute to industry-recognized benchmarking methodologies and technical research publications.

  • Expand technical knowledge across distributed systems, networking, storage, AI infrastructure, and performance optimization.

  • Work directly with globally recognized companies shaping the future of AI compute and infrastructure.

  • Increase visibility within the AI and semiconductor ecosystem through conferences, technical collaborations, and published research.
