Prime Intellect Logo

Prime Intellect

Research Engineer - Distributed Training

Reposted 5 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
Mid level
In-Office or Remote
2 Locations
Mid level
The Research Engineer will advance decentralized AI by optimizing distributed training, developing open-source tools, publishing research, and enhancing platform capabilities.
The summary above was generated by AI

Building Open Superintelligence Infrastructure

Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities
  • Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution

  • Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.

  • Contribute to the development of our open-source libraries and frameworks for distributed model training.

  • Publish research in top-tier AI conferences such as ICML & NeurIPS.

  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.

  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements
  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.

  • Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.

  • Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism

  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.

  • Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.

  • If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

Benefits & Perks
  • Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect.

  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.

  • Visa sponsorship and relocation assistance for international candidates.

  • Quarterly team off-sites, hackathons, conferences and learning opportunities.

  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

Top Skills

Ai/Ml
Ci/Cd
Deepspeed
Mosaicml
Pytorch Distributed
Ray

Similar Jobs

An Hour Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
150K-253K Annually
Senior level
150K-253K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Senior Software Engineer II, you'll develop solutions for Smart Trailers and Connected Equipment, focusing on full stack development and customer impact. Responsibilities include mentoring engineers, advocating for technical health, and driving customer-focused solutions.
Top Skills: GoGraphQLReactTypescript
An Hour Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
126K-253K Annually
Senior level
126K-253K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Senior Software Engineer, you'll develop route optimization, dispatch systems, and real-time tracking solutions, enhancing dispatcher tools and driving experiences.
Top Skills: GoGraphQLReactReact NativeTypescript
An Hour Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
85K-114K Annually
Senior level
85K-114K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The role involves managing customer deployments of IoT solutions, ensuring success through training, support, and communication, while building relationships with enterprise clients.
Top Skills: IotSaaS

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account