Cohere AI

Research Internship (Fall, 2026)

Reposted 22 Days Ago
In-Office or Remote
Hiring Remotely in United States
Internship
The Research Intern will conduct machine learning research, focusing on large language models, and disseminate results through publications and code while collaborating with Cohere researchers.
The summary above was generated by AI

Who are we?

Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems.

We’re training and deploying frontier models for enterprises that are building AI systems. We believe that our work is instrumental to the widespread adoption of AI, and we are looking for folks who want to be part of that.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft.

We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!

Why this role?

You will have the opportunity to work with Cohere researchers and tools to design and implement novel research ideas and ship state-of-the-art models to production. We have openings in teams covering base model training, retrieval-augmented generation, data and evaluation, safety, and finetuning, to name a few. We are also open to intern applications in any research area relating to LLMs. The internship lets you broaden your research connections while gaining deep experience in a growing AI startup.

Please Note: To be eligible for a Research Internship, you must be currently pursuing a PhD in Machine Learning, NLP, or a related discipline. You need to be available for a full-time internship that lasts for 4-6 months.

As a Cohere Research Intern, you will:

  • Conduct cutting-edge machine learning research, building and training large language models. 

  • Focus on research projects aimed at expanding the frontier of knowledge in language modelling and associated areas such as evaluation, multimodal models, and optimisation.

  • Disseminate your research results through the production of publications, datasets, and code.

  • Contribute to research initiatives that have practical applications in Cohere’s product development. 

You may be a good fit if you:

  • Are currently pursuing, or in the process of obtaining, a PhD in Machine Learning, NLP, Artificial Intelligence, or a related discipline. We will also consider exceptional non-PhD candidates.

  • Are eligible for work authorization in the country of employment at the time of hire and maintain ongoing work authorization throughout the internship period. 

  • Have experience using large-scale distributed training strategies, data annotation and evaluation pipelines, or implementing state-of-the-art ML models.

  • Are familiar with autoregressive sequence models, such as Transformers.

  • Have strong communication and problem-solving skills with the ability to convey complex research findings clearly and succinctly. 

  • Have knowledge of programming languages such as Python, C, C++, Lua, or related languages.

  • Have knowledge of related ML frameworks such as JAX, PyTorch, and TensorFlow.

  • Have previous experience in building systems based on machine learning and deep learning techniques. 

  • Demonstrate passion for applied NLP models and products.

Preferred Qualifications:

  • Demonstrated expertise through publications in top-tier venues in fields such as machine learning, NLP, artificial intelligence, computer vision, optimization, computer science, statistics, applied mathematics, or data science.

  • Proven ability to tackle analytical problems using quantitative methodologies. 

  • Proficiency in handling and analysing complex, high-dimensional data from various sources.

  • Experience in applying theoretical and empirical research to real-world problem-solving.

Full-Time Employees at Cohere enjoy these Perks:
  • A weekly lunch stipend of $75/£75 or equivalent in your local currency.

  • Full health and dental benefits, including a separate budget for mental health.

  • RRSP matching, 401(k), Pension Scheme.

  • 100% Parental Leave top-up for up to 6 months, for either parent.

  • Annual enrichment benefits:

    Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.

    Education & learning stipend for conferences, courses, and coaching.

  • 6 weeks of paid vacation (30 working days!)

  • Budget for traveling to other offices if you are remote, plus an annual company offsite.

How and Where We Work:
  • Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.

  • For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.

  • For those not near an office: a co-working benefit so you can work alongside others in your city.

  • Everyone receives a $500 home office stipend to set up your workspace properly.

If any of the above doesn’t line up exactly with your experience, we still encourage you to apply.


We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider.

Cohere AI New York, New York, USA Office

New York, New York, United States
