Percepta AI Logo

Percepta AI

Research Engineer / Scientist – Reinforcement Learning (RL)

Reposted 9 Days Ago
Be an Early Applicant
In-Office
New York City, NY, USA
Senior level
In-Office
New York City, NY, USA
Senior level
As a Research Engineer/Scientist in Reinforcement Learning, you will develop RL methods, maintain experimental infrastructure, and collaborate to enhance AI solutions in critical sectors.
The summary above was generated by AI
Who we are

Percepta’s mission is to transform critical institutions with applied AI. We care that industries that power the world (e.g. healthcare, manufacturing, energy) benefit from frontier technology. To make that happen, we embed with industry-leading customers to drive AI transformation. We bring together:

  • Forward-deployed expertise in engineering, product, and research

  • Mosaic, our in-house toolkit for rapidly deploying agentic workflows

  • Strategic partnerships with Anthropic, McKinsey, AWS, companies within the General Catalyst portfolio, and more

Our team is a quickly growing group of Applied AI Engineers, Embedded Product Managers and Researchers motivated by diffusing the promise of AI into improvements we can feel in our day to day lives. Percepta is a direct partnership with General Catalyst, a global transformation and investment company.

About the role

As a Research Engineer/Scientist (Reinforcement Learning) at Percepta, you will work at the intersection of RL research and real-world deployment. You will advance the frontier of capabilities through research on decision-making for critical industries. You will collaborate closely with our Embedded Product Managers (EPMs) and engineers to ensure that our solutions transform how companies operate.

Role and responsibilities
  • Identifying which real-world challenges are tractable for RL-guided decision making.

  • Develop RL methods to perform complex tasks in domains like planning, decision-making, or optimization.

  • Develop and maintain the experimental infrastructure that powers our research, from simulation environments and data pipelines to training and evaluation frameworks.

  • Conduct in-the-wild evaluations at scale that drive millions of dollars in value.

  • Partner with our applied AI engineers to transition successful research ideas into robust features of our Mosaic platform.

  • Communicate research outcomes to both technical and non-technical stakeholders, making sure everyone understands the “so what” of research and how to apply it.

Indicators of a good fit
  • Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience.

  • Have a track record of effective RL work.

  • Are motivated by impact in critical industries including healthcare, supply chains, energy, and finance.

  • Understand how to perform rigorous RL experimentation.

  • Enjoy extreme ownership.

  • Believe that AI can drive transformative change in critical industries.

The following list can be a sign that you might be a good technical fit:

  • High performance, large scale distributed systems.

  • Large scale LLM training or RL training.

  • Possess strong programming skills, especially in Python.

  • Implementing LLM post-training algorithms.

  • Experience with vLLM/SGLang, Ray, Kubernetes (or AWS EKS).

  • Experience with distributed checkpointing, multi-node, multi-gpu training, custom KV-caching.

  • Experience with asynchronous training and inference, either with VeRL, ROLL, SkyRL, AReal, or with RL libraries like CleanRL.

We're working against an incredibly ambitious mission. It won't be easy but it will likely be the most fulfilling work of your career. If that excites you, let's chat, even if you don't meet all of the qualifications above.

Our Values

Dream bigger: We have the unique privilege of taking on the most ambitious problems and we should chase them with optimism, responsibility, and genuine belief that we can make it happen. We have to embrace the hard things when no one else will.
Heart in the game: What we're doing matters and we have to give a shit. Internally, that means fixing badness when you find it. Externally, it means honoring the trust our customers place in us with their most important problems. This isn’t a 9-5, nor is it a job we’re ever going to monitor your hours. We promise to put work in front of you that matters and in return, we ask you to promise to care.
Win for the customer: Everyone is an engineer and the job of an engineer is to deliver outcomes, not outputs. Everything we do—the products we build, the partnerships we launch, the strategy we set—exists to make our customers successful. Delivery is the strategy.
Make the call: Organizations are only as strong as the pace at which they make decisions. Everyone at Percepta should feel empowered to commit and shape the ambiguity in front of them. But "make the call" cuts both ways: make the decision and make the phone call. High-agency decision-making only works with high-bandwidth communication and we commit to never operate in silos.
Intensity with kindness: We believe in excellence in execution, candor in feedback, ruthlessness in prioritization, and survivalist urgency. We also believe you don't need to be an asshole to deliver on any of this. The trust built through shared kindness and vulnerability is what makes the intensity sustainable.

Similar Jobs

5 Hours Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
151K-297K Annually
Expert/Leader
151K-297K Annually
Expert/Leader
Big Data • Cloud • Software • Database
Join MongoDB's Query Execution team to improve the database's core execution engine, build high-performance query features, and mentor team members. You will drive roadmap initiatives and ensure production-ready code for complex analytical workloads.
Top Skills: C++
5 Hours Ago
Remote or Hybrid
United States
112K-189K Annually
Senior level
112K-189K Annually
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Lead architectural design and evolution of MetLife's enterprise identity governance platform, ensuring compliance and performance while mentoring engineering teams.
Top Skills: Azure DevopsBeanshellJavaPowershellPythonSailpoint Identityiq
5 Hours Ago
Remote or Hybrid
United States
59K-99K Annually
Junior
59K-99K Annually
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Operational Risk Consultant supports issue management for Global Customer Service, evolves risk frameworks, collaborates with stakeholders, and implements improvements.
Top Skills: ArcherArtificial IntelligenceData AnalyticsGrc ToolsOpenpages

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account