MLabs

Reinforcement Learning Engineer

Posted Yesterday
In-Office
New York, NY, USA
Senior level
Own design, deployment, and iteration of an RL-driven trading agent using real capital. Design reward functions with strict downside risk constraints, build evaluation and simulation frameworks, transition heuristic production systems to learning-based approaches, and serve as sole RL technical lead responsible for end-to-end data modeling, deployment, monitoring, and safety.
The summary above was generated by AI
Reinforcement Learning (RL) Engineer

Location: New York (Office)

On-site | Full-time

Compensation: Competitive

Our client is an elite development firm and a high-growth software company responsible for building the infrastructure behind the world’s largest crypto social networks and digital asset launchpads. Operating at the frontier of decentralized finance, the organization is composed of a mission-driven group of builders who prioritize speed, technical excellence, and talent density.

The organization is seeking a Reinforcement Learning (RL) Engineer to take end-to-end ownership of an RL-driven trading agent. This individual will manage real capital to increase trading volume and user participation within a high-velocity memecoin ecosystem. This is a high-stakes role designed for a "single-owner" expert who can bridge the gap between sophisticated modeling and live financial production. The successful candidate will transition existing heuristic-based systems toward learning-based approaches while enforcing rigorous risk parameters in a 24/7 global market.

Key Responsibilities

  • Autonomous Agent Development: Own the design, shipment, and iteration of an RL-driven trading agent that utilizes real capital to drive ecosystem engagement.
  • Objective Function Design: Design reward functions and policies that align strictly with product goals while implementing and enforcing absolute downside risk constraints.
  • Validation Frameworks: Build robust evaluation and validation frameworks, including simulation and offline analysis, to minimize reliance on live sequential testing.
  • System Transition: Manage the safe transition of existing heuristic-based production systems toward advanced learning-based approaches.
  • Technical Leadership: Serve as the sole RL expert within a small, high-caliber team, maintaining responsibility for the entire lifecycle, from data modeling and deployment to monitoring and safeguards.

Interview Process

  1. Recruiter / HR Call: Initial screening to discuss professional background, risk management philosophy, and cultural alignment.
  2. Technical Interview: A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
  3. Final Interview: A strategic discussion with leadership focusing on mission alignment, role expectations, and long-term objectives.

Requirements
  • Production Experience: Proven track record of deploying autonomous learning systems into production environments that directly controlled capital, pricing, traffic, or resources. Candidates must be able to demonstrate a deep understanding of system failures and subsequent remediation.
  • Risk Management: Hands-on experience designing and enforcing hard risk limits, such as capital caps, loss bounds, and circuit breakers, within a live financial or resource-based system.
  • Evaluation Loop Mastery: Experience building policy evaluation loops from scratch, including simulators, replay, counterfactuals, and shadow deployments, prior to live rollout.
  • Empirical Judgment: Ability to make and defend pragmatic technical tradeoffs (e.g., opting for heuristics over RL or bandits over deep RL) based on empirical results rather than theoretical preference.
  • Operational Independence: Demonstrated experience as the primary owner of a complex ML system within a lean environment, operating without the support of dedicated research organizations or external ML platforms.
  • Work Style: Comfort with an intense, fast-paced environment where expectations are high and impact is immediate. Our client operates primarily in-person.

Benefits
  • High-Stakes Autonomy: Unmatched ownership over an RL agent managing real-world capital and massive user traffic.
  • Scale Exposure: Direct involvement with systems operating at the absolute edge of crypto and financial technology scale.
  • Elite Talent Density: Opportunity to collaborate with a mission-driven group of engineers who value first-principles thinking.
  • Immediate Impact: The ability to ship fast and see real-world results and market reactions instantly.
  • Compensation: A competitive package including Base Salary plus Equity/Tokens.

Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.

Commitment to Equality and Accessibility:

At MLabs, we are committed to offering equal opportunities to all candidates. We ensure no discrimination, publish accessible job adverts, and provide information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process, or you would like to see the job advert in an accessible format, please let us know at the earliest opportunity by emailing [email protected].

MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd’s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting [email protected].
