Midpage Logo

Midpage

RL Deep Learning Engineer

Posted 9 Hours Ago
Be an Early Applicant
In-Office
New York City, NY, USA
210K-250K Hourly
Senior level
In-Office
New York City, NY, USA
210K-250K Hourly
Senior level
Build and scale RL environments, evaluation harnesses, and data pipelines that convert millions of court filings into contamination-free benchmarks and training tasks. Integrate with partner harnesses and model APIs, collaborate with attorneys to translate legal workflows into scorable tasks, and run large-scale, sandboxed evaluations.
The summary above was generated by AI

About Midpage

Midpage is the search engine for legal data used by AI labs. We cover all US court data - 20M records. Over 300 law firms use our platform directly, 200k+ visitors read cases on our site every month, and five multibillion-dollar companies including Perplexity trust us as their legal data supplier. We're a team of 7 in Bowery, lower Manhattan. Our ARR has grown from $400k to $2M in the last 4 months.

The role

We're seeking an engineering generalist to build the first RL environments and benchmarks purpose-built for long-horizon legal reasoning—tasks where AI agents must search, read, analyze, and draft across real case filings, the same work that still takes teams of lawyers days to weeks. Frontier labs are will use these environments to make future models more legally capable and we need an engineer to own the infrastructure that makes it all work.

You'll design and scale the systems that turn millions of real court filings into verifiable evaluation environments and RL training tasks. You'll work directly with our attorneys, our data pipeline, and our partners at frontier AI labs.

What you'll do

- Build and maintain the evaluation harness and RL environment infrastructure—task runners, sandboxed environments, and scoring logic that can scale to thousands of parallel agents

- Own the data pipeline that turns freshly collected court filings into benchmark and RL tasks before they reach any model's training set

- Integrate with partner harnesses and model APIs to run contamination-free evaluations

- Collaborate with attorneys to translate legal workflows like cite checks, motion drafting, and precedent research into structured, scorable task formats using the Harbor spec

What we're looking for

- Strong generalist software engineering fundamentals. You've built, scaled, and maintained diverse systems in production

- You’ve built entire systems yourself, don’t require detailed specs or product managers, and take full ownership over your projects

- Deep experience with Python, bonus for TypeScript. Most importantly, you can work on hard engineering problems

- You should be kind, self-managing, and a clear communicator

- You make effect use of Cursor/Claude Code/Codex and are capable of writing good code without them

Bonuses but not requirements

- Familiarity with LLM evaluation. You get what makes a good rubric and why benchmarks leak

- Comfort working with messy, real-world document data (legal filings, PDFs, long-form text)

Similar Jobs

9 Hours Ago
In-Office or Remote
United States
125K-180K Annually
Mid level
125K-180K Annually
Mid level
Artificial Intelligence • Legal Tech • Software • Generative AI
Build and scale RL environments and evaluation harnesses for long-horizon legal reasoning. Own pipelines converting court filings into contamination-free benchmarks and RL tasks, integrate with partner model APIs, and collaborate with attorneys to create scorable task formats.
Top Skills: Claude CodeCodexCursorHarbor SpecLong-Form Text ProcessingModel ApisPdfsPythonReinforcement Learning EnvironmentsTypescript
An Hour Ago
Remote or Hybrid
2 Locations
105K-163K Annually
Senior level
105K-163K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Manage and grow strategic partnerships with Presidio and Trace3 by developing and executing joint GTM plans, coordinating cross-functional enablement and marketing, leveraging investments to maximize ROI, aligning with sales leadership, and using data-driven insights to drive partner-sourced revenue and brand elevation.
An Hour Ago
Remote or Hybrid
USA
123K-228K Annually
Senior level
123K-228K Annually
Senior level
Machine Learning • Payments • Security • Software • Financial Services
Lead and manage engineering teams building scalable, low-latency fraud detection systems. Drive system design, performance optimization, streaming/event-driven data platforms, Agile delivery, regulatory compliance, and talent development while partnering with product and risk stakeholders to improve automation and platform reliability.
Top Skills: Data Management Platform (Dmp)Distributed SystemsEvent-Driven ArchitectureHigh-Throughput SystemsLow-Latency SystemsRule EnginesStreaming

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account