Haize Labs Logo

Haize Labs

Research Engineer

Reposted 17 Days Ago
Be an Early Applicant
Easy Apply
In-Office
New York, NY
150K-600K Annually
Mid level
Easy Apply
In-Office
New York, NY
150K-600K Annually
Mid level
Develop methods for optimizing AI applications, implement automated evaluation systems, and work with clients to adapt tools for various domains.
The summary above was generated by AI

Haize Labs gets LLM apps out of POCs and into production. We eliminate the risk and improve the reliability of LLM apps by haizing them -- i.e. rigorously, proactively, and continuously fuzz-testing them.

We are looking for Research Engineers to help develop our reliability platform, with a focus on:

  1. Data-efficient alignment of evaluation models
  2. Dynamic testing of AI applications
  3. Observability and anomaly detection 
  4. Discrete optimization (with applications in architecture search and automated prompting)

Our work is both intellectually stimulating and practically in high demand. Your work will result in net-new primitives, frameworks, and algorithms for developing robust LLM applications. You work will directly influence how LLM apps are tested, verified, and deployed everywhere. You will directly influence how the world responsibly uses LLMs.

Responsibilities
  • Develop optimization, synthetic data generation, and fuzzing methods for breaking LLM systems.
  • Implement complex automated evaluation models and systems.
  • Go from research idea to code within hours; and iterate quickly on experiments and data.
  • Work directly with customers to adapt our tools for different domains.
Qualifications
  • First-author publications in top-tier ML venues (NeurIPS, ICML, ICLR, and others).
  • Bias for action & experimentation over philosophizing (though sometimes good).
  • Not interested in printing papers for papers' sake.
  • Some production engineering experience (e.g. ML in an applied setting). No spaghetti research code!
  • Some familiarity with ideas from active learning, weak supervision, synthetic data, functional verification, reinforcement learning, reward modeling, automated evaluation. A subset of this is fine.
Annual Salary

$150,000 – $600,000 USD

Logistics
  • Location policy: 6 days a week, in person, in NYC.
  • US visa sponsorship: If you are exceptional, we will sponsor.
  • Compensation and Benefits: We provide generous salary, equity, and benefits
We're Not Here to Play Games.

We're not here to write GPT wrappers or get rich quick off the AI bubble. We're here to solve the hardest problem in AI: making it safe, reliable, and production-ready. 

Since our company's inception in 2024, we've amassed amazing customers like OpenAI, Anthropic, AI21, and several others. We've developed best-in-class tooling for evaluation, dynamic testing, red-teaming, observability, and continuous robustification. And we’re backed + advised by the founders of Cognition, Hugging Face, Weights and Biases, Nous, Etched, Okta, Replit and C-suite execs from Google, Stripe, Databricks, Robinhood, and more.

Our core team is exceptionally fit for this mission. We turned down Stanford PhDs, got into & rejected Y Combinator, wrote ML-guided matchmaking for 50,000+ students, built an educational nonprofit supporting 60 countries, and did some other cool things along the way. Our early hires include an MIT PhD with 21,000+ Physics/ML/Stats citations, a Datadog engineering manager who led their GenAI observability team, a Citadel quant with a huge open-source presence, and more.

We can only serve our mission with an incredibly high talent-density team. Come here to push yourself, learn fast, experience excellence, grow with each other, and pursue your life's work.

Top Skills

Automated Evaluation
Data Generation
Fuzz Testing
Machine Learning
Reinforcement Learning
HQ

Haize Labs New York, New York, USA Office

New York, New York, United States

Similar Jobs

4 Days Ago
In-Office
New York, NY, USA
200K-225K Annually
Mid level
200K-225K Annually
Mid level
Information Technology • Software • Financial Services • Big Data Analytics
Quantitative Research Engineers at Citadel's GQS develop software solutions, enhance trading systems, and collaborate with researchers to apply quantitative techniques.
Top Skills: C++Linux
4 Days Ago
In-Office
New York, NY, USA
200K-225K Annually
Mid level
200K-225K Annually
Mid level
Information Technology • Software • Financial Services • Big Data Analytics
Quantitative Research Engineers at Citadel develop software solutions using advanced quantitative techniques and technologies to optimize investment strategies, collaborating closely with researchers to enhance trading systems.
Top Skills: C++Linux
3 Days Ago
In-Office
New York, NY, USA
250K-350K Annually
Mid level
250K-350K Annually
Mid level
Information Technology • Software • Financial Services
The role involves designing, developing, testing, and deploying software solutions for automated trading systems, collaborating with quantitative researchers to prioritize projects.
Top Skills: C++Distributed ComputingMachine LearningNatural Language ProcessingNetworkingPlatform DevelopmentPythonRSystem DesignWeb Development

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account