HiPeople

Applied AI Engineer – Systems & Reliability (remote/Berlin-based)

Posted 4 Days Ago
In-Office or Remote
Hiring Remotely in World Golf Village, FL
Mid level
Build and maintain evaluation, monitoring, and CI systems to ensure AI quality, reliability, and compliance. Track metrics, detect drift, improve prompting, model selection, and pipelines, productionize robust AI workflows, support audits (SOC 2), and act as a quality gate for AI-related releases.
The summary above was generated by AI

HiPeople is the AI Hiring Platform that takes care of screening, interviews, assessments, and references, so recruiting teams can focus on what matters most: people.

We work with some of the world's leading brands, including the NFL, Zapier, Celonis, and DAZN, and are backed by leading investors and operators such as Moonfire founder Mattias Ljungman, Capnamic, Cherry, André Christ (LeanIX, an SAP company), Mirko Novakovic (Founder Instana/Dash0), Micha Hernandez (Fiberplane), and others.

We’re hiring an Applied AI Engineer to build the backbone of how we ensure quality, reliability, and trust in our AI systems as we scale toward $10M ARR and beyond.

You’ll work directly with founders and play a central role in making sure our AI products are robust, measurable, and enterprise-production-ready. This role is for people who care deeply about quality, enjoy working on hard system problems, and want to build AI that actually works in the real world.

We are an extremely lean team and plan to reach $10M ARR with fewer than 20 people. Every hire materially changes the company. This role has direct exposure to founders and real responsibility from day one.

What you’ll do

Own evaluation systems and quality standards

  • Build and maintain evaluation pipelines for core AI workflows across screening, interviews, assessments, and references

  • Define metrics, benchmarks, and acceptance criteria for AI outputs

  • Track performance over time (quality trends, drift, regressions) and make results visible across the team

Drive continuous improvement of AI performance

  • Identify issues across prompts, workflows, and data pipelines using both quantitative analysis and deep dives into real cases

  • Design and implement improvements across:

    • prompting strategies

    • model selection, configuration, and fine-tuning

    • input data quality and preprocessing

    • orchestration and workflow design

  • Push new systems from “working” (80%) to reliable and high-quality (95%+)

Ensure reliability, monitoring, and stability

  • Build and improve monitoring for AI systems (e.g. dashboards, alerts, tracing)

  • Detect and prevent failure modes, breakdown risks, and performance degradation

  • Monitor usage, rate limits, and capacity to ensure stable operation at scale

Drive testing, CI, and safe shipping practices

  • Integrate AI and prompt testing into CI (e.g. regression tests, golden datasets, staging environments)

  • Define standards and tooling so product and engineering teams can safely ship without introducing regressions

  • Act as a quality gate for AI-related changes

Own AI system audits and compliance support

  • Prepare and support internal and external audits (e.g. SOC 2 and beyond)

  • Provide evidence, documentation, and artifacts for AI system behavior and controls

  • Translate audit findings into concrete improvements in systems and processes

Productionize AI workflows (not just prototype them)

  • Build and productionize AI workflows that meet defined quality and reliability standards

  • Support product and engineering teams in integrating AI cleanly into product logic and user experience

  • Ensure new AI capabilities are robust, measurable, and maintainable before release

What we are looking for
  • 100% alignment with our Ops Principles (if you feel this isn’t you, do not apply)

  • Excitement for building in Go

  • Experience working with AI/ML systems, LLMs, or data-intensive applications

  • High ownership mindset and attention to detail

  • Strong interest in quality, reliability, and system performance, not just building features

  • Ability to debug complex systems across prompts, models, and data pipelines

  • Clear communication and documentation skills

  • Comfort improving systems and processes, not just using them

  • Experience with evaluation methods, metrics, or experimentation is a strong plus

  • Familiarity with monitoring, CI/CD, and production systems is a plus

Background

Strong candidates often come from:

  • AI/ML engineering or applied AI roles

  • Backend or systems engineering roles with exposure to AI/ML

  • Data science roles with strong engineering and production experience

  • Other paths that demonstrate building and improving real-world systems with rigor

Logistics

This role is remote or on-site in our Berlin office. We do not offer any visa support for Germany at this time.

Benefits
  • Direct ownership of one of the most critical parts of the company: AI quality and reliability

  • Work closely with founders on core product and technical decisions

  • Competitive salary and meaningful stock options

  • Educational stipend to support ongoing learning and development

  • The best team to work with (true story!)

Process
  • Step 1: AI Application Screen (immediate)

  • Step 2: AI Recruiter Interview (right after successful AI Application Screen)

  • Step 3: AI Skills Assessment (right after successful AI Recruiter Interview)

  • Step 4: Interview with Co-founder

  • Step 5: Interview with the team (incl. Live Case Study)

  • Step 6: References + Offer

  • Duration: 1 week, end-to-end

🌈 We proudly believe in the power of diversity and inclusion. Diversity of thought fuels our success, which can only be achieved with a diverse team. We welcome people of any race, orientation, gender, religion, age, ethnicity, ability, neurotype, or identity; we value all uniqueness.
