Bot Auto Logo

Bot Auto

Software Engineer - Machine Learning Infrastructure

Posted 11 Days Ago
Easy Apply
In-Office or Remote
Hiring Remotely in CA
Junior
Easy Apply
In-Office or Remote
Hiring Remotely in CA
Junior
The role involves designing and developing machine learning infrastructure for annotation, evaluation, and training models, focusing on scalable systems and efficient data workflows.
The summary above was generated by AI
Company Introduction

At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a start-up and the wisdom of seasoned experts, Bot Auto boasts a team that has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create miracles and propel the future of transportation. Join us and transform your dreams into reality.

We are seeking a highly skilled and motivated Software Engineer to design, develop, and scale our machine learning annotation, evaluation, and training infrastructure. This role is central to the quality and velocity of our perception and ML models — from curating and managing high-quality annotated datasets, to building robust evaluation pipelines that drive continuous model improvement. The ideal candidate combines strong systems engineering skills with a deep understanding of ML Workflows/Ops and large-scale data infrastructure.

Key Responsibilities

Machine Learning & Deep Learning Infrastructure

  • Evaluation Platform — Architect and own a scalable, end-to-end model evaluation platform for perception and prediction models central to autonomous driving. Define metrics, design for scale, and make results actionable for researchers.
  • Training Infrastructure — Partner with research scientists to optimize and scale distributed training workflows. Integrate experiment tracking and reproducibility into the model lifecycle from day one.
  • Dataset & Feature Store — Design and maintain a versioned, high-quality training data store that accelerates model development and supports rapid iteration.
  • ML Pipelines — Build automated pipelines spanning data preparation, model training, validation, and deployment — enabling fast experimentation and reproducible outcomes.
  • Annotation Platform — Contribute to tooling and infrastructure that powers high-throughput, high-accuracy data annotation at scale.
  • MLOps — Develop production ML services that treat models as products — with reliability, observability, and continuous improvement built in.

Data Infrastructure

  • Maintain and evolve a robust data storage and access layer (S3 data lake, Delta Lake) underpinning annotation, evaluation, and training workflows.
  • Build scalable, reliable data collection pipelines supporting diverse vehicle dispatch missions.
  • Develop foundational services and packages that provide clean, performant access to autonomous driving data across the stack.
Qualifications

Required:

  • Educational Background: Bachelor's or Master's in Computer Science, or equivalent practical experience.
  • Strong Programming Skills: Strong proficiency in Python; working knowledge of C++
  • ML/DL Infrastructure Experience — Demonstrated hands-on experience building or scaling at least one of the following in a production environment:
    • Evaluation platforms — automated model benchmarking, metric computation, and regression tracking across model versions.
    • Training infrastructure — distributed training pipelines, experiment tracking, and model lifecycle management (e.g. W&B, MLflow, ClearML).
    • Dataset curation & feature stores — versioned dataset management, data lineage, and tooling for high-quality training data at scale.
    • Annotation platforms — tooling or pipelines that support high-throughput, high-accuracy labeling workflows.
  • Distributed Systems — Strong experience with distributed computing and container orchestration — Kubernetes, Spark, or comparable frameworks.
  • Ability to operate independently: scope ambiguous problems, make sound architecture decisions, and drive them to completion.

Preferred:

  • C++ experience in performance-sensitive or safety-critical applications
  • Full-stack service development experience.
  • Prior work in autonomous driving or robotics.

Top Skills

C++
Kubernetes
Python
Spark

Similar Jobs

Yesterday
In-Office or Remote
USA
155K-190K Annually
Mid level
155K-190K Annually
Mid level
Robotics
As a Software Engineer in ML Infrastructure, you will develop scalable data pipelines, enhance data discovery, and collaborate with teams on data organization and ML features.
Top Skills: Data Annotation PlatformsLlmsPythonSQLVector DatabasesVlms
48 Minutes Ago
Remote or Hybrid
USA
86K-135K Annually
Senior level
86K-135K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design, coordinate, facilitate, and evaluate business continuity and crisis response exercises. Own exercise lifecycle from planning through facilitation, after-action reporting, corrective action tracking, and disaster recovery testing. Partner with cross-functional stakeholders to validate preparedness, identify gaps, and drive continuous improvement of resilience posture.
Top Skills: Business Continuity SoftwareChaos EngineeringCloud Native PlatformsDisaster Recovery TestingTableau
4 Hours Ago
Easy Apply
Remote
United States
Easy Apply
250K-300K Annually
Senior level
250K-300K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Lead the Discover engineering team to develop a recommendation engine for a new marketplace, focusing on machine learning systems and user experience.
Top Skills: Data EngineeringMachine LearningRecommendation Systems

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account