Virtu Financial

Machine Learning Engineer

Posted Yesterday
In-Office
New York, NY
200K-300K Annually
Senior level

Virtu’s Research Technology team is looking for an experienced Machine Learning Engineer to join a small group of technologists whose primary function is building the infrastructure that powers our quantitative researchers. This is a unique opportunity to work at the intersection of machine learning and systematic trading — building tools that directly determine how fast our researchers can move, and how effectively our GPU cluster translates into research output.

In this role, you will be responsible for the development of our ML research platform: the systems that manage data and compute, track experiments, and enable researchers to go from idea to result as efficiently as possible. You will work closely with quants and engineers alike and will play a central role in shaping how ML is done at the firm as we scale our capabilities. We mostly use Python, C++ and Java with a variety of open-source tools along with proprietary solutions.


THE ROLE

  • Design and build experiment tracking, job orchestration, and reproducibility infrastructure so researchers can iterate quickly, compare runs reliably, and recover from failures without losing work
  • Create tools for all stages of the simulation lifecycle, including historical back-tests and production monitoring, and add new features to our simulators
  • Own visibility into GPU cluster utilization — track allocation, surface bottlenecks, and ensure our compute investment is being used effectively
  • Diagnose and resolve performance issues across training pipelines: data loading throughput, storage I/O, GPU utilization, and inter-node communication in distributed training runs
  • Build and maintain data pipelines that move financial data from storage into training workflows efficiently, with strong guarantees on correctness and versioning
  • Develop feature storage and retrieval patterns that support fast, reproducible access to training data at scale
  • Work directly with researchers to understand friction in their workflows, and build solutions that reduce it — from tooling improvements to infrastructure changes
  • Collaborate with existing infrastructure engineers on capacity planning, cloud/on-prem tradeoffs, and tooling decisions — this is a collaborative environment, not a siloed one
  • Stay current with developments in ML infrastructure tooling and bring relevant ideas and tools into our stack where they create genuine value

THE CANDIDATE

  • 5+ years of experience in ML engineering, research infrastructure, or HPC environments
  • Strong Python engineering skills — you write clean, maintainable, well-tested code that other engineers want to build on. Exposure to C++ in a performance-sensitive context is a plus
  • Experience building or operating distributed training infrastructure, with working knowledge of how collective communication libraries (NCCL, Horovod, or similar) behave at scale
  • Practical experience with experiment tracking systems and strong opinions about what good research infrastructure looks like
  • Comfort working across the Linux systems stack — storage, networking, job scheduling — enough to follow a problem wherever it leads
  • Excellent communication skills and the ability to work closely with researchers and engineers across disciplines
  • Intellectually curious and self-driven — you proactively identify problems worth solving, not just problems you've been asked to solve

DESIRED, BUT NOT REQUIRED

  • Experience with on-prem compute environments and job orchestration tools such as Slurm
  • Familiarity with GPU profiling tools (Nsight Systems, PyTorch Profiler) and hands-on experience optimizing GPU memory or compute utilization
  • Experience with columnar data formats and high-performance data processing tools such as Parquet, Arrow, and Polars
  • Familiarity with workflow orchestration tools (Prefect, Dagster, or similar)
  • Prior experience in environments with high-stakes time-series data at scale, whether in quantitative finance, algorithmic trading, or another domain
  • Experience contributing to or extending open-source ML frameworks or infrastructure tooling

Salary Range: $200,000 - $300,000 (salary range is exclusive of bonuses, benefits or other categories of compensation)

Virtu Financial is an equal opportunity employer committed to a diverse and inclusive workplace. We welcome you for who you are and do not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Top Skills

Arrow
C++
Dagster
Horovod
Java
NCCL
Nsight Systems
Parquet
Polars
Prefect
Python
PyTorch Profiler
Slurm
HQ

Virtu Financial New York, New York, USA Office

1633 Broadway, 41st Floor, New York, NY, United States, 10019
