The DevOps Engineer will design, build, and optimize cloud infrastructure for machine learning operations, automate workflows, and ensure system reliability.
Dear applicant, please note that this role is open to US-based candidates only.
We’re looking for a DevOps Engineer to help design, build, and optimize the cloud infrastructure powering our machine learning operations. You’ll play a key role in scaling AI models from research to production — ensuring smooth deployments, real-time monitoring, and rock-solid reliability across our Google Cloud Platform (GCP) environment.
You’ll work hand-in-hand with data scientists, ML engineers, and other DevOps experts to automate workflows, enhance performance, and keep our AI systems running seamlessly for millions of players worldwide.
What You’ll Do
- Manage, configure, and automate cloud infrastructure using tools such as Terraform and Ansible.
- Implement CI/CD pipelines for ML models and data workflows, focusing on automation, versioning, rollback, and monitoring with tools like Vertex AI, Jenkins, and Datadog.
- Build and maintain scalable data and feature pipelines for both real-time and batch processing using BigQuery, Bigtable, Dataflow, Composer, Pub/Sub, and Cloud Run.
- Set up infrastructure for model monitoring and observability — detecting drift, bias, and performance issues using Vertex AI Model Monitoring and custom dashboards.
- Optimize inference performance, improving latency and cost-efficiency of AI workloads.
- Ensure overall system reliability, scalability, and performance across the ML/Data platform.
- Define and implement infrastructure best practices for deployment, monitoring, logging, and security.
- Troubleshoot complex issues affecting ML/Data pipelines and production systems.
- Ensure compliance with data governance, security, and regulatory standards, especially for real-money gaming environments.
What We’re Looking For
- 3+ years of experience as a DevOps Engineer, ideally with a focus on ML and Data infrastructure.
- Strong hands-on experience with Google Cloud Platform (GCP) — especially BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub.
- Proficiency with Terraform (and bonus points for Ansible).
- Solid grasp of containerization (Docker, Kubernetes) and orchestration platforms like GKE.
- Experience building and maintaining CI/CD pipelines, preferably with Jenkins.
- Strong understanding of monitoring and logging best practices for cloud and data systems.
- Scripting experience with Python, Groovy, or Shell.
- Familiarity with AI orchestration frameworks (LangGraph or LangChain) is a plus.
- Bonus points if you’ve worked in gaming, real-time fraud detection, or AI-driven personalization systems.
Top Skills
Ansible
BigQuery
Cloud Run
Datadog
Dataflow
Docker
Google Cloud Platform
Groovy
Jenkins
Kubernetes
Pub/Sub
Python
Shell
Terraform
Vertex AI



