NVIDIA

Senior DevOps Engineer

Reposted 5 Hours Ago
In-Office
2 Locations
Senior level
Join NVIDIA's AIR team as a Senior DevOps Engineer, focusing on building SaaS/IaaS platforms for AI data centers, automating workflows, and managing secure cloud operations.
The summary above was generated by AI

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s motivated by extraordinary technology and amazing people. NVIDIA is looking for a highly motivated DevOps/SRE engineer to join the NVIDIA AIR team – the Digital Twin for Data Center Simulation web application. NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. To learn more, visit NVIDIA Air.

  

What you'll be doing:  

  • Joining the NVIDIA AIR team that is building the SaaS/IaaS platform for digital twins of AI data centers.

  • Owning the DevOps, infrastructure, and Site Reliability Engineering (SRE) requirements for AIR.

  • Focus on efficiency by automating repetitive workflows.

  • Working on a microservices-based architecture.

  • Deploying and troubleshooting non-disruptive cloud operations with an emphasis on secure production infrastructure.

  • Continuously evaluating existing systems and driving improvements.

  • Managing deployments and upgrades for operating systems, Kubernetes (k8s) clusters, and/or other orchestration tools.

  • Providing day-to-day support for engineering activities with CI/CD tools like Git and Jenkins.

  • Multitasking across different tracks to address evolving priorities efficiently.

What we need to see:  

  • BSc in Engineering, relevant certifications, or equivalent experience.

  • 5+ years of experience in complex microservices-based architectures

  • Highly skilled in Kubernetes and Docker

  • Experience in IaaS environments: deploying, configuring, and administering Linux-based bare metal servers

  • Strong networking background (VLANs, routing, VPNs)

  • Experience with relational databases (MySQL) and SQL.

  • Experience with modern deployment architectures for non-disruptive cloud operations, including blue-green and canary rollouts

  • Infrastructure as code (IaC) skills with tools like Ansible and Terraform

  • Expert in AWS

  • Knowledge of the best practices and discipline of managing and monitoring a highly available and secure production infrastructure

  

Ways to stand out from the crowd:  

  • Strong expertise in Infrastructure as a Service (IaaS)

  • Skills in Linux/Unix Administration 

  • Experience with Prometheus/Grafana.

  • Experience with APM tools like Dynatrace, Datadog, AppDynamics, New Relic, etc.

  • Experience implementing robust metrics collection and alerting infrastructure

  

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA is leading the way in ground-breaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Top Skills

Ansible
AWS
CI/CD
Docker
Grafana
Kubernetes
Linux
MySQL
Prometheus
Terraform
