
NVIDIA is on the journey to build the best cloud offering for AI workloads and to bring its latest GPU technology to our clients as a set of managed services under the DGX Cloud umbrella. We want to be able to innovate on behalf of our clients and provide an easy, no-hassle way of using the latest and greatest NVIDIA products through scalable managed self-service APIs. We are looking for a Cloud Platform Engineer to drive the technical design and build foundational elements of our high-performing cloud services for Artificial Intelligence and high-performance computing. This is a unique opportunity to be a founding member of a team building at the intersection of highly scalable, fault-tolerant cloud services and AI.

If you are passionate about IaC and can argue why declarative infrastructure is the way to go, can explain a Kubernetes PodDisruptionBudget (PDB) to your family in under 5 minutes, or have always felt that Kubernetes is great but not the ultimate goal, and have wanted to extend it and turn it into the distributed operating system for AI, you are a perfect fit to join our team!

What you'll be doing:

As a part of the service team, build and design platforms for DGX Cloud services

Figure out how to take the best from HPC and Kubernetes and help us build a unified platform

Work within a team of software engineers and product people, as well as with engineering teams across all of NVIDIA, on DGX Cloud AI Compute services

Write IaC code, work on Kubernetes, and help the team to design and implement release pipelines

Collaborate to understand how to make the best use of GitOps and Pipelines

What we need to see:

BS in Computer Science, Information Systems, Computer Engineering or equivalent experience

Solid technical foundation in distributed computing and storage, including substantial experience with all of the following: server systems, storage, I/O, networking, and system software

12+ years of platform engineering experience on large-scale production systems

Kubernetes and IaC expertise as an engineer

Ability to understand and communicate complex designs, distributed infrastructure, and requirements to peers, customers, and vendors

General shared storage knowledge such as NFS, LustreFS, GlusterFS, etc.

Familiarity with system-level architecture, such as interconnects, memory hierarchy, interrupts, and memory-mapped IO.

Ways to stand out from the crowd:

Proven experience in high performance computing, Deep Learning, and/or GPU accelerated computing domains

Experience with large-scale distributed systems, HPC, and ML training using Slurm and Kubernetes

Deep knowledge of both software and hardware in HPC and ML infrastructure

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 220,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits (***/en-us/benefits/). NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Senior Performance Engineer - Deep Learning

NVIDIA

  • US - US

  • December 12, 2024


We are seeking senior engineers with a passion for performance analysis and optimization to join our team in advancing groundbreaking technologies for deep learning compilers and automated kernel generation. At NVIDIA, you will collaborate across the full hardware/software stack, from GPU...


Senior Platform Software Engineer, PCIe

NVIDIA

  • US - US

  • December 15, 2024


NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning - the next era of computing - with the GPU acting as the brain of computers,...


Senior Cloud Platform Software Engineer

NVIDIA

  • US - US

  • December 19, 2024


NVIDIA is on the journey to build the best cloud offering for AI workloads and to bring its latest GPU technology to our clients as a set of managed services under the DGX Cloud umbrella. We want to be able to innovate on behalf of our clients and provide an easy, no-hassle way of using the latest...


Senior Tegra System Performance Architect

NVIDIA

  • US - US

  • November 26, 2024


We are now looking for a Senior Tegra System Performance Architect! Do you want to be a part of the Artificial Intelligence Revolution? Would you like to work with world-class systems architects and deep learning experts to define the next generation SoCs? NVIDIA is developing...


Senior Software Engineer - Automated Parallel Programming

NVIDIA

  • US - US

  • December 12, 2024


The PyTorch Team @ NVIDIA is hiring passionate parallel programmers. Join us to design and build the tools used by millions of AI practitioners deploying AI applications scalable to thousands of GPUs. Our team is responsible for the continual delivery of best in class experience on NVIDIA's hardware...


Senior System Level Product Engineer

NVIDIA


NVIDIA's GPUs and SoCs are the world leaders in performance and efficiency, and we are continually innovating in creative and unique ways to improve our ability to deliver extraordinary solutions in a wide range of sectors. We are seeking a post-silicon Senior System Level Product Engineer who is...