Senior Software Engineer – AI Infrastructure and Tooling

extra holidays - fully flexible
Remote: Full Remote

Offer summary

Qualifications:

  • BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience.
  • 4+ years of experience developing tooling and APIs for Kubernetes-based computing platforms.
  • Strong programming skills in Terraform, Python, and Go for cloud automation software.
  • Expert knowledge of DevOps principles and strong AWS fundamentals.

Key responsibilities:

  • Design and implement Continuous Deployment (CD) pipelines for efficient software delivery.
  • Architect and drive advancements in large-scale cloud and on-premise computing clusters.
  • Utilize a breadth of tools and approaches to tackle a variety of system-related problems.
  • Ensure operational reliability and production excellence across all layers of the developed solutions.

NVIDIA XLarge http://www.nvidia.com
10001 Employees

Job description

We are looking for a highly motivated AI infrastructure automation and tools development expert to join us. As a seasoned professional with a strong passion for designing and implementing cutting-edge infrastructure solutions, you will play a key role in architecting and driving advancements in our large-scale cloud and on-premise computing clusters. We are a small, fast-moving team, and we own the production excellence of everything we develop, on all layers from the OS up to the services. Please apply if you are passionate about operational reliability, building AWS infrastructure automation and deployment tools, and working on new technologies and Cloud Native applications. The solutions you propose and build will directly impact the efficiency of the NVIDIA Autonomous Vehicles development team!

What you'll be doing:

  • You will apply strong programming skills and a deep understanding of distributed systems design to craft and build production-grade software.

  • You will focus on designing and implementing Continuous Deployment (CD) pipelines to ensure flawless and efficient software delivery.

  • You will be responsible for the big picture of how our systems relate to each other, using a breadth of tools and approaches to tackle a broad spectrum of problems.

What we need to see:

  • BS or MS in CS/CE/EE or equivalent experience

  • 4+ years developing tooling and APIs for k8s-based computing platforms

  • At least 4 years building automation software for the cloud with Terraform, Python, Go

  • Strong AWS fundamentals: IAM, VPC, RDS, S3, CDN, EC2

  • Expert knowledge of DevOps principles, tools, and methodologies

  • Working experience with Continuous Deployment (CD) pipelines

  • Good understanding of traffic engineering solutions: load balancing, Layer 7 proxies

  • In-depth understanding of all layers of the Internet protocol suite

  • Operational expertise with observability, the Prometheus ecosystem, and log ingestion at scale

  • Proficiency with the Linux environment

  • Excellent written and verbal interpersonal skills

  • You'll be a fun and motivated teammate who enjoys a challenge and celebrates success

Ways to stand out from the crowd:

  • Previous experience with building sophisticated tooling and SRE automation on large GPU/CPU clusters

  • Working experience with agentic AI tools for computing infrastructure management

  • Artifactory management at scale

  • Good understanding of cloud and datacenter security concepts; AWS is preferred

  • Solid understanding of large-scale k8s observability platforms

NVIDIA is the leader in AI, machine learning and datacenter acceleration! NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life’s work, to amplify human imagination and intelligence. Make the choice, join our diverse team today!

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required profile

Experience

Spoken language(s):
English

Other Skills

  • Social Skills
