Hi,
Position: NVIDIA AI Infrastructure & Kubernetes Engineer (DGX)
Location: Remote
Duration: 12 Months (Contract)
Experience Required : 15+
Work Authorization: USC / GC Only
Certification: NVIDIA Certification (Mandatory)
Role Overview:
Experienced engineers to build and manage AI infrastructure using NVIDIA DGX systems and Kubernetes, supporting large-scale AI/ML workloads with high performance, reliability, and security.
Key Responsibilities:
โข Set up and manage NVIDIA DGX systems (BasePOD / SuperPOD)
โข Monitor system performance, upgrades, and capacity planning
โข Ensure GPU cluster health and performance
โข Build and manage Kubernetes clusters for AI workloads
โข Deploy containerized applications
โข Implement CI/CD pipelines and automation
โข Work with high-speed networking (InfiniBand)
โข Optimize GPU communication (NVLink / NVSwitch)
โข Secure Kubernetes environments (RBAC, secrets, policies)
Required Skills:
โข Strong Kubernetes experience (CKA/CKAD/CKS preferred)
โข Hands-on experience with NVIDIA DGX systems
โข Experience with GPU workloads & AI/ML infrastructure
โข Knowledge of InfiniBand or high-performance networking
โข DevOps experience (CI/CD, GitOps)
Good to Have:
โข Experience with NVIDIA BlueField / UFM tools
โข Understanding of AI model training & inference workflows
Thanks,
Shaik Jakeer
[Upgrade to PRO to see contact]
Contact details hidden
โก Apply Now โ Be First
Client's email and LinkedIn are on this page. Upgrade to see them and apply before others.
๐ Contact info is hidden on FREE plan. PRO members apply directly and get hired faster.