HPC Administrator with Kubernetes-58907

  • Full Time, onsite
  • PRIMUS Global Services Inc.
  • Remote Hybrid, United States of America
Salary undisclosed


We have an immediate long-term opportunity with one of our largest clients for the position of HPC Administrator, working on a remote basis.

Requirements

  • Manage and maintain HPC compute environments, including InfiniBand network configurations.
  • Install and configure NVIDIA GPU drivers and software stacks, including PyTorch, TensorFlow, and CUDA.
  • Troubleshoot and resolve issues related to PBS schedulers.
  • Administer and optimize Lustre file systems to ensure high performance and reliability.
  • Work with Docker and Kubernetes for container management and orchestration.
  • Handle Linux distributions such as Red Hat, CentOS, and Rocky Linux.
  • Write and maintain Python and Bash scripts for automation and system management.
  • Provide expertise in storage solutions and manage storage configurations.

Qualifications:

  • 5-7 years of experience in HPC administration and management.
  • Strong expertise with HPC compute environments and InfiniBand networking.
  • Proven experience with NVIDIA GPUs, including driver installation and software stack management (PyTorch, TensorFlow, CUDA).
  • In-depth knowledge of PBS scheduler troubleshooting.
  • Experience with Lustre file systems and their administration.
  • Proficiency in Docker and Kubernetes.
  • Solid experience with Linux distributions, including Red Hat, CentOS, and Rocky Linux.
  • Strong scripting skills in Python and Bash.
  • Comprehensive understanding of storage technologies and solutions.

For more information, please connect with:

Narasimha
PRIMUS Global Services
Direct No:
Phone No: Ext: 267
Email:
