Offer summary
Qualifications:
BS/MS/PhD in relevant fields, 8+ years in networking fundamentals, Experience with LAN and InfiniBand setups, Knowledge of Linux system administration, Familiarity with automation tools like Ansible.
Key responsabilities:
- Build AI/HPC infrastructure for customers
- Support large-scale AI cluster reliability
- Engage in complete service lifecycle from design to refinement
- Monitor and maintain live services for performance
- Provide feedback to internal teams on improvements