SonicJobs Logo
Login
Left arrow iconBack to search

Senior HPC Infrastructure Engineer

Hays Specialist Recruitment Limited
Posted 20 days ago, valid for 3 days
Location

Winchester, Hampshire SO23 9PA, England

Contract type

Full Time

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.

SonicJobs' Terms & Conditions and Privacy Policy also apply.

Sonic Summary

info
  • A pioneering company in cloud infrastructure is seeking a hands-on HPC Cluster Architect for a fully remote role.
  • Candidates should have proven experience with Slurm, deep knowledge of Infiniband and RoCE, and proficiency in Ansible.
  • The position requires a minimum of 5 years of relevant experience in deploying and scaling HPC clusters.
  • The salary for this role is competitive, with additional benefits including share options and an unlimited holiday policy.
  • This opportunity offers the chance to work in a collaborative environment while shaping cutting-edge HPC solutions.

Your new companyI've partnered exclusively with a pioneering company that's shaping the future of cloud infrastructure. Their innovative, high-performance, GPU-optimised platform is driving advancements in AI and HPC, while also championing sustainability for a greener, more efficient world.This role is fully remote, with no expectation to ever be in an office. You'll also enjoy the fantastic perk of unlimited holiday, giving you the freedom to recharge and thrive.

Your new roleThis is a hands-on, fully remote role focused on designing and delivering high-performance computing (HPC) clusters. You'll lead end-to-end architecture and deployment projects, working closely with internal teams and external suppliers to build scalable, GPU-optimised environments. From planning hardware and data centre requirements to configuring networks, storage, and compute management software, you'll be at the heart of technical delivery. The role also involves supporting service teams with escalations, collaborating with software engineers to enhance platform capabilities, and staying up to date with the latest in HPC hardware. It's a great opportunity for someone who thrives in project-led infrastructure work and wants to help shape cutting-edge HPC solutions. What you'll need to succeed

  • Slurm: Proven experience managing and tuning HPC job schedulers.
  • Infiniband and RoCE: Deep knowledge of high-speed networking technologies.
  • Ansible: Proficiency in using Ansible for automation and configuration management.
  • Networking: Strong networking fundamentals, ideally with experience in complex environments.
  • Data Centre Infrastructure: Familiarity with planning and supporting power, cooling, and rack layouts.
  • Cluster Deployment: End-to-end experience deploying and scaling HPC clusters.
  • Server Architecture: Understanding of GPU-optimised server hardware and operating systems.
  • Scripting & Automation: Comfortable scripting in Bash, Python, or similar for deployment and maintenance tasks.

What you'll get in return

  • Share options.
  • Unlimited holiday policy.
  • 100% Remote working.
  • Fantastic opportunities to develop - they make a habit of promoting in-house.
  • A great team with a passion for working collaboratively.
  • Enhanced family-friendly policies.
  • A truly flexible workplace!

What you need to do nowIf you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.

Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found at hays.co.uk

Apply now in a few quick clicks

In order to submit this application, a Reed account will be created for you. As such, in addition to applying for this job, you will be signed up to all Reed’s services as part of the process. By submitting this application, you agree to Reed’s Terms and Conditions and acknowledge that your personal data will be transferred to Reed and processed by them in accordance with their Privacy Policy.

SonicJobs' Terms & Conditions and Privacy Policy also apply.