Software Infrastructure Kubernetes Engineer
Graphcore is seeking a Software Infrastructure Kubernetes Engineer to join its Software Infrastructure team. The role involves scaling and managing the infrastructure that supports the development and deployment of Graphcore's Machine Learning Software components. The successful candidate will work with High-Performance Computing (HPC) AI platforms and gain experience in distributed systems.
Key responsibilities include developing, owning, and maintaining tools and services that support the software organization. The engineer will deploy and maintain Kubernetes infrastructure used to develop, test, and scale Graphcore hardware and its software stack. Managing cloud infrastructure with tools such as Terraform is also a critical part of the role.
The ideal candidate will have practical experience developing in Go and familiarity with cloud services, preferably AWS. Experience managing or developing in Linux environments and a strong understanding of CI/CD principles are essential. Proficiency in Kubernetes development and deployment is also required.
Desirable qualifications include experience developing Kubernetes Controllers, familiarity with Infrastructure as Code (IaC) tools like Terraform or OpenTofu, and experience with GitHub Actions. Knowledge of distributed HPC systems, modern observability tooling such as Prometheus, and programming languages like Python or C++ is advantageous.
Graphcore offers a competitive salary, flexible working arrangements, a generous annual leave policy, private medical insurance, a health cash plan, a dental plan, pension matching up to 5%, life assurance, and income protection. Additional benefits include a generous parental leave policy, an employee assistance program covering health and mental wellbeing, and a range of healthy food and snacks at its central Bristol office, which has its own barista bar.