Staff Site Reliability Engineer
Diligent Corporation is seeking a Staff Site Reliability Engineer to lead the design and operation of highly available private cloud platforms supporting large-scale SaaS services. The role centers on VMware-based infrastructure and on keeping systems fast, resilient, and secure. The engineer will collaborate across teams to deliver robust, enterprise-grade platforms.
Key responsibilities include designing, operating, and improving VMware-based private cloud infrastructure across multiple datacenter environments. The role also involves building automation to reduce manual effort, administering and troubleshooting Linux and Windows Server platforms, and supporting core infrastructure integrations such as Active Directory, DNS, and networking. Additionally, the engineer will drive improvements in platform resilience, performance, security, and compliance.
The ideal candidate has strong experience in systems engineering or infrastructure roles, with deep expertise in VMware vSphere, including ESXi, vCenter, HA, DRS, vMotion, and distributed switching. Advanced Linux systems administration skills, practical experience with Windows Server and Active Directory, and a proven ability to build automation using tools such as PowerShell, PowerCLI, Python, or Ansible are essential. Strong enterprise datacenter knowledge and excellent problem-solving skills are also required.
Diligent offers a flexible work environment, comprehensive health benefits, generous time off policies, and wellness programs. The company fosters a culture of innovation and collaboration, with a commitment to diversity and inclusion. Employees have opportunities for professional growth and development within a global community dedicated to making a positive impact.