Site Reliability Engineering (SRE) Team Lead
I am working with an award-winning cloud analytics software company based in West Sussex, UK. They're seeking an experienced SRE Team Lead to join their growing team on a hybrid basis (one day per week in office).
About Them
My client delivers real-time call and contact analytics solutions to over 7,000 customers worldwide. They're part of the Cisco Partner Ecosystem and have won industry recognition including 'Best Analytics Platform'
The Role
The SRE Team Lead will drive operational excellence while leading a team of DevOps Engineers. They will shape infrastructure strategy and foster a resilient engineering culture using AWS and automation technologies.
Key Responsibilities
- Lead and develop the SRE team, establishing goals and performance standards
- Drive SRE strategy aligned with business objectives
- Design and manage scalable AWS infrastructure
- Implement automation using Ansible and related tools
- Oversee incident response and system reliability
- Establish SLAs, monitoring, and alerting strategies
Requirements
- Team leadership experience
- Deep AWS expertise
- Strong automation skills (especially Ansible)
- DevOps/SRE principles knowledge
- Strategic thinking balancing technical and business needs
Desirable
- Microsoft Azure experience
- Kubernetes knowledge
- Cloud security expertise
Benefits
- Flexible working arrangements
- Progressive holiday allowance (starting at 25 days)
- Pension scheme
- EV salary sacrifice scheme
- Private medical insurance (includes spouse)
For more information about this exciting opportunity with my client, please get in touch.