Site Reliability Engineer
Hybrid in Stratford, London
450 - 500, Inside IR35
Initial 6-month
We're recruiting on behalf of a Global Services Provider who are looking for a Site Reliability Engineer to join their team where you will manage and guide the SRE team.
As a Site Reliability Engineer, you will be responsible for:
- Manage and guide the Site Reliability Engineering (SRE) team while promoting SRE principles throughout FCA product groups.
- Serve as the SRE subject matter expert and strategic lead within the delivery organization.
- Oversee and advise on daily operations related to observability tools, including their upkeep and optimization.
- Provide hands-on support to engineering teams for delivering observability initiatives, as needed.
- Collaborate with product teams to develop standard practices, templates, and automation for monitoring and alerting systems.
- Track and analyse metrics related to system performance and capacity planning.
- Partner with Product Groups to gather requirements and define observability strategies that align with the Event Management framework.
- Design and deliver dashboards that reflect business priorities and stakeholder expectations.
- Lead the development and review of Observability Plans submitted by various projects.
- Contribute to test planning and ensure the reliability of quality assurance results.
Proven skills and experience to help you succeed in this role:
- Strong Experience with primary role of SRE Engineer
- Strong experience in Devops Tools (Git Hub, Git Hub Actions, Workflow, CodeQL Jenkins, Nexus, CloudFormation/Terraform etc.)
- Strong experience in monitoring tool (Datadog is preferred)
- Strong Knowledge of AWS services EC2, ELB, ECS, S3, Config, CloudTrail, EFS, Lambda, VPC
- Strong Knowledge and experience of python/shell scripting
- AWS Certification (desirable)
Further Information Available upon Application.
ECS Recruitment Group Ltd is acting as an Employment Business in relation to this vacancy.