
SRE Director
2 weeks ago
We're seeking a highly skilled Site Reliability Engineering (SRE) leader to join our team.
This is an exceptional opportunity for an experienced SRE professional to shape the SRE function and be part of a founder member of the Group SRE team. You will work closely with a small number of SREs in platform engineering, operations teams, and wider infrastructure teams for both public cloud and on-premises platforms.
The ideal candidate will have hands-on experience as an SRE practitioner with 5+ years of working experience in an SRE role. They should also have practical experience defining and implementing Service Level Objectives and operating to Error Budgets. Experience implementing and operating monitoring and observability technologies for enterprise-grade Production systems is also essential.
In this role, you will be responsible for defining, driving, and implementing the SRE strategy. This includes promoting an 'Automate-first' culture in operating services, reducing toil, and developing methodologies and strategies for identifying toil-heavy processes and automating their elimination. You will also assist in developing engineering and operational service metrics to improve efficiency and quality, and work with all parties to develop and implement Service Level Objectives (SLOs) for critical services.
Key Responsibilities: