Lead Cloud Site Reliability Engineer, Leadership, (Azure or GCP), SLOs, Automation
A leading financial services client is seeking a strong technical leader to help drive and support a large group of SREs across multiple locations. The role will be split 50/50 between hands-on engineering and team management. This is an engineering role, not an operations role.
The role:
- Lead and mentor a team of up to 15 SREs, championing continuous improvement and engineering excellence.
- Partner with application teams as they migrate services to the cloud.
- Work with Product Owners and Engineering Leads to balance feature delivery with system reliability, performance and health.
- Use observability tooling, performance metrics and SRE principles to proactively identify issues and reduce operational toil.
- Implement incident and problem management practices, ensuring strong root cause analysis, longer MTTF and shorter MTTR.
- Champion SLOs, ...