Site Reliability Engineering Lead - London
Location
london, england
Job Type
Full-time
Category
IT & Technology
Posted
June 02, 2026
Overview
We’re looking for a true SRE leader with a strong software engineering background. This isn’t a DevOps “on-call only” role — you’ll need to be comfortable reading and writing production code, deeply understanding application behaviour, and working alongside developers as a technical peer.
Responsibilities
You’ll lead and mentor the SRE team, setting direction and raising the bar for reliability across our systems. You’ll take end-to-end ownership of production, ensuring availability, performance, and effective incident response, while defining SLIs and partnering with Product on meaningful SLOs and error budgets.
- Own production systems (availability, performance, incident response)
- Define SLIs/SLOs and use error budgets to guide decisions
- Run incident management, on-call, and blameless postmortems
- Get hands-on with code (PHP, Java/.NET) to troubleshoot and improve reliability
- Drive automati...