A Primer on the History and Evolution of Incident Management to Today
Many of the concepts SREs take for granted about incident management originated with efforts to fight fires in California in the 1970s.
An overview of major IT incidents and outages in 2021.
A summary of the Log4j vulnerability, and key takeaways for SREs.
SREs face special challenges during the holidays. Here’s how to manage them.
An overview of how SREs can benefit from Infrastructure-as-Code.
Although every company can benefit from SREs, some need SREs more than others.
Six tips on how Site Reliability Engineers (SREs) can prepare for the reliability challenges of Black Friday and Cyber Monday 2021.
A history of Site Reliability Engineering from its origins at Google in 2003 to the present.
Follow these steps to write a great SRE job resume.
An explanation of the meaning of SLA, SLO and SLI, and how SREs should use each concept to manage reliability.