The latest News and Information on Service Reliability Engineering and related technologies.
There’s no better time than now to dedicate effort to reliable software. If it wasn’t apparent before, this past year has made it more evident than ever: People expect their software tools to work every time, all the time. The shift in the way end-users think about software was as inevitable as our daily applications entered our lives, almost like water and electricity entered our homes.
Building Site Reliability Engineering (SRE) teams is hard! There are so many articles and explanations of what SRE means, it’s easy to get lost. Going beyond understanding what the individual SRE role is into building and scaling a team of SREs is more of a challenge. It’s important to find the right information that will help you take your SRE team to the next level.
Site Reliability Engineering (SRE) is playing an increasingly pivotal role in supporting hybrid-cloud, DevOps environments, where Dev teams need to release updates fast and Ops need to avoid errors and failures in production. Powered by integrations to monitoring, orchestration, provisioning and ITSM tools, Interlink’s SRE solution brings improved understanding of where threats to the health of your IT services might lurk within DevOps workflows.
Many organizations are transitioning toward a DevOps operational model, where software developers are responsible for operating the applications they develop, instead of a centralized IT operations group. In this “CTO Perspective” interview we talk to BigPanda’s CTO Elik Eizenberg about the challenges in that transition, and what it takes to make it easier. Lean back and watch the interview, or if you prefer reading, take a few minutes to read the transcript.