Operations | Monitoring | ITSM | DevOps | Cloud

Elastic's Guide to Keeping Services up and Running with Real-time Visibility

Learn how to start monitoring in minutes, keep your networks up and running, and make sure citizens have continuous access to digital portals and services. Increased traffic. New users on the network. Data sharing at unprecedented levels. Meet all the challenges coming your way with the free and open Elastic Stack.

Performing chaos in a serverless world  Gunnar Grosch  Failover Conf 2020

Chaos engineering is the practice of hypothesis testing through planned experiments to gain a better understanding of a system’s behavior. The principles of chaos engineering have been around for years, and we have now reached the point where chaos engineering has gone from just being a buzzword and practice used by a few large organizations in very specific fields, to it being put in to use by companies of all sizes and industries.

Swim Don't Sink: Why Training Matters to a Site Reliability Engineering Practice  Jennifer Petoff

Do you offer training to the engineers in your organization or do you throw them off the deep end to “sink or swim”? Providing training and education is universally important to set team members up for success in your organization and is critical for establishing a thriving Site Reliability Engineering (SRE) or DevOps practice and culture in the first place.