Applying Site Reliability Engineering 'Golden Signals' to your Kubernetes Cluster

Applying Site Reliability Engineering 'Golden Signals' to your Kubernetes Cluster

Jun 26, 2019

Understanding how to monitor the "Golden Signals" of Site Reliability Engineering (SRE) in your Kubernetes cluster(s) is an important skill for any engineer, especially for Day 2 Operations. Fortunately, there are some very useful, powerful, and open source tools and technologies out there for accomplishing these tasks. This training session will go over how to monitor these "Golden Signals" in a Kubernetes cluster using Prometheus and Slack.

Watch this session to learn:

  • What the SRE "Golden Signals" are and how to apply them.
  • What Prometheus is and how to use it.
  • How to monitor signals, events, and errors and receive alerts and notifications via Slack

To get the slides for this session, visit:

https://info.rancher.com/site-reliability-engineering-golden-signals-in-kubernetes-clusters