The latest News and Information on Service Reliability Engineering and related technologies.
They say imitation is the sincerest form of flattery. In the six years since we launched the initial SRE report, we've seen some similarly themed 'reports' jump on the state of site reliability bandwagon. Why? Because the impact and importance of SRE and resilience engineering have resonated across industries, prompting organizations to delve deeper into this vital domain.
Compare Graphite and Prometheus, two leading open-source monitoring solutions.
Overview of what is high cardinality in the context of monitoring using Prometheus and Grafana.
Everything you want to know about high cardinality in cloud native environments and how to manage it effectively.
Everything you want to know about Prometheus and Thanos, their differences, and how they can work together.
Learn what is OpenTelemetry: The open-source observability framework for collecting and processing telemetry data from applications and systems.