Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How to Reduce Change Lead Time

The time it takes to get a change into production, also known as Change Lead Time, is an important measure of developer productivity, and one of the four DORA metrics. In Sleuth, you can easily break down Change Lead Time into activities to get insights into potential bottlenecks. SLEUTH A deploy-based Accelerate Metrics tracker both managers and developers love.

Announcing Workflows - Fix k8s issues on the fly

Meet "Workflow" from Komodor, the feature that automates troubleshooting common errors in Kubernetes. Leveraging Komodor’s change intelligence capabilities, Workflow's smart algorithm automatically detects Kubernetes issues and responds with a series of checks that quickly pinpoint its root cause. Not stopping there, Komodor uses the information to provide made-to-measure instruction for remediation, turning troubleshooting into a fast and effortless experience for dev and ops.

How to easily track DORA metrics

With Sleuth, you can instantly and automatically track the four DORA metrics: Change Lead Time, Deploy Frequency, Change Failure Rate, and MTTR. No more, no less. Sleuth tracks them accurately, doesn't attempt to track dubious individual metrics, and doesn't require any manual work. SLEUTH A deploy-based Accelerate Metrics tracker both managers and developers love.

Best Practices Guide for Kubernetes Labels and Annotations

Kubernetes is the de facto container-management technology in the cloud world due to its scalability and reliability. It also provides a very flexible and developer-friendly API, which is the foundation of its control plane. The effectiveness of the Kubernetes API comes from how it manages the Kubernetes resources via metadata: labels and annotations. Metadata is essential for grouping resources, redirecting requests and managing deployments.

Tales of A11y In Grafana OS: Introducing Pa11y CI into our pipeline by Alexa Vargas

We want to make Grafana accessible to everyone! In this talk, Alexa will share how Grafana recently introduced Pa11y CI into the Grafana Continuous Integration pipeline. The library supports our developers and contributors to highlight a11y issues. And more importantly, it acts as a gatekeeper, stopping new A11y issues from making it into the project. You will additionally hear about the alternatives that were considered and their challenges. This talk will have everything!

Self Healing Kubernetes at the edge

As developers and businesses are shifting their attention to the edge, everyone wants to build their own edge clusters and manage them. However, building a highly available edge cluster is not easy. Kubernetes simplifies container deployments by abstracting the resource management details from the users, allowing them to deploy using standard CLI or templates.

What SREs Can Learn from Facebook's Largest Outage

Facebook’s October 2021 outage was the type of event that gives SREs nightmares: A series of critical business apps crashed in minutes and remained unavailable for hours, disrupting more than 3.5 billion users around the world and costing about 60 million dollars. As incidents go, this was a pretty big one.