Operations | Monitoring | ITSM | DevOps | Cloud

March 2020

How to successfully correlate metrics, logs, and traces in Grafana

As everyone knows, the Grafana project began with a goal to make the dashboarding experience better for everyone, and to make it easy to create beautiful and useful dashboards like this one. But as Andrej Ocenas, a full stack developer at Grafana Labs, said in a recent FOSDEM 2020 presentation, the company has bigger ambitions for Grafana than just being a beautiful dashboarding application. What Grafana Labs is really aiming to do now is make Grafana into a full observability platform.

WFH tips: a technical guide to video conference calls

The COVID-19 pandemic has forced many companies to require employees to work from home. It’s a new normal for many, but at Grafana Labs our team has always recruited and operated with a remote-first culture in mind. To help everyone transition to a home office environment, we launched a new WFH series in which Grafana team members share their best advice for staying productive at home – yes, even if you have kids around.

How we're using gossip to improve Cortex and Loki availability

Have you heard about using the hash ring in Loki and Cortex? Here is a short version: In Cortex and Loki, the ring is a space divided by tokens into smaller segments. Each segment belongs to a single “ingester” (component in Cortex and Loki that receives data) and is used to shard series/logs across multiple ingesters. In addition to the tokens, each ingester also has an entry with its ID, address, and the latest heartbeat timestamp updated periodically.

Introducing Grafana Cloud Agent, a remote_write-focused Prometheus agent that can save 40% on memory usage

Today, we are announcing the Grafana Cloud Agent, a subset of Prometheus built for hosted metrics that runs lean on memory and uses much of the same battle-tested code that has made Prometheus so awesome. At Grafana Labs, we love Prometheus. We deploy it for our internal monitoring, use it alongside Alertmanager, and have it configured to send its data to Cortex via remote_write. Unfortunately, as we scale to handle more load, our deployment becomes more and more difficult to manage.

How to work from home with kids: More tips from the remote-first Grafana Labs team

With every tech company on Earth suddenly pretending they’re remote-friendly overnight, there are a lot of posts about how to work well from home. As a matter of fact, we wrote our own, so why would we write another? The answer is: kids.

How the Jsonnet-based project Tanka improves Kubernetes usage

At FOSDEM 2020, Grafana Labs software engineers Tom Braack and Malcolm Holmes explained how and why the team developed Tanka, a scalable Jsonnet-based tool for deploying and managing Kubernetes infrastructure. They also shared how Grafana Labs leverages the project to manage and monitor its own infrastructure as well as showcased how Tanka makes deploying a Grafana instance faster and more efficient.

Learn Grafana: How to use dual axis graphs

You’re done setting up your first graph panels. You want to do more, look around the visualization settings, and discover the settings for the X and Y axes. You stumble over the configuration for a “Right Y” axis. You ask yourself, “Why on earth would I need another Y axis?” You toggle it back and forth and change some settings, yet that makes no difference to your graph. What gives? Never fear.

Pro tip: How to monitor client certificate expirations with Prometheus

Certificates can be difficult to track and opaque to administrators, and if any expire without someone noticing, embarrassing outages can happen. At Grafana Labs we strive to make all things visible and observable; why should certificates be any exception? In this post we will explore an easy way to expose and monitor certificate expirations using Grafana and Prometheus.

The benefits of observability

Grafana Labs cofounder and CEO Raj Dutt was a recent guest on the Designing Enterprise Platforms podcast from Early Adopter Research (EAR), speaking to host Dan Woods about the benefits of observability. The conversation touched on several related topics – including the tactics of observability, platform approaches, and why now is a great time to be part of an open source company.

New in Grafana 6.6: Forcing minimum alert evaluation frequency

There has long been a request from administrators to have the ability to enforce a minimum interval between alert rule evaluations. This is useful for restricting unrealistic user-defined alert rules that evaluate too often and create unnecessary load in the backend. @Uepoch took the initiative and made all the necessary modifications for this configuration in Grafana’s backend, and we finally pushed it forward and introduced the feature in Grafana v6.6.