The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Automate Your Infrastructure Analysis with Scheduled AI Reports

The least exciting part of an operations or SRE role is often the manual, repetitive task of generating reports. It’s the Monday morning scramble to summarize weekly infrastructure health for the team, or the end-of-quarter push to build a capacity planning document. This is boilerplate work that pulls you away from critical engineering tasks. We believe that if a process is repeatable, it should be automated. That’s why we’re introducing Scheduled AI Investigations and Insights.

How to perform real-time DNS monitoring in Grafana Cloud

When DNS (Domain Name System) resolution fails or becomes sluggish, users can experience timeouts, connection errors, and degraded performance, often without a clear indication of the root cause. This is where DNS checks in Grafana Cloud Synthetic Monitoring come in, allowing you to proactively monitor domain name resolution, verify that domains resolve to the correct IP address, and even measure how quickly that resolution occurs.
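To make the idea of a DNS check concrete, here is a minimal Python sketch of the two things described above: confirming that a name resolves to an expected address and timing the lookup. It is not how Grafana Cloud Synthetic Monitoring implements its checks; the function names (`resolve_ipv4`, `dns_check`), the result fields, and the injectable `resolver` parameter are all assumptions made for illustration, and the lookup simply goes through the operating system's resolver.

```python
import socket
import time
from typing import Callable, Dict, List


def resolve_ipv4(hostname: str) -> List[str]:
    """Resolve a hostname to its IPv4 addresses via the system resolver."""
    infos = socket.getaddrinfo(
        hostname, None, family=socket.AF_INET, type=socket.SOCK_STREAM
    )
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})


def dns_check(
    hostname: str,
    expected_ip: str,
    resolver: Callable[[str], List[str]] = resolve_ipv4,  # hypothetical hook for testing
) -> Dict[str, object]:
    """Run one check: resolve, compare against the expected IP, and time it."""
    start = time.perf_counter()
    try:
        addresses = resolver(hostname)
    except OSError:
        # socket.gaierror is a subclass of OSError; treat it as "did not resolve".
        addresses = []
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return {
        "hostname": hostname,
        "resolved": addresses,
        "matches": expected_ip in addresses,
        "elapsed_ms": elapsed_ms,
    }
```

Calling `dns_check("example.com", "<expected address>")` on a schedule and alerting when `matches` is false or `elapsed_ms` crosses a threshold is the general shape of such a check; a hosted service would additionally run it from multiple locations and against specific name servers.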

ICMP Monitoring: What Is ICMP & How It Works

Ever “pinged” a server and wondered what those milliseconds actually mean? If you’re a network admin or IT pro, you already use ping as a quick sniff test. But ICMP is more than a green checkmark or a scary timeout. In this article, we’ll define ICMP, walk through how echo requests and replies work, and show how to turn basic pings into useful network and ICMP monitoring.
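As a rough illustration of what an echo request looks like on the wire, the Python sketch below builds an ICMP echo request (type 8, code 0) with the standard Internet checksum and parses the fields an echo reply (type 0) would carry back. The function names and the default payload are assumptions for this example; actually sending the packet requires a raw socket and usually elevated privileges, so that part is left out.

```python
import struct
from typing import Dict

ICMP_ECHO_REQUEST = 8  # message type for an echo request
ICMP_ECHO_REPLY = 0    # message type for an echo reply


def internet_checksum(data: bytes) -> int:
    """16-bit one's-complement checksum used by ICMP."""
    if len(data) % 2:
        data += b"\x00"  # pad to a whole number of 16-bit words
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]
    while total >> 16:
        # Fold any carry bits back into the low 16 bits.
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


def build_echo_request(identifier: int, sequence: int, payload: bytes = b"ping") -> bytes:
    """Build an echo request: type, code, checksum, identifier, sequence, payload."""
    # The checksum is computed with the checksum field set to zero.
    header = struct.pack("!BBHHH", ICMP_ECHO_REQUEST, 0, 0, identifier, sequence)
    checksum = internet_checksum(header + payload)
    header = struct.pack("!BBHHH", ICMP_ECHO_REQUEST, 0, checksum, identifier, sequence)
    return header + payload


def parse_echo(packet: bytes) -> Dict[str, object]:
    """Split an ICMP echo message (without the IP header) into its fields."""
    msg_type, code, checksum, identifier, sequence = struct.unpack("!BBHHH", packet[:8])
    return {
        "type": msg_type,
        "code": code,
        "checksum": checksum,
        "identifier": identifier,
        "sequence": sequence,
        "payload": packet[8:],
    }
```

The milliseconds that ping reports are the time between sending a request like this and receiving the reply with the matching identifier and sequence number. A monitoring tool tracks that round-trip time and the share of requests that never get a reply.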

OpenTelemetry and Jaeger | Key Features & Differences [2025]

OpenTelemetry is a broader, vendor-neutral framework for generating and collecting telemetry data (logs, metrics, traces), offering flexible backend integration. Jaeger, on the other hand, is focused on distributed tracing in microservices. Jaeger previously had its own SDKs, based on the OpenTracing APIs, for instrumenting applications, but it now recommends using OpenTelemetry instrumentation and SDKs. Warning: the original Jaeger client SDKs (based on OpenTracing) are archived and no longer maintained.

From Shadow AI to Strategy: The Six-Month AI Imperative (w/ Charlene Li)

In this very special episode of The DEX Show, we welcome back one of the world’s most influential voices on digital transformation and the future of AI leadership: Charlene Li. Charlene is a bestselling author and trailblazing thinker who has helped leaders navigate disruption for over two decades. She returns to the show for an unmissable conversation on the realities of AI Transformation—and what it means for organizations, leaders, and employees at every level.

Grafana & Friends Stockholm meetup at 0+X

In this talk, we’ll introduce the Kafka Data Source plugin we developed for Grafana, which enables users to query and visualise Kafka topic data directly in their dashboards—without the need for intermediate storage or external services. We'll share how the idea came about, how we collaborated with the Grafana community and developers to bring it to life, and the challenges we faced along the way. We'll also discuss our vision for the plugin’s future and its role in the evolving observability landscape.

Introducing Request Mirror: a free micro-service to reflect HTTP requests

We have launched Request Mirror, a little free service to reflect HTTP requests. We've also open-sourced it: you can read the code in the ohdearapp/request-mirror.ohdear.app repo on GitHub. In this blog post I'd like to explain why we built it and how you can use it.

AI in Server Monitoring: Why Human Context Still Matters in 2025

When Microsoft rolled out Windows Server 2025 last November, it marked a turning point in how IT teams think about monitoring. Suddenly, AI-powered features like anomaly detection, predictive resolution, and even self-healing aren’t ideas on a roadmap — they’re built into the very fabric of enterprise infrastructure.