The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Grafana Tempo 2.9 release: MCP server support, TraceQL metrics sampling, and more

Grafana Tempo 2.9 is now available, delivering MCP server support, TraceQL performance improvements, and more. Watch the video below to see the Tempo MCP server in action and learn how to speed up TraceQL metrics queries, or continue reading to get a quick overview of these and other updates. The Grafana Tempo 2.9 release notes and changelog provide more in-depth details and include all of the changes that came with this release.

Elastic recognized as a finalist for Innovation in Customer Portals in 2025 TSIA STAR Awards

We are proud to announce that Elastic has been named a finalist by the Technology & Service Industry Association (TSIA) in the 2025 STAR Awards program for Innovation in Customer Portals that Improve Digital Customer Experience. This award recognizes Elastic’s ability to embrace AI innovations to enhance our digital customer experience.

Your network isn't infrastructure anymore. It's a product.

In my last blog, I discussed a common problem: metrics like mean time to resolution (MTTR) mean nothing to business leaders. Celebrating a faster fix for an outage that still cost the company thousands in lost sales is a conversation that goes nowhere. You might as well be speaking a different language.

What's New in Network Observability for Fall 2025

As your partner in network observability, we’ve worked together to help you manage an increasingly complex digital landscape. You’ve built a powerful monitoring foundation, but the pace of change doesn’t slow down. Your network continues to expand across hybrid clouds and multi-vendor SD-WAN, and the demands on your team grow with it.

The Network Engineers You Can't Hire? They Already Work for You

In my conversations about managing large, complex networks, one topic is now constant. The issue isn't budgets or new technology; it's personnel. Specifically, it's the increasing difficulty of finding and retaining skilled professionals. If you are feeling this pressure, you are not alone. The search for technical talent is a universal challenge.

How to bridge speed and quality in experiments through unified data

Metrics are fundamental to experimentation for two reasons: They set the basis for evaluating ideas and interventions, and they can suggest where to look next. As such, many teams collect a wide variety of metrics, from application performance data to revenue trends. However, doing so often means manually knitting together data from multiple sources and formats. Even then, data silos can make it challenging to understand the full impact of experimental changes. In this post, we’ll explore.

Two Factors, Double Security?

“Please enter the code we just sent you.” – most people have seen this message when logging into an online service. Two-Factor Authentication (2FA) is no longer reserved for banks or enterprises. It’s now common in email, social media, and shopping accounts. The idea is simple: in addition to a password, you need a second factor so that attackers can’t break in with just one piece of information. But what methods are actually used – and how secure are they really?

Datadog Cloud Cost Management: Make cost a key metric for engineers

See how Datadog Cloud Cost Management puts cost and efficiency KPIs directly in front of engineers in their daily workflows. In this short demo, you’ll learn how Datadog unifies cost, performance, and business metrics in one platform, so FinOps, engineering, and finance teams can make cost-aware decisions together.