Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

AWS & Splunk: Accelerating Innovation Through Partnership

Discover how AWS and Splunk are pushing the boundaries of innovation to empower your security, observability, and cloud transformation journey. This video highlights our joint commitment to driving digital resilience through unified visibility, faster threat detection, and seamless integration across AWS services.

Synthetic Monitoring for GraphQL Endpoints: Beyond the Query

GraphQL isn’t just another API protocol—it’s a new layer of abstraction. It collapsed dozens of REST endpoints into one flexible interface where clients decide what data to fetch and how deep to go. That freedom is a gift for front-end teams and a headache for anyone tasked with reliability. Traditional monitoring doesn’t work here. A REST endpoint can be pinged for uptime.

Grafana Mimir 3.0 release: performance improvements, a new query engine, and more

In 2022, we introduced Grafana Mimir, our open source, horizontally scalable, multi-tenant time series database (TSDB) designed for long-term storage of Prometheus and OpenTelemetry metrics. Over the years, Mimir has become a go-to metrics backend within the open source community, with 30 project maintainers and more than 4.7k GitHub stars.

Stop the guesswork: Troubleshoot with confidence with process monitoring

IT infrastructure is vast, complex, and interdependent. At any point in time, businesses rely on thousands of servers running thousands of processes. Detecting server downtime is fairly easy—but true observability is when you know precisely which processes are working as intended and which are silently contributing to performance degradation. A failed database worker or a memory-leaking background service can silently drain resources until your most critical apps grind to a halt.

Accelerate your Azure integration setup with guided onboarding

Getting started with monitoring for Microsoft Azure environments can be a lengthy and manual process. Many tools require users to create app registrations, assign permissions, and enable log forwarding or telemetry data collection across multiple portals and scripts. These fragmented steps slow down onboarding and introduce opportunities for misconfiguration, making it harder for teams to quickly achieve full visibility.

Understand user experience through network performance with Datadog Synthetic Monitoring

When an application slows down or fails, pinpointing the cause isn’t always simple. Is it a backend regression, a misbehaving API, or a bottleneck somewhere deep in the network? Without full visibility, teams waste precious time troubleshooting across disconnected tools and layers. Datadog Synthetic Monitoring now supports Network Path to help you proactively identify whether user-facing issues stem from your code or from the underlying network.

OTel Updates: Declarative Config - A Steadier Way to Configure OpenTelemetry SDKs

Application configs change over time, often in small ways that are easy to miss. They may start simple — a few environment variables, one exporter, nothing unexpected. As your instrumentation grows, you add rules for filtering health check spans, adjust sampling based on attributes, or introduce environment-specific resource settings. Each change makes sense on its own. But months later, the picture can look different across dev, staging, and production.

Discover resources smarter with deep discovery in internet services

Discover how Deep Discovery from Site24x7 simplifies your website monitoring by automatically detecting, grouping, and managing all related resources—so you don’t miss a thing. In this video, we walk you through a real-world use case, the problems Site24x7 solves, and how its time-saving features like Bulk Addition make managing multiple monitors effortless. Whether you’re tracking SSL, DNSs, APIs, or website performance, Deep Discovery gives you complete visibility without manual hassle.

The APM paradox

Application Performance Monitoring (APM) means many things to many people. At its core, it enables developers to diagnose why their applications are slow and helps them provide a better experience to their users. Traditionally, this is accomplished by collecting a lot of data and displaying it in the form of dashboards and request traces. The problems you're trying to solve are generally known up front.