Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Making Observability AI-Native with the Logz.io MCP Server

Now available: Secure, real-time access to your observability data via Logz.io’s Model Context Protocol (MCP) Server. The Logz.io MCP Server brings your logs, metrics, and telemetry data into the Model Context Protocol (MCP), an emerging open standard that lets AI systems query real data securely and contextually, in real time. That means any MCP-compatible LLM, like Claude Desktop, Cursor, your own AI agent… can now connect directly to your Logz.io environment.

Messaging Infrastructure Is Still in the Dark: The Observability Illusion Costing Millions

In today’s always-on digital world, even the best messaging platforms—like Apache Kafka and Apache ActiveMQ—can become blind spots that undermine resilience. This article exposes the “observability illusion” many organizations face, showing how limited visibility and manual processes lead to outages, high costs, and constant firefighting. Learn how meshIQ transforms reactive operations into proactive engineering through unified observability, automation, and self-service.

Improve Observability in Your CI/CD Pipeline

The backbone of modern software development is automation and at the heart of that lies the CI/CD pipeline. It’s what turns code into deployable software, delivering changes to users faster, safer, and more predictably. In simple terms, a CI/CD pipeline automates everything from the moment developers push code to when it reaches production. It integrates, tests, builds, and deploys software continuously ensuring faster releases with fewer human errors.

What the RFC?! Making sense of syslog before you migrate

Syslog: it's everywhere, it’s ancient, and let’s be honest — it rarely shows up the way the RFC says it should. Before you cut over to Cribl Stream, it pays to understand exactly what you're dealing with and why it matters. In this talk, we’ll demystify the syslog format (yes, the actual RFC 3164 and 5424 stuff), look at what happens when data goes rogue, and explore how Cribl can help bring order to the chaos.

Observability and FedRAMP in Action: The VA's Mission to Deliver Reliable Digital Service

Ensuring digital services remain accessible, reliable, and secure is a high priority for any organization operating at scale. For the Department of Veterans Affairs (VA), this focus is central to its mission of providing quality care to veterans, their families, and caregivers. Often described as “the largest IT shop in the United States,” the VA manages 2.7 million pieces of equipment across a vast network of interconnected systems.

Unify Observability, Surface Business Impact, and Solve Problems Using AI Agents with Latest Splunk Observability Innovations

In September at.conf25, we announced how Splunk is shaping the future of digital resilience in the age of AI. Agentic AI is rewriting what it takes to build a leading observability practice. As vibe coding gains steam, applications will be built with less human involvement. At the same time, the rise of AI agents demands specialized telemetry to ensure models are performing as intended—aligned to their business purpose and cost.

Splunk Advances the OpenTelemetry Project with Its Latest Donation, the OpenTelemetry Injector

Splunk is very excited to be sponsoring Kubecon North America once again, kicking off this week in Atlanta, GA. As many know, Splunk is one of the top contributors to the OpenTelemetry project. We’re happy to have sent many of the Splunkers who serve as project maintainers and contributors to lead SIG meetings and engage with the greater community in the OpenTelemetry Observatory, sponsored by Splunk.
Sponsored Post

Preparing for cloud failures: Monitoring strategies for distributed hybrid infrastructure

When AWS experienced its recent outage, the ripple effect was immediate. Critical workloads slowed, dashboards went blank, and many teams realized multi-cloud isn't automatically resilient. Cloud-level failures are inevitable due to the interdependent components and complex IT architecture. The recent AWS disruption reminded many teams that the cloud isn't a magic uptime guarantee. Even the most mature providers can-and do-experience large-scale service interruptions.

AI Agents Observability with OpenTelemetry and the VictoriaMetrics Stack

Nowadays, AI agents are becoming more and more popular and often deployed as part of production systems. However, this rapid adoption brings unique observability challenges that require flexible solutions. On the one hand, AI agents are fundamentally just like any other software services that produce the same classic observability signals we’re familiar with: metrics, logs, and traces.

From Observability to Network Intelligence: How Kentik Built the Foundation for Networks That Think

The age of dashboards is ending, as observability has only created more noise for network teams to sift through. Kentik SVP of Product, Mav Turner, lays out why true network intelligence requires a clean, contextual data foundation to finally create a network that thinks.