%term

The latest News and Information on Observabilty for complex systems and related technologies.

Innovation Week Day 1: The SDLC Is Collapsing, and Observability Has Never Mattered More

May 12, 2026 By Shabih Syed In Honeycomb

The software development lifecycle is collapsing. The multi-stage pipeline that defined how software got built and shipped for decades is compressing into rapid loops of intent and validation, with agents now part of the teams building and running it. Day 1 of Innovation Week was about what that shift means for how software gets validated, where observability fits, and the problems that have always been hard but are now genuinely urgent.

Read Post

Honeycomb

Read more about Innovation Week Day 1: The SDLC Is Collapsing, and Observability Has Never Mattered More

Honeycomb Innovation Week - Day 1 Replay

May 12, 2026 By Honeycomb In Honeycomb

Watch a full replay of all keynotes on Day 1 of Honeycomb's Innovation Week.

View Video

Honeycomb

Read more about Honeycomb Innovation Week - Day 1 Replay

Security Integrations in Observability Self-Hosted

May 12, 2026 By solarwindsinc In SolarWinds

Integrating security data with observability data provides a comprehensive view for better threat detection and response. Security observability helps connect the dots between seemingly innocent events that, when correlated, reveal complex attack patterns. SolarWinds security products integrate into observability self-hosted, including Security Event Manager for log data and event correlation, Access Rights Management for identifying potential attack vectors, configuration management for compliance monitoring, and Patch Manager for tracking critical updates.

View Video

SolarWinds

Read more about Security Integrations in Observability Self-Hosted

Making Semantic Conventions Work for You With OpenTelemetry Weaver

May 11, 2026 By Mike Goldsmith In Honeycomb

Your dataset has hundreds of attributes. Some are self-explanatory: http.response.status_code, server.address. Others are not: meta.refinery.reason, dataset.slug, sli.latency_target_ms. If you don't know what an attribute means, you can't write a good query. And if an AI agent doesn't know what it means, it guesses.

Read Post

Honeycomb

Read more about Making Semantic Conventions Work for You With OpenTelemetry Weaver

Why Alert Fatigue Solutions Still Miss the Root Cause

May 11, 2026 By Lightrun Team In Lightrun

Alert fatigue solutions have never been better, but on-call engineers are still burning out. Threshold tuning, AI triage, and alert correlation reduce the noise, but every alert that clears filtering lands with the same incomplete telemetry and triggers the same manual investigation cycle. This post explains why the evidence gap survives every fix, and how runtime context changes that.

Read Post

Lightrun

Read more about Why Alert Fatigue Solutions Still Miss the Root Cause

Multi-tiered Observability: A Practical Way to Handle Diverse Workloads

May 8, 2026 By Pablo Fernandez In VictoriaMetrics

Observability in large companies is rarely one-size-fits-all. The VictoriaMetrics topologies guide shows why different deployment patterns are needed as scale, isolation, and reliability requirements grow. Different workloads require different trade-offs: some need long retention for audits and trend analysis, while others need higher resolution for debugging. Business-critical systems also demand dependable alerting and high availability, often with several 9s of reliability.

Read Post

VictoriaMetrics

Read more about Multi-tiered Observability: A Practical Way to Handle Diverse Workloads

Why Blast Radius Analysis Does Not End When Alerts Fire

May 7, 2026 By Lightrun Team In Lightrun

Modern distributed systems fail in ways that can bypass even well-designed isolation patterns. When a failure is actively propagating across services at four in the morning, the question shifts from “how do we limit the blast radius” to “how do we confirm what it actually is.” Monitoring shows which services are in the impact zone, but it cannot show what code path caused the failure to spread, or whether it has stopped.

Read Post

Lightrun

Read more about Why Blast Radius Analysis Does Not End When Alerts Fire

Span or Attribute in OpenTelemetry Custom Instrumentation

May 7, 2026 By Jessica Kerr (Jessitron) In Honeycomb

TL;DR: Attribute. More information on one event gives us more correlation power. It’s also cheaper. When you want to add some information to your tracing telemetry, you could emit a log, create a span, or add a piece of data to your current span. Adding a piece of data to your current span is the best! Usually.

Read Post

Honeycomb

Read more about Span or Attribute in OpenTelemetry Custom Instrumentation

Observability and Security for the AI Era

May 7, 2026 By Datadog In Datadog

Datadog has always been driven by a broader vision of helping teams understand and operate complex systems. In this session, you’ll hear from Michael Whetten, Product SVP, and Abrar Hussain, Senior Director, Product Management, as they share the latest updates across the Datadog product suite and discuss how that vision continues to shape the platform’s evolution and support the next generation of AI-driven applications.

View Video

Datadog

Read more about Observability and Security for the AI Era

How to Prevent AI Agents From Deleting Production Data

May 6, 2026 By Lightrun Team In Lightrun

There’s a new question teams are asking. How can we prevent AI agents from deleting production. When Cursor deleted PocketOS’s entire production database in nine seconds, the agent wasn’t malfunctioning. It had full technical capability, but it was inferring operational authority from static code rather than live environment state. That gap between capability and context is the root cause. This article breaks down exactly how that happens, and what runtime visibility does to stop it.

Read Post