Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Builder in the loop: Henry Andrews on building AURA like production software

An interview series with the people building Mezmo’s open-source agentic harness for production operations. Builder in the loop is a Mezmo interview series focused on the engineers, product leaders, and operators shaping AURA, our open-source, MCP-native agentic harness for production operations. The goal is to get past the polished product layer and talk through the decisions that matter when AI starts interacting with real systems. What should agents be allowed to do?

ActiveMQ Message Persistence: KahaDB, Artemis Journal & JDBC

Every persistent message in ActiveMQ must survive a broker restart. That guarantee is the contract behind DeliveryMode.PERSISTENT is what separates a messaging system from a memory buffer. It is also what makes message persistence configuration the most consequential decision in ActiveMQ architecture.

Why Siloed Monitoring Increases Your MTTR and How to Resolve It

Are you spending more time figuring out whose problem it is than actually fixing it? If that feels familiar, you are not alone. Many IT teams start their day with multiple dashboards and tools, yet still struggle to understand what is wrong when something breaks. Everything may look fine in one view, and fine in another, but the customer impact tells a different story. Incidents end up taking longer to resolve than they should. This is not about effort or capability.

Tips and Tricks for Handling Secrets in Icinga 2

Today, we are going to look at a few things related to handling secrets. While Icinga 2 has no dedicated mechanisms for secret handling, there are a few tricks you can do with standard features. This is not meant as a step-by-step tutorial, but rather as an inspiration where you can adopt the ideas that make sense in your setup.

Observability for the Agent Era: Day 1 | Keynotes

Honeycomb's Innovation Week: Observability for the Agent Era (May 12-14) For Day 1 of Innovation Week, Honeycomb co-founders Christine Yen and Charity Majors will share what it actually takes to understand and debug systems in the agent era, and what the best engineering teams are doing differently. A 3-Day Virtual Event for Teams Building the Future May 12: Get insights on how the best engineering teams are tackling the challenges of the agentic era.

Redgate Monitor | AWS Database Migration Readiness

n this demo, we explore the AWS Database Migration and Modernization (D2M) framework, from Align and Assess, trough to Optimize, and show how Redgate Monitor helps you to establish performance baselines, right-size target environments and continuously optimize RDS and Aurora spend for full cloud cost visibility. Learn how Redgate Monitor can give you a single view of your entire AWS and on-premises, multi-database environment.

Stop Guessing, Start Fixing: AI Root Cause Analysis

Automating root cause analysis is often regarded as the holy grail of IT operations. A solution capable of automatically identifying issues, resolutions and even prevention. Performed correctly, automated root cause analysis accelerates MTTI (Mean Time to Identify) and MTTR (Mean Time to Resolution). But for many platforms, this goal remains elusive: complexity, differences between deployments and different architectures make automating root cause challenging.