The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How to Communicate the Value of DEX Across Your Organization

For many EUC and Digital Workplace leaders, the challenge with digital employee experience (DEX) isn’t the technology; it’s building alignment. You can see the data. You know where friction exists. You can quantify disruption, productivity loss, and inefficiencies. But you struggle to achieve your targets because you need buy-in from other teams, and right now they don’t want to hear anything about DEX. Security has different priorities. Application owners are focused on releases.

Internet Speed Monitoring - How to Proactively Test Your Internet Connections

Recent enhancements to eG Enterprise let you proactively test your internet speed with synthetic monitoring (“robot” tests that simulate real user activity). Using the new functionality, you can monitor internet speeds 24×7 from any location. The performance and quality of an Internet connection play a major role in any IT environment. Use cases for this new functionality include.

Icinga Installation Guide - Part 1 - Getting started with a base Icinga Installation

Get up and running with Icinga 2 and Icinga Web in this step-by-step installation guide. In this video, we walk you through a complete base installation of Icinga, covering everything from setting up the database to accessing the web interface for the first time. This will help you get to the point of a working installation, especially if you're new to Icinga. We take you through the full process, including installing required components, configuring databases, enabling services, and completing the web setup wizard.

Icinga Installation Guide - Part 2 - Installing Icinga Director and configuring your first objects

Take the next step with Icinga by adding the powerful configuration management tool Icinga Director to your setup. In this second part of our installation guide, we focus on simplifying and scaling your configuration using the Director. You’ll learn how to connect it to your existing Icinga 2 instance, create reusable templates, and start monitoring hosts and services through a more flexible, web-based interface.

Leveraging Cognitive Diversity to Tackle System Complexity

Most engineering leaders today understand that diversity matters. They've built teams that reflect a range of backgrounds, functions, and experience levels. They run postmortems, retrospectives, and architecture reviews that bring multiple voices to the table. They believe, not unreasonably, that this variety of perspectives leads to better decisions. But there's a problem hiding inside that assumption that can undermine everything: who people are is a surprisingly poor predictor of how they think.

Observability Lessons From OpenAI

Writing code is moving from the good old IDE into the realm of autonomous AI agents. One example of this is OpenAI, which has been developing internally with zero lines of manually written code. You can read about their workflow in their engineering blog: Harness engineering: leveraging Codex in an agent-first world. For me, the main takeaway of OpenAI’s article is how AI has rewritten the constraints equation.

API Error Monitoring: A Complete Guide to Detecting and Resolving API Failures

APIs power nearly every modern digital experience. From mobile apps and SaaS platforms to payment gateways and internal microservices, APIs handle authentication, transactions, content delivery, and system-to-system communication. When an API fails, users often experience broken features, slow responses, or complete service outages. In many cases, they leave before your team even realizes something is wrong. The business impact of API failures is significant.

API Availability Monitoring: How to Measure True API Availability

APIs are no longer just integration layers. They power customer logins, payment processing, SaaS workflows, partner ecosystems, and mobile applications. When an API becomes unavailable, revenue stops, user trust declines, and service level agreements are immediately at risk. Yet many teams still define API availability in the simplest possible way. If an endpoint responds with a 200 OK, the API is considered available. Monitoring dashboards stay green. Alerts remain silent. Everything appears healthy.
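
As a rough illustration of why a bare 200 OK can be misleading, here is a minimal Python sketch that counts a probe as "available" only if it also arrives on time and carries a well-formed body. The names `ProbeResult`, `is_available`, and `availability`, the 2-second latency budget, and the required `status` key are all assumptions made for this example; they do not come from the article or from any particular monitoring product.

```python
import json
from dataclasses import dataclass


@dataclass
class ProbeResult:
    """One synthetic check against an endpoint (hypothetical structure)."""
    status: int        # HTTP status code returned by the endpoint
    body: str          # raw response body
    latency_s: float   # time taken to receive the response, in seconds


def is_available(result: ProbeResult, required_key: str = "status",
                 max_latency_s: float = 2.0) -> bool:
    """Return True only if the response is 200, on time, and well-formed."""
    if result.status != 200:
        return False
    if result.latency_s > max_latency_s:   # too slow to count as usable
        return False
    try:
        payload = json.loads(result.body)
    except json.JSONDecodeError:           # empty or malformed body
        return False
    # Assumed contract: a JSON object containing the required key.
    return isinstance(payload, dict) and required_key in payload


def availability(results: list[ProbeResult]) -> float:
    """Fraction of probes that passed every check (0.0 when there are none)."""
    if not results:
        return 0.0
    return sum(is_available(r) for r in results) / len(results)
```

With these assumed checks, a probe that returns 200 with an empty body, or 200 after five seconds, would show green on a status-code-only dashboard but would lower the availability figure computed here.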