Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

From Reactive to Proactive: A User-Centric Digital Strategy for Banks

In today's digital-centric banking environment, financial institutions must be able to provide seamless and reliable application performance across all digital channels - from a branch to a mobile device. Failure to do so results in real impact to customer satisfaction, trust, and loyalty. Modern banking applications are increasingly complex, running off of internet-centric distributed architectures involving many different parties and services. For these modern tech frameworks, traditional APM tools are no longer sufficient to ensure service reliability and optimal customer experience.

If your site is slow, it might as well be down.

It’s no longer enough for a site to just be available; it had to be fast. If the experience lags, your customers will bounce within seconds. The consequences scale fast: business stops and revenue disappears. You need to monitor performance across the full delivery chain because speed is what keeps users engaged.

Smarter Workflows, Faster Insights: How InfluxDB 3 Unlocks the Power of Python at the Source

Businesses across industries rely on time-stamped data to track system health, monitor performance, and improve operations. Whether it’s sensors on a factory floor or usage logs from a SaaS platform, time series data reveals how things change. As businesses digitize operations and add connected devices, sensors produce growing streams of time-based data. This opens the door to faster analytics and smarter automation. But legacy approaches can’t keep up.

Monitor agents built on Amazon Bedrock with Datadog LLM Observability

As large language models (LLMs) grow more powerful, organizations are deploying agentic AI applications to tackle complex, multi-step tasks. With Amazon Bedrock Agents, developers can orchestrate these agents to manage tasks such as triggering serverless functions, calling APIs, accessing knowledge bases, and maintaining contextual conversations—all while breaking down complex user requests or tasks into manageable steps.

Proactively troubleshoot with synthetic testing and distributed tracing

As your application grows in complexity, identifying the root cause of issues becomes increasingly difficult. Many monitoring strategies make this even harder by siloing frontend and backend data. To effectively troubleshoot problems that spread across your app, you need visibility not just into each part of your stack, but also into how these parts interact.

A look back at DASH 2025

DASH 2025 brought the Datadog community together like never before. During our biggest event yet, thousands of attendees gathered at the North Javits Center in New York City for two and a half days of content, learning, and community, where they deepened their knowledge and connected with peers. Here's a quick look back at some of the highlights from this year's DASH.

Kubernetes Monitoring backend 2.2: better cluster observability through new alert and recording rules

We’re excited to announce version 2.2.0 of the backend for our Kubernetes Monitoring solution in Grafana Cloud is now available. The app’s backend is supported by kubernetes-mixin, an open source Prometheus Monitoring Mixin, and this latest version features significant improvements to alert rules and recording rules that will enhance your cluster observability and monitoring experience. There’s a lot to tell you about, so let’s dive in.

IT Task Automation: Best Practices and Use Cases for IT Management with Pandora FMS

IT teams must handle a large number of tasks on a daily basis. Many of these tasks, while essential, are repetitive: resetting passwords, rebooting servers, monitoring logs for errors, applying patches… When performed manually, they can overwhelm technical staff and compromise operational efficiency. IT automation has emerged as the answer to this challenge. It involves using scripts and specialized tools to automatically execute these and other tasks that previously required human intervention.

ScienceLogic Named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms

It’s official: ScienceLogic has entered the observability arena. Named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms, we believe we’re helping define where observability is heading, not just where it’s been. This marks our first inclusion in this Magic Quadrant and, in our opinion, validates our mission to redefine intelligent, actionable observability in the era of AI and automation.

Microsoft SCOM Management Pack Housekeeping in Secured, Offline, or Air-Gapped Environments

MP Catalog Offline Toolkit by NiCE | 20min Walkthrough Struggling with Management Pack updates in restricted environments? Discover how the MP Catalog Offline Toolkit by NiCE simplifies SCOM MP management—without the need for an internet connection. Watch the 20-minute walkthrough now and see how this free tool helps your SCOM team stay compliant, efficient, and secure Download it now on GitHub – absolutely free, from the experts at NiCE.