Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How to Achieve Deep Network Visibility with SolarWinds Observability SaaS

Looking for a faster way to discover every device on your network? This video walks through how SolarWinds Observability automatically scans and classifies network gear—including routers, switches, access points, firewalls, and SD-WAN devices—in seconds. You’ll learn how to: This is the easiest way to get full network visibility without scripts, config files, or manual inventory work.

#observability needs more than tools. It needs the right data.

Good observability starts with good data. In this clip, we hear how Cribl gives teams real control over their data pipelines so they can collect, enrich, and route telemetry from any source to the right destination. It is not just about more dashboards or another platform. It is about building an observability ecosystem that connects IT, security, and the business through cleaner data and smarter AIOps. Tool rationalization and AI driven pipelines are not future goals. They are happening right now.

Azure Monitor offers Grafana dashboards natively for immediate, real-time operational monitoring

Editor’s note: This blog originally published in May 2025 when Azure Monitor dashboards with Grafana became available in public preview. It was updated in November 2025 to reflect general availability. The Grafanaverse just got a little bit bigger.

How to pair Grafana Drilldown with Loki for faster logging insights

Our logs can tell us so much about the state of our systems, but they can also be a bit overwhelming. Yes, Grafana Loki—and, by extension, Grafana Cloud Logs, which is powered by Loki—reimagined the way log aggregation systems could meet modern engineering demands, but logs, by their very nature, are still voluminous.

Elasticsearch: The context engine for grounding and orchestration in Microsoft Azure AI Foundry Agent Service

The rise of large language models (LLMs) and agentic applications promises to transform enterprise workflows. Yet, the core challenge remains: How do we ensure these powerful agents generate accurate, relevant, and trustworthy responses based on proprietary enterprise data rather than relying solely on their generic training knowledge? The answer lies in grounding — connecting the LLM to verified, trusted, and up-to-date information.

The Shifting Nature of Organic Search in 2025

For decades now, search engine optimization (SEO) has been viewed as a “cheat code” channel – a method for businesses of any size or budget to achieve organic growth and scale against larger competitors. Industry research over the past two years has valued the SEO industry itself at over $150 billion, and projected to grow by an additional 20% by 2030, as there are thousands of case studies evidencing the value of investing in SEO as a growth channel.

Introducing Datadog Agent Builder: Build agentic workflows for alert response and remediation

Building automated workflows that adapt to real-world complexity can be a challenge. As systems scale and scenarios multiply, teams often end up hardcoding endless logic branches just to handle every potential outcome. That’s why we’re introducing Datadog Agent Builder, a powerful new tool that lets you create custom AI agents that are fully hosted by Datadog.

Optimizing Ruby performance: Observations from thousands of real-world services

Over the past three decades, Ruby has assumed a pivotal role in the modern web stack and become a fixture in the tool kits of countless DevOps and platform teams. Today, it is a driving force in contemporary application development, testing, automation, and CI/CD. For this blog post, we used data from our always-on continuous profiling of more than 3,000 real-world services from hundreds of organizations to track trends in Ruby usage and performance.

How to Speed Up Incident Response With Guided Remediation

Most teams picture incident response as a linear sprint from alert to resolution. A notification appears, an analyst pivots across screens, a decision gets made, and the workflow moves on. It works, but it is mechanical, tiring, and fragile. Graylog 7.0 aims for something more impactful. Guided remediation gives analysts clarity during the moments when pressure rises and context usually scatters. It takes raw detection data and turns it into a clear path forward. No theatrics.

Introducing Kentik AI Advisor: The Future of Network Intelligence

Introducing Kentik AI Advisor, a powerful new AI designed to deeply understand your network, reason through complex issues, and deliver clear, actionable guidance for designing, operating, and protecting your networks. By autonomously querying Kentik’s rich telemetry and tools, it explains what’s happening, why it matters, and what to do next — from troubleshooting and capacity planning to cost optimization and risk mitigation.