Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How Sentry's AI Autofix Changed my Mind About AI Assistants

Blockchain, IoT, Big Data. If you’ve been around in tech for a while, you know that these kinds of buzzwords come and go: they make a splash going in and fizzle out over time. Seeing many of them come and go over the years has made me skeptical. What are they trying to sell us this time? Some might call it getting grumpy; others might call it becoming an enterprise architect. So you’ll have to forgive me for thinking AI agents seemed like just another buzzword.

Building a Self-Service and Scalable Observability Practice

Join us in this session and learn how Splunk can help you build a standardized observability practice. From implementing an observability-as-code service to role-based access controls (RBAC), Token Management, Metrics Pipeline Management, and OpenTelemetry, learn how to create an Observability platform to optimize your metrics usage and costs while managing workloads efficiently.

Elastic extends production-ready AI capabilities for all!

Elastic Security is making your organization safer with general availability of our favorite AI features. Elastic Security is announcing the general availability (GA) of two of our most widely deployed generative artificial intelligence (GenAI) capabilities: Attack Discovery, launched in May, and Automatic Import, launched in August. Elastic’s AI-driven security analytics are providing immense value to many organizations.

Observability: It's Every Engineer's Job, Not Just Ops' Problem

For years, organizations have used the term “observability” as an evolution of monitoring, a discipline practiced by operations teams to understand whether production software was working. I’ve been annoyed by this—not because it’s philosophically wrong, but because it diminishes the importance of observability as a generalized software engineering practice.

TCP Monitoring With AppNeta: Why Expanded Support is a Game Changer

Broadcom continues to expand the capabilities of AppNeta by Broadcom, offering ongoing enhancements in features and value. With the introduction of TCP protocol support, users can now achieve more streamlined setup processes and deeper visibility into modern network paths. These enhancements help eliminate blind spots and improve monitoring accuracy across complex network environments. Review this post to learn more about these valuable new capabilities.

Deadman Alerts with the Python Processing Engine

Sometimes silence isn’t golden; it’s a red flag. Whether you’re monitoring IoT sensors, system logs, or application metrics, missing data can be just as critical as abnormal data. Without visibility into these gaps, you risk overlooking potential failures, security threats, or operational inefficiencies. In time series workflows, detecting silence is often the first sign of trouble—whether it’s a network issue, device failure, sensor failure, or stalled process.

Comparing ELK, Grafana, and Prometheus for Observability

Monitoring and observability are cornerstones of modern infrastructure management. Three popular solutions that often come up in this space are the ELK Stack, Grafana, and Prometheus. This comparison breaks down the key differences, use cases, and integration capabilities to help you determine which tool or combination better suits your operational needs.