Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Logs & Lattes: Episode 1 - Smart Logging Without the Price Trap

How much value are you really getting from your logs, and what are you giving up to stay on budget? In this episode of Logs and Lattes, host Palmer Wallace sits down with Seth Goldhammer, VP of Product Management at Graylog, for a candid conversation about the hidden cost of traditional SIEM pricing. Seth explains how ingest-based and resource-heavy licensing models pressure security teams into tough tradeoffs, such as dropping logs, tuning down detections, or limiting retention just to avoid budget overages.

Frontend JavaScript performance testing: A comprehensive guide

When a page pauses for even a quarter-second users feel it, and many will tab away before the spinner stops. Front-end performance testing lets us spot those delays on our own machines instead of reading about them in support tickets. The browser runs JavaScript, layout, painting, and every user interaction on a single main thread. If one task takes too long, everything else queues up behind it.

The Evolution of Digital Adoption: Insights from Gartner's 2025 Market Guide

The 2025 Market Guide for Digital Adoption Platforms (DAPs) marks an important point in the evolution of the category. Digital adoption has matured from a supporting role into a central part of enterprise strategy. Organizations are no longer asking if they need a DAP—they’re asking which one. In this latest research, Gartner establishes DAPs as essential to business transformation, efficiency, and employee experience. The takeaway is clear: digital adoption is no longer optional.

Pastries with SREs: OTel me where the cronuts are

In this episode of Pastries with SREs, we tackle an observability debated topic: Do you need a Single Pane of Glass OR is OpenTelemetry a better strategy? We explore: Additional Resources: About Elastic Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale. Elastic’s solutions for search, observability, and security are built on the Elastic Search AI Platform — the development platform used by thousands of companies, including more than 50% of the Fortune 500.

Bridging the Gap Integrating Logs Metrics and Flow for Observability

In this video, we discuss handling both old and new systems in IT environments. From legacy SNMP setups to modern telemetry, most organizations juggle multiple data sources, which can make observability feel overwhelming. We explore how to combine logs, metrics, and flow data into one system that provides actionable insights. You’ll see practical examples of simplifying scattered tools and making sense of complex, disparate information. Understanding how these different types of data work together is key to getting observability right.

Top Node.js Application Challenges and How Monitoring Solves Them

Deploying a Node.js application may feel straightforward at first. Everything checks out in tests, staging runs smoothly, and early users run into no problems. But as real traffic ramps up, hidden problems start to appear in unexpected ways. Requests fail intermittently, latency spikes without warning, memory usage climbs silently, and logs are scattered across multiple processes making it nearly impossible to trace the root cause.

Distributed performance testing for Kubernetes environments: Grafana k6 Operator 1.0 is here

Performance testing is critical to build reliable applications, but testing at scale, especially inside modern Kubernetes environments, can be a challenge. For example, how do you coordinate tests across multiple nodes, test private services without compromising security, or even do both at once? And most importantly, how do you do all this without adding too much operational complexity to your stack?

Kubernetes Service Discovery Explained with Practical Examples

In Kubernetes, applications are constantly changing — new pods start, old ones shut down, workloads shift across nodes. The challenge is making sure that different parts of your system, and even external clients, can still find each other when the actual locations keep moving. That’s what service discovery handles. It provides a stable way for applications to connect and communicate, no matter where they’re running or how often the underlying infrastructure changes.