Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Prioritize errors and create tickets using Rollbar's MCP Server

Production errors can feel overwhelming. Your Rollbar dashboard is filling up with alerts, your team is scrambling to understand what needs immediate attention, and critical revenue-impacting issues might be buried among less urgent problems. Sound familiar? In this post, I'll walk you through a workflow that transforms production error chaos into organized, prioritized action items. We'll cover everything from analyzing Rollbar errors to creating properly linked Linear tickets.

Cloudflare outage: another wake-up call for resilience planning

Another day, another massive Internet disruption, and this time it’s Cloudflare taking huge parts of the Internet offline. This incident is not an anomaly. It is part of a recurring pattern that has become standard in digital infrastructure. We have reached an inflection point in digital operations. Outages at major cloud and content delivery network (CDN) providers are now expected. The only real uncertainty is when it will happen next.

Introducing webvitals.com: Find out what's slowing down your site

Developers don’t need another “run this tool, stare at a number, and feel bad about it” website. So we built something different. WebVitals helps you analyze, optimize, and ship faster websites, all in one place. Built by the same folks who obsess over stack traces and slow queries, it connects the dots between performance metrics and what’s actually slowing your users down. In one place, you can.

Uptrends x OpenTelemetry: Stream browser-level synthetic data into your observability stack

Dashboards and alerts can tell you something’s wrong, but they don’t immediately tell you why. A red indicator or synthetic test failure prompts detective work. You flip between dashboards, timestamps, and logs, trying to line up what the check saw with what the system did. Now imagine your monitoring could explain itself by sending traces directly into your OpenTelemetry (OTel) backend.

Distributed Tracing for Microservices: 10 Essential Best Practices for 2026

Distributed tracing tracks how a single request moves across multiple microservices, helping teams see the entire execution path end to end. In modern architectures where dozens of services interact, it becomes difficult to understand where latency starts, why bottlenecks appear, and which component breaks under load. Traditional monitoring only shows isolated metrics. Distributed tracing connects those dots.

Why IT Outsourcing Is Becoming a Must-Have for Modern Operations

There's a quiet shift happening inside many organizations. Not the kind that makes headlines, but one that shows up in smoother workflows, fewer emergency calls, and teams that aren't constantly scrambling to "just keep things running." Operations leaders are realizing that the technology foundation of a company is no longer something that can be handled casually or reactively. Everything - processes, productivity, customer experience, and even employee morale - leans on the stability of IT.

Modernising Middleware and B2B Integration with Assurance

Modernising enterprise middleware is now a strategic necessity for cost efficiency, AI-readiness, and operational clarity. Hybrid estates of IBM MQ, Apache Kafka, and other brokers hide inefficiencies that drain profitability, but an operating model built on Assurance and Optimisation restores transparency and control. By unifying data, rebalancing workloads, and enabling safe AI autonomy, organisations can build a resilient “Confidence Economy.”