Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The Hidden Cost of DIY AI in Network Operations

While AI offers powerful benefits for network operations, building an in-house AI solution presents major challenges, particularly around complex data engineering, staffing specialized roles, and maintaining models over time. The effort required to handle real-time telemetry, retrain models, and manage evolving environments is often too great for most IT teams.

Serverless Monitoring In The Cloud With Bindplane and OpenTelemetry

Almost two years ago I wrote the first installment of what was supposed to be a 3 part series on Serverless Monitoring. Parts two and three never materialized. Today, however, I am revisiting that original idea and expanding upon it. I hope to succeed this time in making it a full three-part series. For this first installment (Revisited), I will again work with Google Cloud Run to monitor MongoDB Atlas.

Debug App Performance Down to the Function Call-Introducing Continuous Profiling & UI Profiling

When something slows down in prod, it’s too easy to fall into old habits. Throw in a few more logs, ship some metrics, try to reproduce the issue locally, and maybe reach for perf or py-spy if you’re feeling ambitious. Traces can help, but they usually stop just short of explaining why things are slow, especially when it’s deep in the stack.

New: Restrict subscriber email addresses by domain

We’ve just rolled out a highly requested feature: Email domain restrictions for your status page subscribers! Now you can control who subscribes to your status page updates by restricting access to email addresses from specific domains. Whether you want to limit subscriptions to internal team members or approved partners, this feature gives you the flexibility to manage your audience with precision.

GDPR Log Management: A Practical Guide for Engineers

GDPR compliance for logs can be tricky—especially when you're trying to maintain system visibility and protect user data at the same time. For SREs and IT teams, it’s a balancing act between staying on the right side of privacy laws and not losing the context you need to troubleshoot. This guide walks through practical ways to handle personal data in logs, set up retention rules that make sense, and stay compliant without creating unnecessary friction.

Why Software Performance Optimization Is Business-Critical - and Often Overlooked

You've probably heard this before: "If it works, don't touch it." And while that might fly in some areas of life, in software development it's a dangerous mindset - especially when it comes to performance. Many companies build digital products that technically work. They launch, they onboard users, and they don't crash on day one. But fast forward a few months - or a few years - and the same product becomes sluggish, bloated, and frustrating to use. It's not broken - but it's bleeding revenue and trust, quietly and continuously.

Remote Desktop File Transfer: Securely Move Files Between Desktops

Remote desktop file transfer allows people to transfer files between remote desktops and local machines quickly—and it is more vital than ever. IT teams rely on remote desktop tools to access servers, while remote employees use these tools for file transfers when working from home or collaborating with colleagues at other locations.