Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Not so "mini"-dumps: How we found missing crashes on SteamOS

We shipped an improvement to Sentry's game engine and native SDKs that most developers probably didn’t even notice until now – unless they were explicitly aiming to test their Windows-built games on Linux with Wine/Proton compatibility layers. That's exactly the point. While we were focused on improving our game engine SDKs, our learnings while investigating a mysterious issue are applicable for any Windows application running on Linux via Wine or compatibility layer.

Cybersecurity Monitoring Best Practices: Building a Stronger Defense Against Modern Threats

Let’s be honest—the cybersecurity battlefield keeps changing fast. Attackers evolve their tactics, networks expand, and data flows in every direction. If you’re responsible for protecting your organization’s security, staying ahead of cyber threats can feel like chasing shadows. This is where effective cybersecurity monitoring comes in.

Pastries with SREs: From AIOps to GenAI and LLMs (lactose-free latte making)

In this episode of Pastries with SREs, we look at AIOps, where it fell short, where it worked, and how generative AI (GenAI) is reshaping what’s possible in observability today. We explore: If you’re wondering whether generative AI is different this time, this episode offers a grounded, practical look at how it’s evolving observability workflows.

What Are AI Guardrails

When you're shipping LLM features, a lot of the work goes into keeping the model's behavior predictable. You deal with questions like: These are everyday concerns when you integrate LLMs into production systems. Guardrails AI provides a Python framework that helps you enforce those expectations. You define the schema or constraints you need, and the framework validates both the inputs going into the model and the outputs coming back.

Top 10 APM Tools [2026 Guide]

In 2026, application performance isn’t just a technical metric—it’s a business-critical factor. As organizations move deeper into cloud-native architectures, distributed systems, and AI-driven workflows, ensuring speed, reliability, and uptime has become non-negotiable. According to Gartner, by 2026 more than 70% of new APM implementations will be cloud-native, and businesses that leverage advanced observability platforms are expected to reduce downtime by up to 60%.

Why Your Website's Speed & Structure Affect Visibility

Website performance and organization are vital for a brand's digital success in today's competitive online environment. Users demand quick responses and seamless experiences, and delays can lead to frustration and lower search engine rankings. Focusing on load speed, straightforward navigation, mobile compatibility, and technical stability is crucial for businesses to stay relevant and competitive. A fast, well-organized website provides users with instant access to information, easy navigation, and low friction. Neglecting these aspects can lead to missed opportunities, reduced organic traffic, and poor online engagement.

Flight watch: Optimizing flight operations with real-time monitoring

Aviation has always relied on precise planning and timely communication. However, the rapid development of digital tools has transformed the way airlines, operators, and dispatchers manage flights. One technological advancement at the forefront of this progress is flight tracking and monitoring software, which enables real-time oversight of aircraft movements across the globe. Among these tools, flight watch stands out for its intuitive interface and rich functionality, offering operational teams a crucial advantage in a complex landscape.
Sponsored Post

Top 10 Statuspage.io Alternatives in 2025

Choosing the right status page solution can make the difference between customer trust and customer churn during incidents. This guide compares the top status page alternatives to help you find the perfect fit for your team's needs-whether you need public incident communication, internal vendor monitoring, or enterprise-grade features.
Sponsored Post

Transform your workflow with Raygun's remote MCP

We're happy to announce Raygun's new remote MCP server, giving AI tools direct access to live error data so they can investigate issues, surface root causes, and take action with real context, not guesses. It's been nearly a year since Anthropic released the Model Context Protocol (MCP), and a lot has changed in the AI space. Since then, almost all major players now support MCP, allowing them to tap into the massive and ever-expanding catalogue of MCP servers. When MCP first launched, we shipped our own Raygun MCP within 48 hours of the spec dropping, which was an early step toward giving LLMs visibility into Raygun data.