Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Preparing for Infoblox NetMRI End-of-Life: Why Restorepoint is the Ideal Replacement

When a trusted tool like NetMRI reaches its sunset date, it opens the door to modern alternatives that offer more automation, broader integration, and a lower total cost of ownership. You’ve invested time, training, and trust into this solution, and while it may feel like the rug is being pulled out, this is an opportunity to improve how your organization handles network configuration and change management.

Smarter debugging with Sentry MCP and Cursor

Debugging a production issue with Cursor? Your workflow probably looks like this: Alt-Tab to Sentry, copy error details, switch back to your IDE, paste into Cursor. By the time you’ve context-switched three times, you’ve lost your flow and you’re looking at generic suggestions that don’t show any understanding of your actual production environment or codebase.

With AI, You're Gonna Have to Manage Your (Massive) Energy Use in SPM

Forget boring spreadsheets. Strategic portfolio management (SPM) isn't just about ticking boxes. It’s the big boss plan that makes sure every penny spent and every project your company starts points towards the main goal. It's your company's smart GPS, guiding you through the AI energy maze. When it comes to AI's power hunger, SPM is a knight in shining armor. It helps leaders get smart, making sure they grab all the fancy tech without trashing the world.

Streamlining the Complexity of SD-WAN Deployments With DX NetOps Topology

If you're feeling like your network operations just keep getting more complicated, you're not wrong. One of the core promises of cloud models was improved simplicity. However, the ensuing reality for your network operations teams has been anything but simple. Suddenly, users and applications are everywhere. Traditional, on-premises equipment now coexists with software-defined wide area networks (SD-WANs), cloud-hosted resources, and hybrid connections that hop across public and private networks.

This Month in Datadog - July 2025

In July’s episode of This Month in Datadog, we’re doing things differently by spotlighting the people behind the products you rely on. Jeremy is joined by Tristan Ratchford to discuss saving time and effort when you’re on call with Bits AI SRE, and by Kevin Hu to explore gaining visibility into datasets across the entire data lifecycle with Data Observability.

Cloudflare's DNS Downtime: Why BGP Hijacks Were Never to Blame

On July 14, Cloudflare’s popular public DNS service (known as 1.1.1.1) suffered an outage lasting over two hours. As rumors swirled about the cause, we were the first to push back on the theory that a BGP hijack had caused the outage. In fact, the hijack was actually a consequence. How did we know this so early when other internet watchers did not? We’ll discuss in this post.

Complete Guide to Redis Monitoring: Essential Metrics, Tools & Best Practices 2025

Redis is a powerful tool, but its position in the critical path of applications means that performance issues can have a widespread impact. Whether you use Redis as a cache, session store, or primary database, effective monitoring is essential to prevent slowdowns and ensure a responsive user experience. This guide provides a comprehensive walkthrough of Redis monitoring, covering the essential metrics you need to track, the tools available to you, and the best practices to adopt in 2025.

Building a bulletproof network disaster recovery plan

Imagine it’s 2am. A core switch fries because of a sudden power surge. Most of your users wake up to a blank screen. Your team scrambles: Where’s the backup configuration? Who knows the last working state? Hours pass, productivity tanks, support calls flood in, and costs stack up by the minute. This isn’t a theoretical horror story. According to Gartner, the average cost of network downtime still hovers around $5,600 per minute, or over $300,000 per hour.

Netdata Overview: All You Need to Know in Under 3 Minutes

In just a few minutes, this walkthrough will show you how to unlock the full power of Netdata during your trial period. From real-time metrics to AI-powered insights, learn how to get immediate value without any guesswork. Whether you're running a Homelab or managing production systems at scale, this video will help you hit the ground running and make every minute of your trial count. Let’s turn your trial into insight, clarity, and control.