Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Opsgenie alternative: How to migrate to Grafana Cloud IRM

In recent years, we’ve seen many organizations migrate from legacy incident response tools to Grafana Cloud IRM — our unified incident response and on-call management application hosted on Grafana Cloud — as they look to improve reliability, reduce costs, and consolidate their tooling. To help guide those efforts, we offer several IRM migration tools that allow you to more seamlessly migrate away from those legacy solutions and start using Grafana Cloud IRM.

Top 5 Incident Response Platforms for 2025

An incident response platform helps organizations manage, track, and resolve IT incidents quickly and efficiently. With the right platform, teams can minimize downtime, reduce the impact of incidents, and improve overall response times. ‍ In this article, we’ll explore the top 5 incident response platforms for 2025, helping you choose the best solution for your needs. ‍

incident.io raises $62M in Series B fundraising

00:00 We're thrilled to share that Incident.io has raised $62 million in our Series B, led by Insight Partners.

00:11 Four years ago, we were three people around a kitchen table. Today, we're a team of 80 with thousands of teams using our platform to solve over 250,000 incidents a year. Whether you're streaming Netflix or buying something on Etsy, chances are our platform helped resolve the incidents behind the scenes.

Squadcast Strengthens Its Leadership in IT Alerting and Incident Management in the G2 Spring Report

2025 has already started out to be a remarkable year for Squadcast—with our key wins in the G2 Spring Reports, our acquisition by SolarWinds, and a series of impactful product releases and improvements. Our mission has always been clear: to deliver a unified platform that seamlessly integrates On-Call Management and Incident Response, empowering teams to boost service reliability and productivity—all without the burden of context switching.

Opsgenie Is Sunsetting: What to Look for in an Alternative

Atlassian is retiring Opsgenie, and if you're one of the teams relying on it to manage on-call and incidents, you're facing a tough question: Do you make the forced migration to Jira Service Management or Compass, scramble for a lookalike tool — or use this moment to upgrade your entire approach to incident response? If you’re facing that decision, we get it. Changing tools midstream isn’t ideal (to say the least). But it’s also a rare opportunity to take a meaningful step forward.

Metrics That Matter: Measuring Developer Productivity in the AI Era

In this episode, Ryan McDonald is joined by Mark Quigley, Head of Platform Engineering at Ninety.io, for a conversation that cuts through the noise around developer productivity metrics and AI. Mark dives deep into how teams can measure what matters—without falling into the trap of turning every measure into a target. He shares how tools like Developer NPS, DORA metrics, and balanced scorecards can help teams optimize for both output and well-being—but only when framed with the right intent.

The timeline to fully automated incident response

We speak to engineering teams every day, and everybody knows AI is the future. Some tell us they’re massively accelerated by Claude, or that they’re rebuilding their product, team and ways of working. Cursor and Lovable have announced they’re building the last piece of software. Should we give in to the vibes? Embrace exponentials, and forget that the code even exists? The reality is that things will still go wrong. They always do, at least from time to time.

Infrastructure Monitoring: A Comprehensive Guide to Integrating Effective Alerting

Imagine you’re the IT guardian of a busy company. Every day, you rely on infrastructure monitoring tools to keep an eye on your servers, networks, and applications. These tools are your early warning system – they spot glitches before they become full-blown problems. But what happens when an alert is missed or delayed? That’s where effective alerting comes in.

Mastering incident routing: a critical component in incident management

Imagine this: a high-priority alert is triggered, but it’s routed to the wrong team, or delayed by manual triage. By the time the right person is notified, the issue has escalated, and users are starting to notice. Technical failures don’t always cause these kinds of incidents. More often, they stem from something simpler: poor alert routing.

How to Fine Tune Your IncidentHub Alerts

IncidentHub can send outage alerts to many external systems. You can choose from Slack, Webhook, Email, Discord, PagerDuty, and more. Alerts are effective only when they are relevant and actionable. In this article, we will explore how to fine-tune your IncidentHub alerts to receive only the relevant ones for your third-party services.