Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Incident management tool integration

Picture the scene: a high‑severity alert fires, Slack lights up, and dashboards scream red. You’re juggling Datadog, PagerDuty, Jira, and status pages while trying to coordinate fixes. The problem isn’t a lack of tools; it’s that they aren’t talking to each other. This guide explains why incident management tool integration matters, how it cuts response times, and where to start.

AT&T Email-to-Text Service ended: Why SIGNL4 is the Best Alternative

In a move that caught many businesses and IT teams off guard, U.S. mobile carrier AT&T officially discontinued its email-to-text gateway service. ATT email to text was shut down on June 17, 2025 ( read more ). This change means that sending sms messages and mobile text alerts to AT&T subscribers using the format number@txt.att.net or number@mms.att.net no longer works.

Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

In this episode, we explore how networking has shaped reliability as we know it. Marino Wijay cloud networking expert and Staff Solutions Architect at Kong shares how his journey began not as an SRE, but with cables, routers, and switches. Marino explains the evolution of the fabric holding systems together through virtualization, and how software-defined networking, which is now a key element to resilient applications.

The New Rootly Ringtones: How Research-based On-Call Sounds

We set out to create a ringtone that wasn’t just loud—but the sound of a modern pager. Something that wakes you up, but without triggering a full-blown adrenaline spike. In this video, go behind the scenes with sound engineer Gorjão as he crafts a how research-based on-call sound sounds like.

How incident.io helps to reduce alert noise

We're often asked: "How does incident.io help reduce alert noise?" And it’s a fair question. It’s typically much easier to add new alerts than to remove existing ones, which means most organizations slow-march into a world where noisy, un-actionable alerts completely overshadow the high-signal ones that indicate a real problem.

Demo - Don't Settle for Less: Upgrade to PagerDuty in the Post-Opsgenie Era

Don't wait for Opsgenie's EOL to future-proof your operations. Migrating from Opsgenie to JSM isn't an upgrade–it's a leap of faith. Avoid risking your operations with a “good enough” tool and take the opportunity to rethink your incident management approach entirely. PagerDuty offers the enterprise-grade reliability, continuous innovation, and comprehensive incident management capabilities that modern operations demand.

Designing smarter on-call schedules for faster, calmer incident response

When an incident wakes your team early in the morning, the last thing you want is confusion about who’s responding or how help will arrive. An effective on-call schedule doesn’t just get the right person online. It helps them stay calm, confident, and capable of solving problems quickly. Done right, your on-call setup becomes a powerful lever for reducing Mean Time to Acknowledge (MTTA), Mean Time to Resolve (MTTR), and the overall stress that incidents place on your team.

Why you should embrace more incidents (seriously!)

We’re all looking for ways to improve on our incident response. We investigate various metrics and methodologies—all in the name of making sure our customers see the reliable and performant systems we’ve sought to build. In fact, all these efforts are leading us, as an industry, to finally realize the power of surprising anomalous events in our systems. They give us an opportunity to reexamine our expectations and see how our models of the sociotechnical system differs from reality.