Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Runbook Automation and Rundeck v5.9 Release Notes

Product Manager Forrest Evans takes us through the new features in Runbook Automation v5.9, including a demo of incorporating Azure Key Vault in your automation jobs. For a full listing of the release notes, see the release notes page. Learn more about automation solutions, including new components to support your FinOps needs on the solutions page.

OnPage Wins Spot on G2's Best Healthcare Software 2025 (Announcement Video)

OnPage Named in G2’s 2025 Best Healthcare Software List! We’re excited to share that OnPage has been recognized in G2’s 2025 Best Healthcare Software list! This recognition is driven by real customer reviews from healthcare teams who rely on OnPage to streamline communication, improve response times, and enhance patient care. In this video, our Head of Marketing, Ritika Bramhe, shares the big news and reads some of our favorite customer reviews that made this achievement possible.

Streamline IT incident response with the latest BigPanda features

Machine-generated data has exceeded human scalability, straining L1 Ops and Service Desk team resources. Fragmented data across tools, teams, and silos hinders situational awareness, delaying each action – from detection to remediation, making prevention increasingly unattainable. The latest BigPanda updates enhance ITOps and ITSM team efficiency throughout the incident lifecycle.

Feature Spotlight - User & Group Performance Reports

Understanding how groups and users respond to incidents is vital to refining and improving your incident response processes. Our user and group performance reports help admins visualize the way people in their organization handle notifications for alerts and incidents. These reports can be used to review performance data over a specific amount of time, allowing you to clearly analyze trends and changes, and identify groups that may be inundated with alerts, or users who may not be available when expected.

AI in Production with GitHub's Sean Goedecke

In this episode, we sit down with Sean Goedecke, Staff Software Engineer at GitHub, to discuss where LLMs fit into real-world development. Sean shares how he’s using LLMs how he’s drawing the line for AI-assistance in the codebases he manages—though, as he says, this might all change by next summer. Sean also weighs in on how LLMs could assist SREs during outages—especially when you’re only half-awake at 3 a.m. after a rather inconvinient page.

Why a mobile app is the key to better incident communication

While downtime is inevitable, communication should remain swift and transparent. Businesses need a way to relay updates as incidents unfold, ensuring customers, internal teams, and stakeholders stay informed in real time. Relying on emails and web-based updates alone is no longer enough. A mobile-first approach is the solution.

PagerDuty for Financial Services

PagerDuty acts as the primary interface for real-time actions, seamlessly connecting humans and systems. From the moment a monitoring tool detects a signal to the resolution of an incident, every action is automatically tracked and timestamped. With reduced human error and no risk of missed documentation, PagerDuty provides a reliable, efficient, and transparent incident management solution for financial entities.

Modernize Your NOC: A 2025 Guide to Reducing IT Costs and Protecting Profits

You can no longer afford to ignore the silent profit killers lurking in your operations. From bloated IT budgets to unplanned downtime and inefficient incident management, these hidden costs can drain your revenue, eroding customer trust, and exposing your company to financial penalties. The solution? A radical shift toward lean and modern Network Operations Centers (NOCs), digital resilience, and a relentless pursuit of inefficiencies.

Why a Mobile Alerts App Makes All the Difference in Efficient Mobile Alerting

written by Doreen Jacobi To understand the significance of a mobile alerts app, we need to first look at mobile technology in general. It is no secret that it has become an integral part of our personal and professional lives, fundamentally changing how we communicate, interact, work, and respond to challenges. With over 307 million smartphone users in the U.S. alone, smartphones are not just a convenience, they are at the center of our everyday life.

The New Retrospective Experience Is Now Available to All

A great retrospective isn’t just about documenting what happened — it’s about bringing your team together to uncover the insights that lead to real improvements in your process, roles, and technology. But to make that happen, retrospectives need to be structured enough to be effective, flexible enough to fit your team, and easy to collaborate on. That’s exactly what we set out to build.