%term

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Alert Deduplication Rules - Reduce alert noise by grouping similar alerts together | Squadcast

Jul 5, 2022 By Squadcast In Squadcast

Alert Deduplication can help you reduce alert noise by organizing and grouping alerts. This also provides easy access to similar alerts when needed.

View Video

Squadcast

Read more about Alert Deduplication Rules - Reduce alert noise by grouping similar alerts together | Squadcast

What I learned from leading my first incident

Jul 5, 2022 By Milly Leadley In Incident.io

A few weeks ago we had a major incident. We were releasing our Practical Guide to Incident Management, and after posting about it online an incident.io employee noticed that the page wasn’t loading. Just to set the scene, I’ve been at incident.io for 3 months and don’t have any experience of incidents in my previous role. When the team got paged I expected this to be one of those “follow along and learn how the wizards work their magic” exercises.

Read Post

Incident.io

Read more about What I learned from leading my first incident

AlertOps is in the ConnectWise's 2022 PitchIT Accelerator Program!

Jul 4, 2022 By AlertOps In AlertOps

PitchIT is a competition for MSP innovators. The program is designed to showcase potential offerings that could be built or integrated into the ConnectWise platforms. It’s a 16-week accelerator program where AlertOps and the other participants will go through a rigorous business assessment, gain coaching from industry experts, earn placement on the ConnectWise marketplace, engage in co-marketing and more.

Read Post

AlertOps

Read more about AlertOps is in the ConnectWise's 2022 PitchIT Accelerator Program!

Top Five Pitfalls of On-Call Scheduling

Jun 30, 2022 By Squadcast Community In Squadcast

On-call schedules ensure that there's someone available day and night to fix or escalate any issues that arise. Using an on-call schedule helps keep things running smoothly. These on-call workers can be anyone from nurses and doctors required to respond to emergencies to IT and software engineering staff who need to fix service outages or significant bugs. Being on-call can be challenging and stressful. But with the proper practices in place, on-call schedules can fit well into an employee's work-life balance while still meeting the organization's needs.

Read Post

Squadcast

Read more about Top Five Pitfalls of On-Call Scheduling

Handling third-party provider outages

Jun 30, 2022 By Lisa Karlin Curtis In Incident.io

There are a handful of providers that large parts of the internet rely on: Google, AWS, Fastly, Cloudflare. While these providers can boast five or even six nines of availability, they’re not perfect and - like everyone - they occasionally go down.

Read Post

Incident.io

Read more about Handling third-party provider outages

Why More Incidents Are Better

Jun 30, 2022 By Andre King In Rootly

Ask most SREs how many incidents they’d have to respond to in a perfect world, and their answer would probably be “zero.” After all, making software and infrastructure so reliable that incidents never occur is the dream that SREs are theoretically chasing. Reducing actual incidents by as much as possible is a noble goal. However, it’s important to recognize that incidents aren’t an SRE’s number one enemy.

Read Post

Rootly

Read more about Why More Incidents Are Better

Why Operational Maturity Helps Businesses Reduce the Great Resignation Trend

Jun 30, 2022 By Laura Chu In PagerDuty

The past few years have led to fundamental business and cultural shifts for both companies and employees. Covid-19 has brought opportunities for companies who invested early in digital operations, while others struggled to maintain the status quo. The latter gave rise to record employee burnout, and what is now commonly referred to as the Great Resignation.

Read Post

PagerDuty

Read more about Why Operational Maturity Helps Businesses Reduce the Great Resignation Trend

BigPanda Unified Analytics for IT Operations

Jun 30, 2022 By BigPanda In BigPanda

With Unified Analytics from BigPanda, you can create new and highly interactive IT Ops dashboards from complex IT Ops data to gain new insights from KPIs that deliver business value.

View Video

BigPanda

Read more about BigPanda Unified Analytics for IT Operations

3 mistakes I've made at the beginning of an incident (and how not to make them)

Jun 29, 2022 By Robert Ross In FireHydrant

The first few minutes of an incident are often the hardest. Tension and adrenaline levels are high, and if you don’t have a well-documented incident management plan in place, mistakes are inevitable. It was actually the years I spent managing incidents without the right tools in those high-tension moments that inspired me to build FireHydrant. I built the tool I wished I’d had when I was trying to move fast at the start of incidents.

Read Post

FireHydrant

Read more about 3 mistakes I've made at the beginning of an incident (and how not to make them)

Better Data for Public Health: How Nexleaf and PagerDuty are Monitoring Healthcare

Jun 29, 2022 By Rachel Schmitz In PagerDuty

Having a reliable power source is something many of us take for granted. It is particularly important for healthcare facilities to have a consistent, reliable power source to ensure that vulnerable patients – specifically those who rely on electricity to sustain their lives – are not disrupted. In rural Sub-Saharan Africa, however, it’s estimated that only about 28% of hospitals have reliable electricity.

Read Post