Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Communicate During Events with Opsgenie's New Email Templates

Email is one of Opsgenie’s most popular notification methods for reaching your users. Now you can create Email Templates that reflect your company’s style and streamline communication processes! Email Templates are available for the emails sent for Mass Notifications and Stakeholder Notifications, and you can customize the content of the emails to suit your needs.

This IS NOT Fine: Putting Out (Code) Fires

So the dumpster is on fire. Again. The site’s down. Your boss’s face is an ever-deepening purple. And you begin debating whether you should join the #incident channel or call an ambulance to deal with his impending stroke. Firefighters have clear procedures and a strong hierarchy. The first truck at a scene immediately begins assessing the situation.

Reducing Noise with Event Intelligence

Learn how Event Intelligence, the next-gen approach to Event Management and AIOps, helps teams to cut through the noise and operate at scale. This introductory session will walk through key best practices and requirements such as reducing noise via adaptive machine learning, accelerating triage via integrating machine data with human response, and much more.

Introducing Jira Ops: Respond Faster with Atlassian + PagerDuty

Atlassian’s mission is to unleash the potential of every team. Atlassian’s newest product, Jira Ops, is built on top of Jira with a direct connection to PagerDuty so that teams can respond quickly when things break. This session will cover how PagerDuty and Jira Ops work together to help teams respond to incidents quickly and in real time.

Another Journey of Chaos Engineering

Chaos engineering is here to stay. There's a thriving community, numerous open source projects, a few books, even a startup. Companies are hiring chaos engineers and creating entire teams focused on chaos engineering. This talk is about strategies for launching a chaos engineering movement at your company, as well as the challenges and results you can expect.

Accelerating Incident Response

Incidents are never fun, but a bad incident response process makes them even less so. How do technical teams mobilize the right people and provide the right context and tooling to rapidly take action and drive incident resolution? With the clock ticking and up to millions of dollars lost per minute of downtime, there’s no time to waste in assembling the right experts.

How StatusHub Complements and Extends Your Incident Management Process

Although the main focus of StatusHub is incident communication, it complements each of the five activities of Incident Management: Identification, Categorization, Prioritization, Response, and Communication with the user community throughout the life of the incident.

It's Time to Start Talking about Digital Operations

IT operations teams have some of the most stressful jobs in IT. Keeping data centers online, servers running, enterprise systems functioning, and applications performing, all while responding to incidents and requests, is hard work. Although monitoring systems provide visibility and change management practices give IT some control over the network and environment, IT operations teams constantly feel like they are fighting a losing battle.