Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Incident Response Management: A Category of Its Own

In recent weeks, I’ve spoken with several Opsgenie customers who are evaluating a migration to ilert after Atlassian’s decision to phase out Opsgenie and fold its functionality into other products. Atlassian is giving Opsgenie users “two options: move to Jira Service Management for robust end-to-end incident management, or move to Compass for alerting and on-call management.” This has raised a broader question in our industry: ‍

From Tickets to Action: Ensuring Proactive IT Support with Jira and OnPage

We’re excited to announce the launch of our bi-directional integration between OnPage and Jira! This integration is designed to bridge the gap between ticket creation and incident response, ensuring that IT, DevOps and other tech teams who rely on Jira to manage their incidents can automatically identify and engage the right on-call staff—ensuring critical incidents are addressed in real time without delay.

OpsGenie End of Life? What's next for OpsGenie users.

If you haven’t heard already (which would be shocking considering the numerous posts I’ve seen on Reddit) Opsgenie’s end of life is right around the corner. This means there is no better time for Opsgenie users to explore alerting and on-call management tools outside of the limited alternatives provided by Atlassian. So, I felt now is a better time than any to address the needs of those affected by the dissolution of Opsgenie and reveal why OnPage should be your new platform of choice.
Sponsored Post

Incident Response Process: Stages, Framework & Best Practices

These days, organizations must be prepared to handle unexpected disruptions efficiently. Whether it's a cybersecurity breach, system failure, or a natural disaster, having a structured Incident Management Process is essential. The Incident Management Team plays a crucial role in swiftly identifying, assessing, and resolving incidents, minimizing downtime, and ensuring business continuity. This blog explores the stages, framework, and best practices of incident management to help businesses build a robust response system.

Alertops Vs Jira Service Management: Why pay for ITSM when all you need is on-call and alerting?

When an incident happens—your systems go down, a critical service fails, or your end users start flooding support channels—what you need is fast, reliable alerting and an on-call team that can respond immediately. But if you’re using Jira Service Management (JSM) for this, chances are you’re paying for a lot more than just that.

Opsgenie vs JSM vs AlertOps: Do you need a full-stacked ITSM platform or just alerting?

If you’ve been relying on Opsgenie for real-time incident alerts and on-call scheduling, you’ve likely seen the writing on the wall: Opsgenie is being absorbed into Jira Service Management (JSM). For some teams, that may sound like a logical step forward. But for others, it poses a much more critical question.

An ultimate step-by-step guide on Zabbix Cloud Monitoring

‍ Learn how to set up Zabbix Cloud for AWS Auto-Discovery and receive critical alerts via SMS, phone calls, or push notifications. ‍ During the last Zabbix Summit, the company presented a cloud version of its well-known monitoring platform. We at ilert constantly see the growing popularity of Zabbix as more and more teams across the globe utilize it for their monitoring needs. To help users quickly adopt the new cloud version, we delivered this guide.

How we structure on-call rotations at Datadog

A well-structured on-call rotation helps you ensure the reliability of your services and meet your customers’ expectations by designating staff to respond to emerging issues. But the pressures of on-call work—such as long shifts, overnight hours, and dynamic situations—can compromise the well-being of your team members. This makes it harder for them to maximize service uptime during their on-call shifts and can limit the velocity of the feature work they do outside of their on-call duty.