Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

How Financial Entities Can Turn IT Outages Into Strategic Advantages

IT outages are a growing concern for financial entities, threatening both operational resilience and regulatory compliance. These disruptions don’t just create downtime—they also present unique opportunities for learning and transformation. By addressing common challenges and adopting forward-thinking strategies, organizations can turn outages into stepping stones for achieving operational excellence. Breaking down the barriers to incident management A lack of clear ownership.

Essential Software Deployment Best Practices for Success

Smooth and efficient software deployment is critical to delivering high-quality applications that meet user expectations. Still, many software failures can be traced back to deployment issues. A well-structured deployment strategy can help DevOps & SREs teams prevent these errors, ensure system reliability, and enhance user satisfaction. This guide explores software deployment best practices, from planning and execution to post-deployment monitoring and incident management.

Unlocking managed services provider growth with AIOps

As enterprises migrate to hybrid cloud environments, they face mounting pressures to manage complexities while cutting costs. Many turn to Managed Service Providers (MSPs) to streamline IT service delivery and drive results faster. For MSPs, this is both an opportunity and a challenge: the surge in hybrid cloud adoption, an explosion of observability tools, and rising operational costs push them to act decisively.

What's New: Annotate Messages for additional context

We’re thrilled to introduce Personal Message Notes, a new feature designed to enhance the way you document and manage critical communications. With this feature, users can now add private annotations to messages—offering space to add context, follow-up actions, and reminders that are visible only to the user and system administrators.

Feature Spotlight - Service Dependencies

To know how disruptions to one service might affect other services in your digital environment, it’s important to have a record of how applications and technical services connect within your architecture. Service dependency maps in xMatters define and visualize relationships between your services, so you can instantly see whether a service is impacted by any active incidents, and how that incident impacts other services. Dependency maps can be expanded to show additional upstream and downstream services and help identify a potential root cause.

RedIron: Unifying Alerts and Notifications in IT

RedIron Canada, a Managed Services Provider (MSP), Retail Integrator, and Solutions Provider, that specializes in managing cloud-based systems across AWS, Azure, and Oracle. Their expertise in IT monitoring and managed services makes them a trusted partner for retail businesses across North America. RedIron relied on traditional alert notification methods like email and SMS for their IT monitoring operations.

PagerDuty Runbook Automation 2024 Year in Review

Special guest Jeff Hausman, PagerDuty’s Chief Product Development Officer kicks off our 2024 recap for PagerDuty Runbook Automation and Rundeck Open Source. Then Jake and Forrest take us through all of the amazing improvements and new features added to the product, including shout outs to the amazing folks contributing to the Open Source repos and a customer success story from Ryanair.