The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.
Interlink Software’s AIOps Platform is now listed on VMware Marketplace – residing in the Management and Monitoring category. VMware Marketplace offers ITOps, DevOps and SREs a comprehensive catalog of solutions across fifteen distinct solution categories.
We’re excited to announce a new set of updates and enhancements to PagerDuty’s Digital Operations Platform. Recent updates from the product team include On-Call Management and Incident Response, Process Automation, to PagerDuty Community & Advocacy Events. New capabilities enable users and customers to resolve incidents faster, do the following, and more.
Business continuity is a crucial part of any scalable operations plan, but many businesses fail to realize how important it is until their first critical emergency. Only then does business continuity management come to the forefront of planning exercises, and stakeholders are forced to reflect on what went wrong, why it went wrong, and determine if they can avoid it happening again, or be better prepared if it does. The true business continuity management lifecycle begins long before an incident.
Atri Mandal, HEAL’s AI/ML expert, has written a second blog about the 4P strategy, this time primarily focusing on solution recommendation which gives useful suggestions to the SREs on how to fix problems pro-actively.
Site Reliability Engineering (SRE) teams and Platform Engineering teams share similar goals -- like maximizing automation and reducing toil -- and similar methodologies. But they have different priorities, and use somewhat different tools to achieve them. What are SREs, what are platform engineers and how is each role similar and different? This article explains.