Operations | Monitoring | ITSM | DevOps | Cloud

Alerting

Rein in Your Incidents: Incidents and Alerts Foundations

Solving incidents is hard. Depending on your current situation, you may also be losing a lot of time figuring out what notifications constitute an incident. This results in more and more lost time as every notification must be triaged as a potential incident before you can proceed to move to resolve or disregard (as a non-incident). All this may sound very cumbersome, but the fastest way to improve is to learn and define what incidents are. And you’re in luck!

On-Call Scheduling: Building a Winning On-Call Schedule for Your Team

On-call scheduling enables 24/7/365 availability of service providers for critical issues like system downtime, technician response for critical systems, and patient care. Learn about the importance of on-call schedules for your organization and its customers, how to design an on-call schedule, and multiple ways you can build an on-call scheduling program that will improve customer response and make staff happier.

Observability with AIOps For Dummies

This new eBook explains how DevOps and SREs can develop more and operate less by applying AI to events, metrics, traces and logs to keep CI/CD agile and your business growing. DevOps and SRE teams build critical digital services, but often spend more time troubleshooting their complex applications and infrastructures than innovating. What's the solution? Combine AIOps algorithmic analysis and automation with observability's detailed operational data.

Product updates and changes | May-June 2020 + the new control panel is coming!

Check out the latest StatusHub updates and features, including bi-weekly schedule changes in widget, domains whitelist and more for the last two months. And We are happy to share with you that the new control panel is coming this autumn!

OnPage Mentioned in Gartner's Hype Cycle for Clinical Communication and Collaboration

Clinical communication and collaboration (CC&C) systems enhance care coordination to improve the patient experience. The systems are equipped with secure mobile messaging, allowing care teams to ditch their insecure pagers for HIPAA-compliant smartphone applications. Gartner, the global leader in tech research, has released its Hype Cycle for Real-Time Health System (RTHS) Technologies, 2020.

The Importance of Communicating Scheduled Maintenance to End-Users

Often, outages are planned. In fact, in most organizations, outages are typically not caused by something going wrong, but because some kind of IT operation requires your team to take a system temporarily offline. Communicating scheduled maintenance is just as important, if not more important than alerting users to unplanned outages.

Why Observability Matters to Site Reliability Engineers

This is the first in a three-post series themed around Ops-led DevOps, where I’ll explore the relationship between observability and a set of software delivery lifecycle practices that support the adoption of DevOps practices and the transition from project to product-centric ways of working. I’ll start with Site Reliability Engineering, move onto Value Stream Management and finish with Continuous Delivery.