Operations | Monitoring | ITSM | DevOps | Cloud

Business Monitoring: If You Can't Measure It, You Can't Improve It

“If you can’t measure it, you can’t improve it” …this quote by Peter Drucker and the philosophy behind it is a key driving force behind modern management and the introduction of BI solutions to support the scaling and increased complexity of businesses. Analytics tools were developed to enable metric measurement and business monitoring across large scale, complex systems and to enable continuous improvements of business performance.

Better Incident Response: Incident Classification & Setting Severities with Tags

What you absolutely must know when responding to an incident is what kind of impact it has on customers and how negatively it can affect your team. This is typically addressed by following some kind of incident classification, usually “incident severity levels”, to indicate the importance of every incident - that is, to understand how seriously various stakeholders are affected and to route the incident differently if necessary.

"Homegrown" May Be Good for Tomatoes, Not So Much for IT Ops

In the past, many organizations grew and managed their own data centers. Some still do. And many are still developing their own automated incident management (aka Autonomous Operations) tools. But as IT grows and becomes evermore complex and fast-moving, the reality of what it means to do so kicks in, and organizations are re-evaluating their strategies.

Scheduling IT and Engineering on-call rotations just got easier

It shouldn’t take you more time than a few seconds to understand your on-call schedule and rotations and how you could make changes to it. It is important for on-call scheduling and alerting tools to make this as simple as possible. If you’re spending more than a few seconds to understand what your on-call rotations are going to be like for the next day or week or month, then you need to start looking for a better on-call management tool.