Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Here are the Metrics you Need to Understand Operational Health

In recent polls we’ve conducted with engineers and leaders, we’ve found that around 70% of participants used MTTA and MTTR among their main metrics. Another 20% cited looking at planned versus unplanned work, and 10% said they currently look at no metrics. While MTTA and MTTR are good starting points, they're no longer enough. With the rise in complexity, it can be difficult to gain insights into your services’ operational health.
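As a rough illustration of the two metrics named above, here is a minimal Python sketch. It assumes MTTA means mean time to acknowledge and MTTR means mean time to resolve, both measured from the moment an incident is triggered. The Incident record and its field names are hypothetical and not taken from any particular tool.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean


@dataclass
class Incident:
    # Hypothetical record layout; real tools name these fields differently.
    triggered_at: datetime
    acknowledged_at: datetime
    resolved_at: datetime


def mtta(incidents: list[Incident]) -> timedelta:
    """Mean time from trigger to acknowledgement."""
    seconds = [(i.acknowledged_at - i.triggered_at).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


def mttr(incidents: list[Incident]) -> timedelta:
    """Mean time from trigger to resolution (one common reading of MTTR)."""
    seconds = [(i.resolved_at - i.triggered_at).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


# Made-up sample data, for illustration only.
start = datetime(2020, 8, 1, 9, 0)
sample = [
    Incident(start, start + timedelta(minutes=2), start + timedelta(minutes=30)),
    Incident(start, start + timedelta(minutes=4), start + timedelta(minutes=60)),
]
```

Calling mtta(sample) and mttr(sample) returns timedelta values that can be tracked over time. Both functions raise statistics.StatisticsError on an empty list, so a caller would need to handle periods with no incidents.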

What is a Network Operations Center (NOC)?

A network operations center (“NOC”) provides a central location for enterprise IT. Here, NOC team members supervise, monitor, and maintain an enterprise’s services, databases, external services, firewalls, and networks. With a full understanding of how a NOC works, your enterprise is well-equipped to maximize its performance.

Moogsoft Joins Inc. Magazine's List of America's Fastest-Growing Private Companies

Inc. Magazine has named Moogsoft to its annual Inc. 5000 list, an exclusive ranking of the nation’s fastest-growing private companies across all industries. The pioneer and leading provider of AIOps solutions made its debut on the prestigious list, ranked No. 884, based on its 528 percent revenue growth over the past three years.

3 Takeaways From SaaStr Summit: Transformation, Opportunity, and Social Responsibility

At this year’s SaaStr Summit: Enterprise, our CEO Jennifer Tejada talked about why digital acceleration has been one of the positives to come out of the COVID-19 crisis, and why this time for change shouldn’t be wasted. Here are three key takeaways that she shared. The crisis, as Jenn explained, is accelerating permanent change in the role of the CEO.

6 Steps to Increase DevOps Velocity

Many organizations are looking to implement DevOps due to the promise of increased release velocity, better developmental agility, and the ability to free up time for developers to focus on innovation. However, adopting DevOps isn’t a panacea. Instead, the communication, collaboration, and blameless retrospectives that a DevOps model encourages can help foster a leaner system where bottlenecks are resolved in a way that not only fixes the problem but also improves the process.

iGaming: Where Incident Management Meets Compliance

At a time when players have multiple online choices and competition is fierce, safe betting and social responsibility are at the forefront of brand integrity. In fact, social responsibility has become a competitive edge for leading operators. Enter the era of the regulator. Regulation is now defining both the operator’s brand integrity and the player experience. Are online operators up to the regulation task? Some are, though some are not.

Resilience in Action, E5: Tammy Bryant and Eric Roberts on the Importance of Glue Work

Resilience in Action is a podcast about all things resilience, from SRE to software engineering, to how it affects our personal lives, and more. Resilience in Action is hosted by Blameless Staff SRE Amy Tobey. Amy has been an SRE and DevOps practitioner since before those names existed. She cares deeply about her community of SREs and wants to take what she’s learned over the 20+ years of her career to help others.