Operations | Monitoring | ITSM | DevOps | Cloud

Alerting

Three Ways MSPs Can Benefit From Dynamic Thresholds

People around the world depend on Managed Service Providers (MSPs) to keep their businesses running like clockwork, even as their IT infrastructure evolves. Keeping workflows efficient leads to higher profits, but this can be a challenge when customer environments mix on-premises infrastructure, public and private cloud, and other complex setups. The shift to remote work in 2020 due to the COVID-19 pandemic has only made this more challenging for MSPs.

Key Learnings from the Facebook Status Page

Yesterday, April 8th 2021, at around 22:00 UTC, Facebook experienced a major outage in which Facebook, Messenger, WhatsApp web and Instagram were down for as much as 3 hours. The outage was reported on Facebook’s status page, which was a good example of how to communicate an incident.

Learning from Facebook: Keep your Status Page Separate from your Infrastructure

Yesterday, April 8th 2021, at around 22:00 UTC, Facebook experienced a major outage in which Facebook, Messenger, WhatsApp web and Instagram were all down and unavailable. The last update, posted 3 hours later, marked the incident as resolved. So even though the status page doesn’t state the duration of the incident, we can assume it was still affecting some users for that long.

Monthly Moo Update | March 2021

Here we are a full quarter into 2021, a year that took off in a huge way for us, and the momentum continues to grow strong. March was a monumental month, and now it’s a wrap. We released significant updates across the board in almost all areas of Moogsoft, including pushing innovation to newfound levels when it comes to the ease of integrating your metric and event data.

How Can I Silence Alerts?

Yes, you can silence or disable alerts in Graylog. There are times in IT environments when you know in advance that you are going to generate specific events in your network. For example, you may be patching servers or upgrading hardware components. These types of activities are very common during maintenance windows.
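The idea of silencing alerts during a maintenance window can be illustrated with a minimal, tool-agnostic sketch. This is not Graylog's API or configuration: the function name `is_silenced` and the list of `(start, end)` windows are assumptions made purely for illustration.

```python
from datetime import datetime, timezone

def is_silenced(event_time, windows):
    """Return True if event_time falls inside any maintenance window.

    `windows` is a hypothetical list of (start, end) datetime pairs;
    the start is inclusive and the end is exclusive.
    """
    return any(start <= event_time < end for start, end in windows)

# Hypothetical maintenance window: two hours of server patching.
maintenance_windows = [
    (datetime(2021, 4, 10, 22, 0, tzinfo=timezone.utc),
     datetime(2021, 4, 11, 0, 0, tzinfo=timezone.utc)),
]
```

An alerting pipeline built along these lines would call `is_silenced` with each event's timestamp and skip notification when it returns True, while still recording the event itself.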

Three fundamental tips for effective event filtering in SIGNL4

Event and alert filtering matters because alert fatigue is one of the most crucial issues in alerting and alert management. SIGNL4 implements a lightweight and effective way of filtering events. The overall process is based on alert categories. Alert categories are applied using a keyword search across the entire payload of incoming third-party events. But assigning alert categories, e.g. for alert augmentation, is not filtering.
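A keyword search across an entire event payload, as described above, could be sketched roughly as follows. This is an illustrative assumption, not SIGNL4's implementation: the `CATEGORIES` mapping, the keywords, and the function names are hypothetical. In line with the text, the sketch only assigns categories; any filtering decision would be a separate step.

```python
import json

# Hypothetical category definitions: category name -> keywords.
# This is not SIGNL4's actual configuration format.
CATEGORIES = {
    "database": ["postgres", "mysql", "deadlock"],
    "network": ["packet loss", "link down"],
}

def flatten_payload(payload):
    """Serialize the whole payload to one lowercase string so that
    a keyword can match in any field, including nested ones."""
    return json.dumps(payload, ensure_ascii=False).lower()

def match_categories(payload, categories=CATEGORIES):
    """Return the names of all categories with at least one keyword
    occurring anywhere in the payload (case-insensitive)."""
    text = flatten_payload(payload)
    return [
        name
        for name, keywords in categories.items()
        if any(keyword.lower() in text for keyword in keywords)
    ]

event = {"title": "Switch alert", "details": {"msg": "Link down on port 4"}}
print(match_categories(event))
```

Searching the serialized payload keeps the sketch short, but it also matches keywords that appear in field names, so a real system might restrict the search to field values.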

Taming the Data Problem and Accelerating AIOps implementations with Robotic Data Automation (RDA)

RDA enables enterprises to operationalize machine data at scale to drive AI- and analytics-driven decisions. RDA automates repetitive data integration, preparation and transformation activities using bots that are invoked in “no-code” data workflows or pipelines. RDA helps to move data in and out of AIOps systems, thereby simplifying and accelerating AIOps implementations that would otherwise depend on numerous manual data integrations and professional services activities.

5 Ways Unplanned Work Is Disrupting Your Business

Unplanned work is rising, with consequences ranging from unhappy customers and lost revenue, to employee churn and burnout. So what is the true business cost of wasted time? In this blog, we will explore how one employee’s wasted time can impact the whole company—from operations, to development and beyond.