Alerting

anodot

Correlation Analysis is Boosting the Efficacy of eCommerce Promotions

In the first part of this blog series, we discussed how correlation analysis can be leveraged to reduce time to detection (TTD) and time to remediation (TTR) by guiding mitigation efforts early. Correlation analysis also helps reduce alert fatigue by filtering out irrelevant anomalies and grouping multiple anomalies stemming from a single incident into one alert. In this part, we shed light on the applicability of correlation analysis in eCommerce, specifically to promotions.

Are we all on the same page? Let's fix that. | Luis Mineiro on 99 Percent Visible

The industry has defined it as good practice to have as few alerts as possible, by alerting on symptoms that are associated with end-user pain rather than trying to catch every possible way that pain could be caused. Organizations with complex distributed systems that span dozens of teams can have a hard time following this practice without burning out the teams that own the client-facing services. A typical solution is to have alerts on all the layers of their distributed systems. This approach almost always leads to an excessive number of alerts and results in alert fatigue.

datadog

Add Datadog alerts to your xMatters incident workflows

xMatters provides flexible, smart tools for incident response and management. With configurable workflows that bring together data from sources like GitHub, Jenkins, and Zendesk, you can automate crucial tasks and send enriched notifications to streamline team communications.

pagerduty

Let's Talk AIOps: Part 2: Things to Think About & the PagerDuty Approach

This is the second in a two-part blog series about AIOps where I sit down with Julian Dunn, Director of Product Marketing at PagerDuty, to level-set on the hot DevOps topic. The first post discussed whether AIOps was just marketing fluff and whether ITOps actually has an AIOps problem. Let’s continue…

signl4

September 2020 Update: All-new Webhook and API Key Management

Our September update provides you with self-service API key management from the SIGNL4 web portal. Finally, you can fully exploit our comprehensive REST API. We also improved management of outbound webhooks, which can update your systems with any information on Signl handling. It is now possible to manage API keys for the SIGNL4 API in the SIGNL4 account portal. Click on the “Developer” menu item to manage API keys. Keys issued here can then be used to call SIGNL4 REST API functions.

amixr

What To Look For When Choosing An Incident Response Software

It doesn’t take a security professional to know that even the most insignificant incident can have far greater consequences. Whether it concerns security, safety, or compliance, even the smallest incident can cause large and complex businesses to nosedive. Today, many large enterprises and service providers may be dealing with thousands of incidents daily relating to their network, applications, end-user devices, cloud-based tools, and more.

grafana

Introducing Prometheus-style alerting for Grafana Cloud

Hi! My name’s Richard Lam, and I’m the new product manager for Grafana Cloud. I’m really excited for my first contribution to this community, both so I can introduce myself to you all, and so I can highlight an awesome new Grafana Cloud feature that’s coming your way! Happy reading, and hopefully this is just the start of many more communications from me.

anodot

Correlation Analysis: A Natural Next Step for Anomaly Detection

Over the last decade, data collection has become a commodity. Consequently, there has been a tremendous deluge of data in every area of industry. This trend is captured by recent research, which points to a growing volume of raw data and to the growth of market segments fueled by that data.

signl4

No More False Alerts at Night

Do you know this situation? You are on-call and in the middle of the night you get a phone call. Loud enough to wake you up. Loud enough to wake your wife up, as well. You get up and check your emails to see what the problem is. OK, you got it. Then you log on to the console of your monitoring tool and – green. Green? False alert? Why did you get the call then? After double-checking, still a bit sleepy, you realize that the problem has resolved itself automatically.

overops

OverOps + Microsoft: Streamline Error Resolution with Microsoft Teams [Part 1]

This first post in our Microsoft + OverOps blog series breaks down how to enhance your troubleshooting process with our Microsoft Teams integration. At OverOps, we like to work the way you work. Whatever tools you use, wherever you are in the DevOps lifecycle, we’ll meet you there – and bring our deep, code-level visibility along with us. When it comes to integrating with the modern DevOps lifecycle, it should come as no surprise that Microsoft tools are on our radar.