Alerting

Building Automated Monitoring with Icinga and iLert

Jul 14, 2020 By iLert In iLert

How many servers can be managed by one system administrator? This question is pretty hard to answer since it depends decisively on the tasks that need to be operated. It is clear, however, that the amount of servers one engineer can manage has increased tremendously over the time, and is still growing. Public and private clouds, in combination with automation tools, enables us to automate many daily tasks. In a modern IT infrastructure almost everything can, and should, be automated.

Read Post

iLert

Read more about Building Automated Monitoring with Icinga and iLert

Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

Jul 14, 2020 By Vishwa Krishnakumar In Zenduty

Nagios is one of the most widely used open-source network monitoring software used by thousands of NOC teams globally to monitor the health of a vast array of their hosts and services. Most teams rely on Emails as their primary Nagios alert notification channel, which may take a few minutes to respond to by your NOC team.

Read Post

Zenduty

Read more about Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

FYI: Email Alerting Isn't Enough

Jul 14, 2020 By Christopher Gonzalez In OnPage

Email alerting is an inefficient way to receive and address critical alerts. Email inboxes tend to get flooded with “clutter,” as irrelevant messages bury urgent incident notifications. Incident management procedures require incident management systems, ensuring that urgent issues are immediately addressed. Yet, some services are reluctant to say goodbye to email alerting and its inefficiencies. This is the case with Google Voice, which recently solidified its commitment to email alerting.

Read Post

OnPage

Read more about FYI: Email Alerting Isn't Enough

What is a Status Page? (& How Does It Benefit Companies/Customers)

Jul 13, 2020 By StatusCast In StatusCast

There’s nothing worse than turning on your computer to start the work day and discovering the internet is down. We all know the frustration of tediously trying to figure out what’s wrong before finally breaking down and calling our service provider and waiting on hold, only to discover that it’s a known issue and it’s being addressed. What if there was a better way?

Read Post

StatusCast

Read more about What is a Status Page? (& How Does It Benefit Companies/Customers)

Product Metrics for Discovery Activities

Jul 12, 2020 By Ankur Rawal In Zenduty

Most companies today compile a set of metrics for their product teams to regularly report on to the company management. This includes a variety of product performance metrics(usage frequency, churn rate, NPS, etc.). But a lot of them struggle a bit with product discovery activities. So how do your track discovery?

Read Post

Zenduty

Read more about Product Metrics for Discovery Activities

Understanding the landscape of AWS compute

Jul 10, 2020 By Squadcast In Squadcast

In the second part of our "SLOs for AWS-based infrastructure" blog , Gigi Sayfan dives deeper into understanding the landscape of AWS compute by using the lens of Kubernetes to compare and contrast & covers in detail setting of SLOs for ECS, EKS, Fargate, and Lambda based services.

Read Post

Squadcast

Read more about Understanding the landscape of AWS compute

Keeping Your CMDB Up To Date in Distributed Times

Jul 10, 2020 By OpsRamp In OpsRamp

The configuration management database (CMDB) is meant to be a single source of truth to link IT elements with the application processes that underlie the business services. In the age of ITIL, a common repository to store information about your hardware and software assets, made sense. But with today's dynamic and distributed hybrid IT infrastructure, how do you keep your CMDB up to date? Should you even try?

View Video

OpsRamp

Read more about Keeping Your CMDB Up To Date in Distributed Times

NHS on Its Final Leg of Pager Replacement

Jul 10, 2020 By Ritika Bramhe In OnPage

If you’ve been following the U.K. healthcare landscape, you would know that the country has been considering replacing pagers for the longest time. This may soon materialize, partly accelerated by the challenges that doctors are facing during the COVID-19 pandemic. The pager replacement initiative not only signifies a pivotal shift from the aging infrastructure, but it also indicates how pagers have failed to thrive in today’s unprecedented times.

Read Post

OnPage

Read more about NHS on Its Final Leg of Pager Replacement

Resolve Actions - ServiceDesk - Integrating Resolve with a Chatbot in Microsoft Teams

Jul 9, 2020 By Resolve In Resolve

View some of the popular requests for a Service Desk chatbot in Teams, using Resolve Actions as the back-end IT automation platform.

View Video

Resolve

Read more about Resolve Actions - ServiceDesk - Integrating Resolve with a Chatbot in Microsoft Teams

Best practices for alerting on Kubernetes

Jul 9, 2020 By Jorge Salamero Sanz In Sysdig

A step by step cookbook on best practices for alerting on Kubernetes platform and orchestration, including PromQL alerts examples. If you are new to Kubernetes and monitoring, we recommend that you first read Monitoring Kubernetes in production, in which we cover monitoring fundamentals and open-source tools. Interested in Kubernetes monitoring?

Read Post

Sysdig

Read more about Best practices for alerting on Kubernetes

Subscribe to Alerting

Operations | Monitoring | ITSM | DevOps | Cloud

Alerting

Building Automated Monitoring with Icinga and iLert

Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

FYI: Email Alerting Isn't Enough

What is a Status Page? (& How Does It Benefit Companies/Customers)

Product Metrics for Discovery Activities

Understanding the landscape of AWS compute

Keeping Your CMDB Up To Date in Distributed Times

NHS on Its Final Leg of Pager Replacement

Resolve Actions - ServiceDesk - Integrating Resolve with a Chatbot in Microsoft Teams

Best practices for alerting on Kubernetes

Monthly Archive

Follow Us