Alerting

A Closer Look at PagerDuty's New AIOps Capabilities

Another PagerDuty Summit is in the books, and we’re still coming down from the excitement and energy our customers and community showed us over the past week. We made several big announcements over the course of the conference, but none more significant than the AIOps advancements on our digital operations platform. We introduced a number of ways customers can apply machine learning algorithms and automation to a wide range of workflows across the platform.

Microsoft Teams and OpManager: The perfect team for your remote IT management game

It seems almost everything is going digital during this pandemic: businesses, education, and medical consultations. This increased digital consumption is squeezing the juice out of the IT infrastructure of many organizations. On top of that, remote work policies are posing serious security issues. At times like these, IT infrastructure monitoring is like a football game for IT admins, except: So how do you navigate all these challenges and score a touchdown?

Building and Using a 2020 Status Page with Uptime.com

A hosted status page gives you the peace of mind that users can always answer one simple question: is it up or down? Hosted status pages work because they offer third-party confirmation that your services are up. If your site goes down, the third party is likely still up, and you can point users to it for your status. Status pages are your personal 24-hour news cycle. Whether you’re up or down, customer service fields fewer support tickets, and users praise your transparency.

Any PLC alarm on your mobile device

Maintenance of machines is an incredibly important task, and it is important to fix a machine before it completely fails. In reactive maintenance scenarios, speed of response is key. Once an issue is detected, it is important to communicate it as reliably and quickly as possible to the right engineer. Ideally, the machine is connected directly to the team of mobile engineers in charge and can let them know exactly what happened and what needs to be fixed.

The incident resolution mandate of telehealth and telepharmacy providers in the age of Covid-19

The incident management challenges of a pandemic-driven world & how to overcome them

“While the safety and well-being of workers affected by COVID-19 is the first priority, companies will also triage other essentials, such as incident management and stakeholder communications.” (PwC)

In a pandemic-stricken world that is consuming products and services over the internet more than ever, there is a great strain on digital and connectivity systems.

Best PagerDuty Alternatives of 2020: An Independent Review by StatusGator

Modern applications offer more and more features, and the infrastructure needed to run them becomes increasingly complex. The need for Application Performance Monitoring (APM) and Network Performance Monitoring (NPM) tools like PagerDuty is obvious, as the cost of downtime can be exorbitant for a business of any size. Thus, every business needs to use PagerDuty or one of its alternatives to alert the Ops team should anything go awry.

Datadog on Incident Management

Datadog is a monitoring and analytics platform that ingests trillions of data points per day, coming from more than 8,000 customers. With a complex distributed architecture and hundreds of deployments per day, needless to say, sometimes things don't go as planned. Our teams have been improving the way incidents are managed at Datadog over the years, and they are using that knowledge to help Datadog customers manage their own incidents.