Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Log analytics and dashboarding in Datadog

Achieving optimal performance can be challenging when you depend on separate platforms to monitor service health and to manage your logs. When data about your systems is spread across multiple platforms, investigating issues—and ultimately resolving them—takes longer and requires expertise with more tools. It takes more effort to identify real customer impact, as well as to verify that your responses to an incident are having the desired effect.

Managing Python Processes with PM2

PM2 is a production-grade process manager that makes management of background process easy. In the Python world we could compare PM2 to Supervisord, but PM2 has some nifty features you might like. With PM2, rolling restarts, monitoring, checking logs and even deploying application has never been that simple. We really value CLI UX, so PM2 is really simple to use and master.

Monitoring Social Signals to Reduce Alert Fatigue With SignalFx and PagerDuty

“I need to be notified if there’s a significant event ongoing with SignalFx.” This is what I tell my team. However, despite being the CTO of a monitoring company, creating the right set of alerts for me to stay informed of incidents in progress or potential issues was harder than it seemed at first glance. Why?

Massachusetts Natural Gas Explosions - A Lesson in The Importance of Alert Automation

The pressure in the natural gas pipelines under three Massachusetts communities spiked to 12 times their normal level last week, just before the explosions and fires that destroyed dozens of homes and killed an 18-year-old man. Columbia Gas went under fire for their mismanagement of the incident. The NTSB says a Columbia Gas control room in Columbus, Ohio, registered pressures of 6 pounds per square inch last Thursday in pipelines that are intended to carry just 0.5 PSI.

Saving lives by ensuring uptime of mission-critical IT at Gift of Hope

Gift of Hope Organ & Tissue Donor Network is a non-profit organ procurement organization that coordinates organ and tissue donation and provides public education on donation in Illinois and northwest Indiana. As one of 58 OPOs that make up the nation’s donation system, Gift of Hope works with 180 hospitals and serves 12 million people in their donation service area.

Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

In my previous post, I talked about the real costs of alert fatigue — the toll it can take on your engineers as well as your business — and some suggestions for rethinking alerting. In part 2 of this series, I’ll share some best practices for fine-tuning Sensu to help reduce alert fatigue.

How AI/ML Helps Retailers Keep 3 Promises This Holiday Season?

Another holiday season will soon be upon us, and many retailers and eCommerce businesses are already making plans. As you take inventory of what you learned last holiday season, let’s start with some lessons learned by the entire retail industry this time last year. In addition to stocking up on hot items and planning your promotions, the most competitive sites found that using AI/ML to optimize customer experience not only kept customers happy, it dramatically increased their revenues.

What Is Lambda Architecture? (for dummies)

From ancient Rome and Greece throughout Latin America and Egypt, there is only one thing beside the history itself that kept those ancient times alive even today – the architecture. The most important part of any era in our immersive history was the building of magnificent objects all around the world. These objects, even today, are some of the many wonders of the world.

Will Layer 3 Switches Give Routers the Boot?

Switches are the most common network device deployed on MSP-managed networks, while routers are the least popular—and not by a small margin. The data in Auvik’s recently published report, Managing Network Vendor Diversity: The MSP Challenge, shows switches represent almost half (48%) of all network devices on MSP-managed sites, while routers account for only 6% of the total. Does this mean the death of the router is imminent? In short, no—and here’s why.

Super Monitoring Decorated with 2 Distinctions for Application Performance Monitoring Software

Super Monitoring was recently lauded by a popular software review platform for its steadfast assistance in keeping everyone’s business operations smooth and seamless at all times. For its efficiency in informing users regarding emerging issues and anomalous threats, Super Monitoring was distinguished by the FinancesOnline SaaS review platform with two prestigious awards for 2018: Great User Experience and Rising Star.