Operations | Monitoring | ITSM | DevOps | Cloud

Monthly Snap July: Releases, Meetups & Macbook Touchbar

July brought you new releases for our monitoring core and web frameworks: Icinga 2.9 and Icinga Web 2.6. Both major versions add more awesome features to your monitoring stack. Icinga 2.9.1 was already pushed fixing a problem with non-Systemd platforms, Icinga Web 2.6.1 is coming later this week fixing a regression.

How I got my job in WebGazer on the day I learned that it existed

Yes, you heard it right. They both happened on the same day. I had no idea about WebGazer a few months ago, and now I am a part of it! It all started when a friend of mine texted me that he wanted some help. I said okay, but honestly, I did not think that it would become my new job.

The Fastest Path to Modernizing Incident Management

Long gone are the days of manually monitoring an inbox and deciphering which alerts require attention or action. However, when adopting or migrating to a new tool, it can seem like a daunting process to set up all of your teams, integrations, and notification settings. OpsGenie is here to help. We offer dedicated Pre-Sale Engineers and Customer Success Engineers who will help you identify your bottlenecks and precise needs within OpsGenie.

EventSentry v3.5 Released: Windows Process Monitoring to the Max, Registry Tracking, Tags & More

EventSentry v3.5 continues to increase visibility into networks with additional vantage points, making it easier for EventSentry users to reduce their attack surface as well as discover anomalies.

The Monitor - Andy Tuba, Senior Software Developer at Reddit

For the sixth edition of The Monitor we spoke to Andy Tuba, a Senior Software Engineer at Reddit. Reddit is a site that needs no introduction, but we’re gonna write one anyway because otherwise this section would just be blank. They bill themselves as the front page of the internet, and considering they’re the 8th most popular website in the world, that isn’t just marketing pablum.

Monitoring Django apps on Heroku

I don't know of an easier way to deploy a Django app than letting Heroku do the work. That said, how do you stay on top of your app's performance, errors, and stability post-launch? Running an app on Heroku is a blissful experience, but it presents some monitoring challenges that aren't present when you control the hardware. In this post, I'll walk through a free-to-start, low-effort approach that gives you great visibility of the health of your Django app on Heroku.

Accelerating Incident Response With Real-Time Business Data at Wayfair

Like any good e-commerce company, Wayfair collects a significant amount of data to use for business intelligence. Until recently, the majority of this data was crunched off-hours in preparation for business use the next day. We also create a great deal of data about our applications and infrastructure in real time.

Volunteers, Not Conscripts: Fixing Out-Of-Hours On-Call at Intercom

Uptime matters. At Intercom, we believe that keeping our product online and working well at all times is critical to the success of our business. Out-of-hours on-call is inherently disruptive to your life as an engineer. You need to be ready to respond quickly and competently to an alert about something being broken.