Latest Publications

4 Steps to Prepare for an Outage

Reliability of 100 percent can't be achieved, but it's possible to get very close. While completely eliminating failure isn't an option, there are steps you can take to prepare your team so it can quickly restore availability when an outage inevitably occurs. This four-step process for setting up your metrics will help your team be ready to manage the unexpected.

Splunk, Big Data and the Future of Security

Current IT security tools and mindsets are no longer adequate to meet the scope and complexity of today's threats. Internet security has evolved over the last ten years, but advanced persistent threats and increasingly sophisticated malware have fundamentally changed how security teams must think about threats and about the tools they use for detective controls.

The On-Call Survival Guide

Welcome to the on-call life. Being on-call can be a daunting experience for any new team member. Only one thing is worse than being woken up at 3am to discover that your systems are down: waking up on your own at 8am to discover that your systems were down for 5 hours and nobody got the alert! With the right approach, a culture of collaboration, infrastructure insight, and the right tools in place, being on-call doesn't have to be so bad.