Alerting

50% cost savings by automating alarm dispatching at Aquafin

Aquafin is a Belgian company with over 1,000 employees that was established by the Flemish Region in 1990 for the purpose of expanding, operating and pre-financing the wastewater treatment infrastructure in Flanders. Aquafin collects household wastewater from the municipal sewers and transports it to wastewater treatment plants, where it is treated in accordance with European and Flemish standards.

4 Simple Reasons Why Rules-Based Solutions Fail IT Ops

Managing IT Operations is a challenging job that's only getting harder. Complexity is growing exponentially, especially as more companies embrace digital transformation. Continuous service assurance has become a task beyond human capability alone, especially in large enterprises that generate millions of operational events every single day.

Why incident response automation is top-of-list for CISOs in 2020

Considering the state of critical incidents in 2019, it’s no surprise that, looking ahead to 2020, CISOs have one of the organization’s most challenging and stressful jobs. During the first half of 2019 alone, 4.1 billion records were compromised, and the average cost of a data breach is now estimated at $3.92 million.

On-call doesn't have to be stressful

Being on-call is a critical duty that many operations and engineering teams must undertake to keep their services reliable and available. However, there are several pitfalls in the organization of on-call rotations and responsibilities that can lead to serious consequences for the services and the teams if not avoided.

The Age of Service Mesh

You have built a massively successful system. The users just can't get enough and request new features. Your developers crank out new services on a regular basis. Your DevOps/SRE team configures and scales your Kubernetes cluster (or clusters). As the system becomes more complicated and sophisticated, you realize that there are common themes that repeat across all your services.

Improving Postmortem Practices with Veteran Google SRE, Steve McGhee

For many SREs, Google’s 99.999% availability seems like an untouchable dream. If anything, getting out of pager hell is already worth celebrating with all your coworkers, friends, and family on the moon. How can teams climb out of it? How can you get to a stage where you have time to proactively prevent incidents, and enter a mental state of calm and control? The rope out of pager hell is woven with a thorough and rigorous postmortem process.

Sensitive Medical Data Hacked by Unsophisticated Software

There’s a solid rationale behind replacing antiquated technology, as it fails to keep pace with how the healthcare environment is evolving. One such technology is the good old pager. Recently, the U.K.’s National Health Service Trust (NHS) was in the spotlight when the organization’s sensitive medical data was hacked by an individual in North London. The malicious party intercepted radio waves, converting them into legible text on his computer monitor.