Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

No room for downtime during lockdown

As of the beginning of June, even though some countries have started to slowly come out of lockdown, one-third of the world’s population is still at home in quarantine – a fact that is truly astounding. Never has reliable access to connected systems been so critical to the ongoing productiveness, emotional wellbeing, and even the survival of individuals and companies worldwide.

Elastic: New alerting for observability, security, and the Elastic Stack

The alerting framework released into beta in version 7.7 introduces a wholly new and streamlined experience for creating, managing, and monitoring your alerts and notifications within the Elastic Stack. From the ability to create alerts directly within Elastic Observability and Elastic Security interfaces to a full alert management UI and the flexibility to trigger emails or integrate with 3rd party platforms like PagerDuty, the new alerting framework gives you the power to proactively monitor what matters. And when things do end up on your radar, this new alert management will help you make sense of the issue and act quickly.

Infrastructure dashboards: Declutter your monitoring data and ensure you're not overspending

The task of monitoring and managing an entire network, including all the servers and applications that run on it, is by no means easy. With so many components of varying complexity, the volume of performance data coming at you can be overwhelming. This information overload increases the chances of missing data that could help discover performance inefficiencies.

Deploying a Containerized App in Google GKE

Because of its popularity and widespread adoption, Kubernetes has become the industry’s de facto for deploying a containerized app. Google Kubernetes Engine (GKE) is Google Cloud Products’ (GCP) managed Kubernetes service. It provides out-of-the-box features such as auto-scaling nodes, high-availability clusters, and automatic upgrades of masters and nodes. In addition, it offers the most convenient cluster setup workflow and the best overall developer experience.

How to Use Monitoring Tools to Improve Root Cause Analysis

As an IT manager you would have often heard from your line manager or user ask “Let’s drill down to find the root cause.”? As dreaded a question as it may seem, it is really the most important answer to understand IT outages. IT infrastructure availability is highly dependent on isolating problems, so the deciding variable in a problem can be fixed without putting the entire system at a halt. This is where RCA can be of tremendous help.