Operations | Monitoring | ITSM | DevOps | Cloud

Ready to move on and pick up speed again

We are going through an incredibly difficult time of uncertainty, lockdowns, cutbacks, and even fear. Taking this time to optimize and rethink the way we do business is essential in ensuring we get back on track and return even stronger than before. Most of us have been working from home for months now and, in some cases, there is no end in sight. How are you and your operations holding up? Are you able to work, maintain, and control your infrastructure?

How to use check aggregates in Sensu Go

Aggregates, which allow you to monitor groups of checks or entities, were a much-beloved feature in Sensu Core (the predecessor to Sensu Go) — Ben Abrams describes them as “awesome” in his post on alert fatigue, noting that aggregates are like having “a bunch of nodes behind a load balancer where each node is healthchecked, and if a node drops out it may not be worth waking someone up in the middle of the night.”

Best practices for alerting on Kubernetes

A step by step cookbook on best practices for alerting on Kubernetes platform and orchestration, including PromQL alerts examples. If you are new to Kubernetes and monitoring, we recommend that you first read Monitoring Kubernetes in production, in which we cover monitoring fundamentals and open-source tools. Interested in Kubernetes monitoring?

OpManager now supports SMSEagle, Twilio, and Clickatell, so you can get SMS alerts anywhere!

IT admins need to know the status of their IT devices, servers, routers, switches, and firewalls. To meet this need, OpManager has a highly responsive and robust notification and alerting system that sends alerts via email, Slack, and even SMS. Murphy’s law says anything that can go wrong will go wrong, and if you’re in IT, you’re probably familiar with how easily things can go wrong.

SLOs for AWS-based infrastructure

In our latest two-part series blog, Gigi Sayfan, author of “Mastering Kubernetes”, discusses managing complex infrastructure on AWS with an eye towards SLOs (service level objectives). Though there are many ways to discuss the management of infrastructure, in this two-part series, he covers SLOs for AWS, Observability on AWS, Quotas Limits, and Optimizing cost on AWS and in the second part, he uses the lens of Kubernetes to compare and contrast compute infrastructure on AWS with Kubernetes.

Monitoring 101: Collecting the Right Data

When it comes to business, data is everything. Whether it’s sales, supply, marketing, or your I.T. systems, every day is a constant stream of decisions to be made. To make the right decisions, you need enough knowledge. To have enough knowledge, you need the right data. What’s the best way to collect all this data? In the information age, there’s so much data out there, all the time.

Improve Customer Experiences & Collaboration Between Support and Engineering With Bidirectional Communication

We are delighted to announce our new PagerDuty integration for Salesforce Cloud. This integration empowers Customer Service, Engineering, and IT teams to proactively resolve customer issues in real time by improving communication and collaboration.