Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Kubernetes monitoring explained: Key metrics, labels, and best practices

Monitoring Kubernetes and containers doesn’t have to be overwhelming. In this video, we’ll break down the essential metrics you need to track, why labels are critical for container visibility, and the best practices for Kubernetes monitoring at scale. You’ll learn: How tools like Site24x7 simplify Kubernetes monitoring with auto-discovery, dashboards, anomaly detection, and forecasting. Whether you’re a DevOps engineer, SRE, or developer, this video gives you the practical knowledge to improve container monitoring and observability.

Creating and using a Network Discovery Profile in Site24x7

Learn how to create and use a Discovery Profile in Site24x7 to simplify and automate network device onboarding. In this video, we walk you through setting up discovery parameters, applying filters and thresholds, grouping and tagging devices, configuring alerts, integrating with ITSM and collaboration tools, and scheduling periodic rediscovery. Whether you're managing a single site or multiple customer environments, Discovery Profiles help you.

How to Responsibly and Effectively Contribute to Open Source Using AI

With the influx of AI tooling, it’s never been easier to contribute to open source communities. These tools are capable of gathering context quickly, “understanding” repositories faster than ever before. They provide instant summaries about repositories that, previously, would have meant reading lines and lines of code. They can fix bugs in programming languages you don’t know, and ultimately allow more contributors to get involved, which (almost) every open source project wants.

You don't need a real outage to find your weak spots.

Modern digital services rely on complex systems, and chaos can strike at any layer. But the most effective teams don’t wait for failure to learn. They simulate it. By introducing controlled performance degradations, you can stress your systems, test your dependencies, and uncover hidden risks without touching production. In our latest webinar, Catchpoint experts walk through how teams are building resilience through proactive, safe failure testing, and why it’s become a cornerstone of digital reliability.

Top 3 MSP dashboards compared: SquaredUp, BrightGauge and MSPbots

Managed Service Providers (MSPs) live and die by their data. Externally, clients expect clear reporting, fast responses, and visible proof of value. Internally, smooth operations and low overheads are essential to business success. But with so many tools, key data is scattered across multiple systems – PSA, RMM, cloud services, ticketing, monitoring, finance – and that causes blind spots. Dashboards fix this problem by consolidating data into a single view.

Telemetry Now Teaser: "What's the real cost of delivering this traffic?"

Why is it so difficult to answer the age-old question CFOs are asking, "What's the real cost of delivering this traffic?" Complex billing structures and cost modeling are only part of it. Lauren Basile joins Phillip Gervasi to discuss turning network telemetry into financial insight in the latest episode of Telemetry Now.

Certificate Rotation with Progress-powered Solutions

Don’t let expired certificates put your organization at risk! Progress WhatsUp Gold makes it easy to discover, manage, and automate certificate lifecycles across your network. With powerful automation from Progress Infrastructure solutions, you can rotate and manage certificates without a manual routine to maintain compliance and security. Schedule updates, push certificates to thousands of nodes and maintain governance with built-in traceability. Experience simplicity, scalability and seamless integration with Progress-powered solutions.

Mute timing vs. silences in Grafana Alerting: How to pick the best fit for your use case

Have you ever been in a situation where know your team is going to run their weekly maintenance window and you silence your notifications to prevent a flood of false positives from pinging your inbox? If you are associated with a team that uses any type of alert system, you know how easily alert fatigue can happen. The incessant and unpredictable (or even, at times, predictable) pings, emails, and notification alerts can drive even the most serene worker totally batty.