The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Our customers aren't just numbers - they're a priority

At incident.io, “We care about our customers” isn’t just a talking point. It’s a core part of how we operate. Whether it’s a big feature request or a small bug fix, we’ve been intentional about making sure that customers always feel heard and seen—no matter the ask. But it’s not just that.

The role of secure data storage in fueling AI innovation

Artificial intelligence is the most exciting technology revolution of recent years. Nvidia, Intel, AMD and others continue to produce faster and faster GPUs, enabling larger models and higher throughput in decision-making processes. Outside of the immediate AI hype, one area remains somewhat overlooked: AI needs data.

Decentralized Monitoring Explained

Users often find themselves puzzled by the concepts of decentralized or distributed monitoring. This confusion is likely due to many monitoring systems claiming distributed capabilities, making it challenging to discern how Netdata stands out. To grasp the distinction, we must delve into the evolution of monitoring systems. When the first monitoring systems were created, about 20-25 years ago, they were built as SNMP collectors.

Colo Rental Rates are Rising: Are You Keeping Track of Your Power Utilization?

It’s getting more expensive to rent space in colocation data centers. According to CBRE, colocation rates are up 18.6% year-over-year to a record $163.44 per kW/month due to limited supply and strong demand.

[Figure: Average asking rental rate with year-over-year % change for primary markets. Rental rates are quoted asking rates for 250-500 kW at N+1/Tier III requirements. Image source: CBRE Research, CBRE Data Center Solutions, H2 2023.]
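To put the quoted figures in perspective, here is a minimal Python sketch of the arithmetic. It uses only the two numbers cited from CBRE (the $163.44 per kW/month rate and the 18.6% year-over-year increase). The 300 kW contracted load and 210 kW actual draw are hypothetical values chosen for illustration, and the sketch assumes billing on contracted rather than consumed power, which may not match a given colocation agreement.

```python
# Figures quoted from CBRE in the text above.
RATE_PER_KW_MONTH = 163.44   # USD per kW per month
YOY_INCREASE = 0.186         # 18.6% year-over-year


def prior_year_rate(current_rate: float, yoy_increase: float) -> float:
    """Back out last year's rate implied by the current rate and the YoY change."""
    return current_rate / (1 + yoy_increase)


def monthly_cost(power_kw: float, rate_per_kw_month: float) -> float:
    """Monthly rental cost for a given amount of power at a per-kW rate."""
    return power_kw * rate_per_kw_month


def unused_capacity_cost(contracted_kw: float, used_kw: float,
                         rate_per_kw_month: float) -> float:
    """Monthly spend on contracted power that is not actually drawn.

    Assumes billing is based on contracted power (an assumption, not a
    statement about any specific provider's terms).
    """
    unused_kw = max(contracted_kw - used_kw, 0.0)
    return unused_kw * rate_per_kw_month


if __name__ == "__main__":
    contracted_kw = 300.0  # hypothetical, within the 250-500 kW range quoted
    used_kw = 210.0        # hypothetical measured draw

    print(f"Implied prior-year rate: ${prior_year_rate(RATE_PER_KW_MONTH, YOY_INCREASE):.2f} per kW/month")
    print(f"Monthly cost at {contracted_kw:.0f} kW: ${monthly_cost(contracted_kw, RATE_PER_KW_MONTH):,.2f}")
    print(f"Monthly cost of unused capacity: ${unused_capacity_cost(contracted_kw, used_kw, RATE_PER_KW_MONTH):,.2f}")
```

Under these assumptions, the gap between contracted and measured power is the number worth tracking: every unused kilowatt is billed at the same rising rate.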

How to standardize resiliency on Kubernetes

There’s more pressure than ever to deliver high-availability Kubernetes systems, but a combination of organizational and technological hurdles makes this easier said than done. Technologically, Kubernetes is complex and ephemeral, with deployments that span infrastructure, cluster, node, and pod layers. And as with any complex and ephemeral system, the large number of constantly changing parts opens the possibility for sudden, unexpected failures.

Maximizing Cloud SQL database availability

How does Cloud SQL achieve near-zero downtime? Join Debi Cabrera as she interviews Product Manager, Rahul Deshmukh. Rahul discusses the various capabilities of Cloud SQL and the best practices to maximize business continuity for applications. Watch along and hear firsthand from the session speaker about configuring and monitoring Cloud SQL for maximum availability.

Observability Vs. Monitoring: The Complete Comparison

Many people wonder, “Is there a difference between observability and monitoring?” As IT environments have become more complex, monitoring alone has become increasingly less effective. That’s because while monitoring is crucial, it isn’t particularly suited to tracking unforeseen or unexpected turns of events. That’s what observability is meant for. This guide will clarify what observability and monitoring are and how they differ.

Step-by-Step Guide to Monitoring Your SNMP Devices With Telegraf

Monitoring SNMP (Simple Network Management Protocol) devices is crucial for maintaining network health and security, enabling early detection of issues and proactive troubleshooting. Continuous monitoring ensures efficient resource utilization, minimizes downtime, and enhances overall network performance. In this article, we'll detail how to use the Telegraf agent to collect SNMP (MIB) performance statistics that you can forward to a data source.