Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Step-by-Step Guide to Monitoring Your SNMP Devices With Telegraf

Monitoring SNMP (Simple Network Management Protocol) devices is crucial for maintaining network health and security, enabling early detection of issues and proactive troubleshooting. Continuous monitoring ensures efficient resource utilization, minimizes downtime, and enhances overall network performance. In this article, we'll detail how to use the Telegraf agent to collect SNMP (MIB) performance statistics that you can forward to a data source.

The Complete Guide to Capacity Management in Kubernetes

In the dynamic world of container orchestration, Kubernetes stands out as the undisputed champion, empowering organizations to scale and deploy applications seamlessly. Yet, as the deployment scope increases, so do the associated Kubernetes workload costs, and the need for effective resource capacity planning becomes more critical than ever. When dealing with containers and Kubernetes you can find yourself facing multiple challenges that can affect your cluster stability and your business performance.

Netdata is the only real-time monitoring solution: Justified

In the digital era, where data flows like a ceaseless river, real-time monitoring stands as a pivotal technology, allowing organizations to not only keep pace but also to deeply understand the intricate dance of their operational ecosystems. This technology is not just about keeping tabs; it’s about gaining a profound, almost intuitive sense of the micro-worlds within which systems, containers, services, and applications pulse and thrive.

The next buzz in the city of bees: digital infrastructure, AI, and Manchester

Manchester has come a long way - from pioneering the world’s first stored program digital computer, to becoming the top tech city in the UK outside of London. The MCC 2021-2026 Digital Strategy now guides a £5bn digital economy, with more than 10,000 businesses employing over 96,000 people. It has seen the development of five unicorns and is still home to three, billion-pound businesses. So, the city of bees is buzzing.

How we Went From Two Major Outages to 99.98% Reliability in Just 6 Months with Eran Kampf

Discover TwinGate's incredible journey from facing major outages to achieving 99.98% reliability within six months. At Navigate NA 24, hear firsthand about the challenges, solutions, and innovations that transformed their operations. Learn about their approach to architecture, incident management, and customer communication that not only restored trust but also turned reliability into a competitive advantage.

#024 - Kubernetes for Humans Podcast with Gabriele Bartolini [EDB]

A long-time open-source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and DevOps culture, he is now leading the Cloud Native initiative at EDB.

Understanding Monitoring Tools

If you care about operational excellence when it comes to your IT infrastructure, the role of monitoring systems is pivotal. As we navigate through the myriad of available monitoring tools, it becomes essential to understand the distinct architectures, styles, and focal points of various monitoring solutions, as well as the time-to-value they offer.

The broken cloud market: Why free credits are the kryptonite of competition

The Competition and Markets Authority (CMA) is investigating the state of the UK cloud market, and for good reason. While the submissions and commentary already submitted to the CMA point to various concerning practices, one issue stands out as the single biggest threat to a healthy, competitive landscape: excessive free cloud credits. This tactic, employed by the dominant hyperscalers (AWS, Microsoft Azure, and Google Cloud Platform), is akin to a drug dealer offering free samples.