Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Metrics That Matter: Measuring Developer Productivity in the AI Era

In this episode, Ryan McDonald is joined by Mark Quigley, Head of Platform Engineering at Ninety.io, for a conversation that cuts through the noise around developer productivity metrics and AI. Mark dives deep into how teams can measure what matters—without falling into the trap of turning every measure into a target. He shares how tools like Developer NPS, DORA metrics, and balanced scorecards can help teams optimize for both output and well-being—but only when framed with the right intent.

The timeline to fully automated incident response

We speak to engineering teams every day, and everybody knows AI is the future. Some tell us they’re massively accelerated by Claude, or that they’re rebuilding their product, team and ways of working. Cursor and Lovable have announced they’re building the last piece of software. Should we give in to the vibes? Embrace exponentials, and forget that the code even exists? The reality is that things will still go wrong. They always do, at least from time to time.

SONiC: The open source network operating system for modern data centers

Software for Open Networking in the Cloud (SONiC) is an open-source network operating system that has revolutionized data center networking. Originating as a Microsoft-led initiative in the Open Compute Project (OCP) in 2016, SONiC has rapidly gained traction among hyperscalers and switch hardware vendors, including Broadcom, Cisco, and NVIDIA. By building its services using containerized microservices, SONiC brings flexibility, scalability, and modularity to network infrastructure.
Sponsored Post

Step by Step Guide for Using the HG-CLI Agent Installation Tool

Our latest project at MetricFire is a brand-new CLI tool! This tool makes agent installation on any OS a breeze, and we are quite proud of it. In this article, we'll share an overview of HG-CLI, and how to use it in Terminal User Interface (TUI) and Command Line Interface (CLI) mode. We'll also show you what to do with the metrics that are collected and forwarded to your Hosted Graphite account, giving you a full server monitoring setup in minutes!

Incident management vs. problem management: A practical guide for SREs

In Site Reliability Engineering (SRE), distinguishing incident management from problem management is crucial. While both processes aim to maintain system reliability, they fulfill distinct roles: incident management focuses on quickly resolving immediate disruptions, whereas problem management identifies and rectifies root causes to prevent recurrence. Effectively combining these processes helps minimize downtime, enhances system resilience, and fosters a proactive operational approach.

Shift Right: The On-Premise to Private Cloud Evolution

We're taking a look at our clients' journey to a hybrid-cloud architecture, exploring the benefits private cloud boasts over on-premise infrastructure and how this helps an organisation's digital transformation. The adoption of cloud technology is determined by where data originated and is held, and how it is used.