
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

AI SRE Summit 2026 Brings Together Engineering Leaders From AWS, Salesforce, Man Group, Smarsh, Honeycomb and More

Virtual event will explore what it takes to use AI in production SRE, from incident response and observability to platform design, cost control and self-healing operations.

TEL AVIV and SAN FRANCISCO, April 22, 2026 - Komodor, the autonomous AI SRE company, today announced it will host AI SRE Summit 2026, a free live virtual event on Tuesday, May 12, 2026, bringing together site reliability, platform engineering and cloud-native leaders to discuss how AI is changing production operations, and where i

Resolve's Agents of IT podcast - Ep. 17 - Agentic Workflows to Performance Intelligence

In this episode of Agents of IT, Ari Stowe sits down with Geoff McQueen, four-time founder and CEO of Ascendius, to unpack what it takes to navigate AI-driven disruption. Geoff shares a clear framework for where automation is headed, from individual AI use to agent-driven workflows to AI embedded across the business. Most organizations are still early. The real opportunity is in making AI work at the business level.

Introducing on-demand Pipelines: run pipelines via API

Your CI/CD pipeline doesn’t have to live in a YAML file anymore. With on-demand pipelines, you can generate pipeline definitions programmatically from scripts, services, or automation tools, and execute them instantly via the Pipelines API. No commit. No pull request. No static configuration to modify. Just build the YAML your situation demands and run it.
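As a rough illustration of "build the YAML your situation demands", the sketch below generates a pipeline definition from a set of changed services. The `pipelines` / `default` / `step` / `script` layout, the service names, and the `make` commands are all assumptions made for the example, not details taken from the announcement. Submitting the result to the Pipelines API is left out because the excerpt does not describe the endpoint.

```python
def render_pipeline(steps):
    """Render (name, commands) pairs as pipeline YAML text.

    The key layout used here is an assumed schema for illustration only.
    Values containing characters such as ':' or '#' would need quoting.
    """
    lines = ["pipelines:", "  default:"]
    for name, commands in steps:
        lines.append("    - step:")
        lines.append(f"        name: {name}")
        lines.append("        script:")
        for command in commands:
            lines.append(f"          - {command}")
    return "\n".join(lines) + "\n"


def steps_for_changed_services(changed_services):
    """Build one test step per changed service (hypothetical repo layout)."""
    return [
        (f"Test {service}", [f"make -C services/{service} test"])
        for service in sorted(changed_services)
    ]


# Hypothetical input: the services touched by the current change.
definition = render_pipeline(steps_for_changed_services({"billing", "auth"}))
print(definition)
```

The point of the sketch is that the definition is ordinary data produced at run time, so it can vary with whatever the calling script knows, without a commit to the repository.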

Nagios Plugins Collector: Run Your Existing Checks and Custom Scripts Inside Netdata

A lot of teams have a collection of Nagios plugins and custom monitoring scripts that have been running reliably for years. Some are standard community plugins for checking disk health or SSL certificate expiry. Others are homegrown Bash or Python scripts that check something very specific to the business: whether an API endpoint returns the right payload, whether a batch job completed on time, whether a queue depth is within bounds.
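For readers who have not written one, a homegrown check of the queue-depth kind might look roughly like the sketch below. It follows the usual Nagios plugin convention of exit codes 0 (OK), 1 (WARNING), 2 (CRITICAL) and 3 (UNKNOWN), with one line of output and optional performance data after a `|`. The thresholds and the `QUEUE` label are hypothetical, and the depth is passed in directly because how it is read depends on the queue in question.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_queue_depth(depth, warn, crit):
    """Return (exit_code, status_line) for a queue depth reading."""
    if depth is None or depth < 0:
        return UNKNOWN, "QUEUE UNKNOWN - could not read queue depth"
    # Performance data format: label=value;warn;crit;min
    perfdata = f"depth={depth};{warn};{crit};0"
    if depth >= crit:
        return CRITICAL, f"QUEUE CRITICAL - depth {depth} | {perfdata}"
    if depth >= warn:
        return WARNING, f"QUEUE WARNING - depth {depth} | {perfdata}"
    return OK, f"QUEUE OK - depth {depth} | {perfdata}"


def run(depth, warn=100, crit=500):
    """Print the status line and return the exit code."""
    code, line = check_queue_depth(depth, warn, crit)
    print(line)
    return code


# Hypothetical reading; a real plugin would finish with sys.exit(status).
status = run(120)
```

Because the contract is only an exit code plus a line of text, scripts like this can be written in any language, which is part of why teams accumulate so many of them.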

Share artifacts between parent and child pipelines

As part of an initiative to increase the flexibility and power of child pipelines, we are happy to announce that Bitbucket Pipelines will now allow you to share artifacts between parent and child pipelines. This feature extends the use cases for child pipelines: parent and child can coordinate more closely, and child pipelines can serve as modular processing stages that pass artifacts through larger operations. Here’s how it works.

Under the Hood: Engineering JFrog Premium Availability

In the modern software factory, 99.9% uptime is no longer the gold standard. A standard 99.9% SLA translates to approximately 43 minutes of unexpected downtime per month. While industry data shows that a single minute of downtime costs an average of $9,000, for large global enterprises, that figure can easily be 5x higher. At tens of thousands of dollars per minute, those 43 minutes quickly compound into a catastrophic financial and operational risk.
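The arithmetic behind those figures can be checked with a short sketch. It assumes a 30-day month, which is what yields roughly 43 minutes at 99.9%, and uses the $9,000-per-minute average and the 5x multiplier quoted above. The function names are illustrative.

```python
MINUTES_PER_DAY = 24 * 60


def downtime_minutes(availability_pct, days=30):
    """Downtime allowed per period by an availability SLA (30-day month assumed)."""
    return (1 - availability_pct / 100) * days * MINUTES_PER_DAY


def downtime_cost(availability_pct, cost_per_minute=9_000, days=30):
    """Cost of using the full downtime allowance at a given per-minute cost."""
    return downtime_minutes(availability_pct, days) * cost_per_minute


for sla in (99.9, 99.99):
    minutes = downtime_minutes(sla)
    average = downtime_cost(sla)
    enterprise = downtime_cost(sla, cost_per_minute=5 * 9_000)
    print(f"{sla}%: {minutes:.1f} min/month, "
          f"${average:,.0f} at $9,000/min, ${enterprise:,.0f} at 5x")
```

Under these assumptions, 99.9% allows about 43.2 minutes a month, or roughly $388,800 at the average rate and about $1.9 million at five times that rate. Each additional nine cuts the allowance by a factor of ten.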

How to Align CloudOps and FinOps for Better Azure Cost Management

The rapid migration to the cloud has brought unprecedented agility to modern enterprises, but it has also introduced a significant challenge in the form of cloud sprawl. As engineering teams provision resources at breakneck speed to support new applications and AI-driven workloads, financial departments often struggle to keep track of the escalating costs. This disconnect between operational execution and financial oversight is a primary driver of wasted cloud spend. To truly harness the power of scalable infrastructure without breaking the budget, organisations must bridge the gap between CloudOps and FinOps. Aligning these two disciplines ensures that technical performance and financial accountability work hand in hand to deliver sustainable business value. For companies heavily invested in Microsoft ecosystems, this alignment is even more crucial. Unchecked deployment can lead to massive end-of-month bill surprises, turning what should be a strategic advantage into a financial burden.

The Regional Data Centre Revolution Powered by AI Demand

London still hosts the biggest concentration of UK data centre capacity, but the centre of gravity is starting to move. AI workloads are changing the infrastructure maths, pushing power, space and planning considerations up the decision list. That is exactly where regional locations start to look like the sensible option. Government data shows how concentrated the market remains: as of autumn 2024, London is estimated at 1,048MW of colocation IT load. Compare that with 44MW in the East of England, 17MW in the North East and 30MW in Scotland. The gap is huge, yet it is not a permanent advantage.

Moving Beyond SolarWinds: Building a Modern Observability Strategy

For years, platforms like SolarWinds have been a standard in IT environments. They helped teams answer a fundamental question: are systems up or down? That approach worked well when environments were more contained and predictable. The challenge is that most environments no longer operate that way. Hybrid infrastructure, cloud services, and tightly interconnected applications have changed what “visibility” needs to mean.