Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How To Run Monthly Cloud Cost Meetings For AI Teams

If you’ve ever stared at your cloud bill and thought, “How on earth did this get so crazy?” — you’re not alone. Especially when AI workloads come into play, those GPU costs can feel like a runaway train. The good news? It doesn’t have to be that way. The magic happens when you’ve got someone from every team that cares about smart growth (FinOps, AI/ML, product, engineering, whatever) all in one room, looking at the same set of numbers.

Visualizing Logs Alongside Metrics: A Practical Use Case

Security threats aren’t always loud and don’t always crash systems or trigger alarms. Sometimes they creep in quietly as a steady stream of unauthorized login attempts, slow brute-force probes, or unknown IPs scanning your server for vulnerabilities. These behaviors often show up in logs before they surface in metrics but if you're only watching logs or only tracking metrics, you're missing part of the story.

Getting closer to space with Canonical #ubuntu #space #shorts

@EuropeanSpaceAgency is scaling to support more missions than ever. Canonical makes it possible with open source infrastructure built for space. Watch the full video to see how we're helping ESA automate, scale, and future-proof its operations. Subscribe for more tech stories from space.

A local fix just spreads the problem

“You fixed a bug in QA — great! But did that fix go into version control and get tested and deployed everywhere? If not, you just created drift, and more problems down the line.” Peter Kruis, Microsoft SQL Engineer at Monin Fixing a bug in the environment where it appears feels like progress, but without a proper process, it creates fragility everywhere else.

Break it early to ship it safely

“We want developers to break things – just not for the customers. If all our tests are green, I get nervous that we’re not testing deep enough.” Naga Santhosh Reddy Vootukuri, Principal Software Eng. Manager, Microsoft Azure SQL Naga Santhosh, Sunny to most, leads a team that ships changes to Azure SQL databases worldwide. Those deployments must be fast, frequent, and invisible to customers. That kind of reliability doesn’t come from playing it safe during development.

Best Practices for End-to-End Testing in 2025

End-to-end (E2E) testing is a critical practice in today’s software development, ensuring that entire applications work seamlessly from the user’s perspective. With the growing complexity of web applications – from large monoliths to distributed microservices – thorough E2E testing has become essential for quality assurance.

Automating Network Diagrams for A Complete View of All Active and Passive Components

Accurately tracking how data center devices are connected—across switches, patch panels, structured cabling, and more—is essential for efficient data center operations. But for many teams, documentation still lives in static diagrams or outdated spreadsheets, requiring extensive manual effort. This is time-consuming and leads to inaccuracies that can cause delays in planning or troubleshooting and unnecessary risk. Sunbird DCIM changes that.

IP Optical Middle Mile Network Architectures for Rural America

In addressing the burgeoning demand for broadband connectivity in rural America, a robust and innovative IP Optical Network Architecture is essential. The architecture must incorporate a best-in-class multi-layer design optimized for middle-mile functionality, integrating both voice and security dimensions. A pivotal requirement is to decouple the last mile from the middle mile, ensuring that the last-mile solutions can remain agnostic to various technologies while still benefiting from a unified middle-mile infrastructure.