Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Code Optimization: The Cloud Always Collects Its $2,000 Tuition Fee

We hear a lot of war stories from the teams we work with. Horror stories about cloud bills, surprise overages, and the infrastructure decisions that seemed perfectly reasonable at the time. This one comes from Erik Dasque, CTO at Allure Security. It involves a junior developer, a Kubernetes CronJob, and a recurring bill that, if not caught, would have happened on a yearly basis.

5 AI And Cloud Cost Problems That Are Now Everyone's Problem

Not long ago, cloud cost was an engineering problem. FinOps teams owned it, finance leaned in occasionally, and everyone else stayed out of it. Now, that’s changed. AI changed who has skin in the game. CFOs get asked about it in board meetings. CEOs field questions on earnings calls. The audience for cloud cost management has exploded — and that means the conversation CloudZero is built to enable isn’t only a technical one, it’s a business one.

Nine Ways to Connect to Cloud Using Private Connectivity

Struggling with cloud complexity? Compare dedicated, partner, and IPsec connections to find the right private connectivity solution. Multicloud environments bring complexity, and how you connect to your CSPs can make or break performance, cost, and reliability. Here’s how dedicated, partner, and IPsec connections compare — and which might be right for your business. There are three main methods of connecting to the cloud with private connectivity.

The hidden reliability risks in your agentic AI workflows

Artificial intelligence recently took a major leap from “saying” to “doing.” Instead of simple back-and-forth chats, we’re now allowing automated AI processes to take action on our behalf—from responding to emails to building and deploying complete applications. This shift from “assistant” to “actor” can make applications more capable, but it also creates additional failure modes.

Building a dry-run mode for the OpenTelemetry Collector

Teams continuously deploy programmable telemetry pipelines to production, without having access to a dry-run mode. At the same time, most organizations lack staging environments that resemble production – especially with regards to observability and other platform-level services.

The next wave of AI: Balancing innovation with sovereignty

This blog is based on the webinar, “AI panel: The next wave of AI technology”. You can watch the full recording by clicking here! The pace of AI innovation is reshaping research, business, and everyday life. However, as breakthroughs in Large Language Models (LLMs) and high-performance computing accelerate, they bring new technical challenges around scale, efficiency, and reliability.

Re-Inventing Network Operations: Are AI Extensions the Right Path?

For decades, telecom network operations have depended on traditional OSS tools – complex, services-heavy platforms that take years to modernize and even longer to deliver measurable business impact. This year at MWC, the leading OSS vendors showcased a variety of new AI extensions for their portfolios and marketed them as the fastest path to autonomous network operations. They are not.

Event Intelligence for Agentic IT Operations

Modern IT teams are experimenting with AI agents. But individual agents, working in isolation are not enough. To truly achieve Agentic IT Operations, organisations need a platform — one that coordinates, governs, and contextualises AI-driven actions across the entire IT landscape. That’s where Interlink Software comes in.