
The latest News and Information on Containers, Kubernetes, Docker and related technologies.

Stop ECS Containers From Collapsing Into One Service in OpenTelemetry

Why ECS containers collapse under service.name = aws_ecs and how to fix it for both EC2 launch type and Fargate, including the resource-vs-log-record pitfall that quietly breaks log filtering. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.
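To make the collapse and the resource-vs-log-record pitfall concrete, here is a minimal Python sketch using plain dictionaries rather than any OpenTelemetry SDK. It assumes, purely for illustration, that a per-container value lives under a `container.name` key on the log record while `service.name` on the resource is stuck at `aws_ecs`. The helper `promote_service_name` and the sample records are hypothetical and are not taken from the article.

```python
from collections import Counter

FALLBACK = "aws_ecs"  # the shared value every container collapses under


def promote_service_name(resource_attrs, record_attrs, source_key="container.name"):
    """Return a copy of the resource attributes with service.name replaced.

    The value is read from the resource first and from the log record second,
    because a filter that only inspects the resource never sees the latter.
    """
    promoted = dict(resource_attrs)
    value = resource_attrs.get(source_key) or record_attrs.get(source_key)
    # Only overwrite the shared fallback; keep any explicitly set name.
    if value and promoted.get("service.name", FALLBACK) == FALLBACK:
        promoted["service.name"] = value
    return promoted


# Hypothetical log records: the distinguishing value sits on the record,
# not on the resource.
logs = [
    {"resource": {"service.name": "aws_ecs"}, "attributes": {"container.name": "checkout"}},
    {"resource": {"service.name": "aws_ecs"}, "attributes": {"container.name": "payments"}},
]

# Grouping by the resource's service.name before and after promotion.
before = Counter(log["resource"]["service.name"] for log in logs)
after = Counter(
    promote_service_name(log["resource"], log["attributes"])["service.name"]
    for log in logs
)
```

In this sketch `before` holds a single `aws_ecs` bucket and `after` holds one bucket per container. In a real pipeline the equivalent step would run in the collector or SDK configuration, and the right source attribute depends on the launch type.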

Build with Claude Code, Deploy with Qovery

AI coding tools eliminated the 'writing code' bottleneck. But deploying that code? Still a mess. Here's how Claude Code + Qovery Skill lets you go from idea to production in a single prompt - with enterprise-grade guardrails. Romaric founded Qovery to make Kubernetes accessible to every engineering team. He writes about platform strategy, developer experience, and the future of cloud infrastructure.

ISO 27001, G-Cloud and SOC 2: How to vet a sovereign cloud provider

A procurement officer at a mid-sized financial services firm spent six months last year negotiating with a cloud provider that turned out not to hold the certification it had implied in its sales deck. The contract collapsed during legal review. The firm lost the time, the provider lost the deal, and somewhere in the middle, a senior engineer learned the difference between "compliant with the principles of" and "audited to the standard of."

Shadow IT Is Back - And Vibe Coding Made It 10x Worse

AI coding tools are the new Shadow IT - but instead of rogue Trello boards, they have OAuth access to your code repos, cloud accounts, and production databases. Here's what's already gone wrong, and how platform engineering fixes it. Romaric founded Qovery to make Kubernetes accessible to every engineering team. He writes about platform strategy, developer experience, and the future of cloud infrastructure.

7 best AI deployment platforms for production Kubernetes workloads in 2026

Training a model in a notebook is easy. What breaks teams is the step after: serving it reliably without haemorrhaging cloud budget or burying your SREs in YAML. The common trap is picking a platform that handles the model but not the surrounding stack. An AI deployment platform should orchestrate the full application graph (inference endpoints, vector databases, caching layers, and frontends) inside a single VPC, with GPU autoscaling that doesn't require a dedicated platform engineer to babysit.

#056 - Cloud Contradictions and Cautionary Tales with Corey Quinn (The Duckbill Group)

In this episode of the Kubernetes for Humans podcast, Itiel sits down with the internet's favorite cloud contrarian, Corey Quinn of the Duckbill Group. Corey shares his unconventional career path as a "cautionary tale," explaining why his knack for fixing horrifying AWS bills makes him a terrible employee, and why he absolutely refuses to touch Kubernetes in production.

Rightsizing Nightmares: When Your Cloud Cost Tool Degrades Performance

This is what production teams see happening. A vertical pod autoscaler recommendation gets applied automatically. Resource requests come down a notch across a namespace. The cost dashboard registers a small savings win. A few minutes later, health checks start failing. Pods enter crash loops.
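One way to picture a guardrail against this failure mode is a floor check applied before a recommendation is accepted. The Python sketch below is an illustration under stated assumptions, not how any particular autoscaler or cost tool works: the function `safe_memory_request`, the 20% headroom default, and the MiB figures are all hypothetical.

```python
def safe_memory_request(recommended_mib, observed_peak_mib, headroom_pct=20):
    """Clamp a recommended memory request so it never drops below the
    observed peak plus a headroom margin (integer MiB, rounded up)."""
    # Ceiling division keeps the floor from rounding below the margin.
    floor_mib = -(-observed_peak_mib * (100 + headroom_pct) // 100)
    return max(recommended_mib, floor_mib)


# A recommendation below peak usage is raised to the floor.
raised = safe_memory_request(recommended_mib=256, observed_peak_mib=300)

# A recommendation already above the floor is left alone.
unchanged = safe_memory_request(recommended_mib=512, observed_peak_mib=300)
```

Here `raised` is 360 and `unchanged` is 512. The check trades a little of the savings for a request that still covers the workload's observed peak.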

VM Migration to Kubernetes: What Breaks and How to Prevent It

Here is what nobody putting together the business case for a VM migration to Kubernetes will tell you upfront: the compute is the easy part. Moving workloads off vSphere and onto Kubernetes is conceptually straightforward. The tooling has matured. The architecture is proven. Compute moves, storage remaps, and the platform team has a plan. The network is where projects quietly stall.