Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Containers, Kubernetes, Docker and related technologies.

Kubernetes Optimization Beyond Requests and Limits - Node Scaling Blockers

Many of us understand the concept of Kubernetes Requests and Limits, and that by reducing over-sized resource requests we can reduce waste in our clusters. And for GKE Autopilot and EKS Fargate clusters that is true. Because you’re being billed directly for the resources you’re requesting, driving down requests can result in instantaneous savings. However in most hosted Kubernetes environments you’re not actually being billed for requests.

Your Company Has 10x More Developers Than You Think

The low-code promise failed for 15 years. AI builders delivered in 15 months. Here's what actually changed, why the engineer in me resisted it, and what it means for every CTO. Romaric founded Qovery to make Kubernetes accessible to every engineering team. He writes about platform strategy, developer experience, and the future of cloud infrastructure.

Don't Ban the Builders - Govern Them

AI tools turned everyone into a builder. Your sales team, your finance team, your CEO - they're all shipping apps now. The answer isn't to ban them. It's to give them a governed platform they actually want to use. Romaric founded Qovery to make Kubernetes accessible to every engineering team. He writes about platform strategy, developer experience, and the future of cloud infrastructure.

The Five Pillars of AI Agent Accountability: A Diagnostic Framework for Engineering Leaders

You’re in a board meeting. The CISO is presenting on AI risk. The CFO asks a simple question: “When that finance agent we deployed last quarter accessed a customer payment record, can we tell who authorized it, what policy permitted it, and produce the full audit trail?” The CISO looks at the head of the platform. The head of the platform looks at security. Nobody answers. If you can picture that meeting happening at your company, you’re not alone.

Autonomous K8s Optimization Involves Both Compute and Storage Resources - Are You Doing Both?

One of the most powerful capabilities in K8s is the ability to autoscale resources to meet demands, scaling resources up during peak periods to ensure performance, and down again during lower periods to save money. In this joint session, Lucidity and Kubex walk through what end-to-end K8s optimization looks like when you address both layers together. We cover: Expect real examples, not slides full of theory. You’ll leave with a clear picture of where waste is hiding in your environment and a prioritized approach to addressing it.

Ubuntu Core 26 fleet observability

What is Ubuntu Core? Ubuntu Core is a minimal and strictly confined variant of Ubuntu powering devices around the world. Ubuntu Core 26 now integrates with the Canonical Observability Stack, streaming device logs and metrics to centralized Grafana, Loki, and Prometheus infrastructure, deployable in the cloud or on-premise, without burdening the device's primary workloads.

Canonical announces fully Managed Kubeflow AI operations platform on the Microsoft Azure Marketplace

Canonical, the publisher of Ubuntu, today announced the general availability (GA) of Managed Kubeflow on the Microsoft Azure Marketplace. This solution enables AI teams to get a fully managed, production-ready MLOps platform in their own tenant. Upstream Kubeflow is a powerful tool for machine learning, but it remains notoriously challenging to deploy and maintain.

Civo AI: Strategy over complexity

Most cloud providers think AI is just a hardware problem. They focus on the GPUs, the racks, and the raw compute, but they leave the strategy up to you. At Civo, we do AI differently. We don't just provide the hardware; we guide you through the full life cycle of AI adoption, from initial planning to scaling production workloads. By leveraging best-in-class NVIDIA models and GPUs, we give you the performance to unlock AI at scale without the fear of being bogged down by complexity. It's more than infrastructure, it’s cloud freedom with AI built-in.

Self-host AI on Kubernetes: GPU clusters, private models, and the GitOps Catalog

Spin up a GPU workload cluster using Konstruct's new GPU cluster templates, deploy a self-hosted LLM, and use it in your organization — all live on stream. This hands-on session shows how shipping AI workloads to GPU clusters is just as easy as deploying to Konstruct physical or virtual clusters, and how open source apps in the GitOps Catalog make it even faster. Walk away knowing how to cut your token spend by running models privately on your own infrastructure.

NVIDIA Vera Rubin: What is it, what's new, and when you can get it

NVIDIA's infrastructure roadmap moves fast, and the next major milestone is already here. The NVIDIA Vera Rubin platform is the company's next-generation AI compute architecture, the successor to Blackwell, and it's shaping up to be one of the most significant leaps forward in AI infrastructure NVIDIA has ever shipped. Whether you're planning your next training cluster, scaling inference pipelines, or building the infrastructure to power autonomous agents, Vera Rubin is worth understanding now.