The Guide to Kubernetes Debugging

Kubernetes is widely used for deploying, scaling, and managing systems and applications and is an industry standard for container orchestration. Google engineers originally developed Kubernetes as an open-source project. Its first release was in September 2014, and since then, it has matured into a graduated project maintained by the Cloud Native Computing Foundation (CNCF). With the complexities of scale and distributed systems, debugging in Kubernetes environments can be difficult.

Simplifying Container Observability for DevOps Teams

In modern microservices architectures, container observability is crucial for maintaining reliability and performance. It helps teams detect issues early and optimize distributed systems. This guide will walk you through the essentials of container observability, including advanced techniques and troubleshooting strategies to ensure your containerized applications run smoothly.

Introducing Server Nicknames

This past week we released the simple yet widely requested feature Server Nicknames: the ability to easily track and manage various servers with unique custom names. At first glance this may seem like a non-update, but not when you consider that most Cycle users are connecting many servers from multiple providers and locations, both in the cloud and on premises. With long default server names, this is a huge quality-of-life improvement.

Densify 3.0: Introducing Cloudex, Smarter K8s Automation, and Enhanced API Power

We’re excited to announce the release of Densify 3.0, a major step forward in our mission to deliver precision-driven cloud optimization across Kubernetes and public cloud environments. This release brings to life a new user experience, powerful automation capabilities, richer data insights, and smarter APIs—all designed to simplify and scale your optimization journey.

The Future of Cloud Data & AI/ML with Google's Engineering Lead

Explore the future of AI and data analytics with Google Engineering Lead Samarth Shah. From Lakehouse architectures to Tiny Models at the edge, Samarth breaks down the key trends shaping cloud-native analytics. Gain insights into augmented analytics, data governance, and real-time intelligence in this forward-looking session from Civo Navigate San Francisco 2025.

All about OTel and Logging on Kubernetes with Loki (Loki Community Call April 2025)

In this pre-recorded Loki Community Call, we talk all about OTel and logging on Kubernetes with Cyril Tovena, Ward Bekker, Jay Clifford, and Nicole van der Hoeven at KubeCon EU 2025 in London. We discuss when and why you should switch to OTel and why you shouldn't, what OTLP is exactly, and best practices for ingesting data through an OTLP endpoint.

The Cloud Has Made Us Reckless! It's Time to Reclaim Control

Tim Banks takes the stage at Civo Navigate San Francisco 2025 to explore the unintended consequences of cloud adoption. From runaway AWS bills to vendor lock-in and lost operational skills, Tim offers a thought-provoking look at how convenience in the cloud has come at a cost to resilience, privacy, and sustainability.

Super Mario & Platform Engineering? You'll Be Surprised!

What can Super Mario teach us about platform engineering? More than you'd think! At Civo Navigate San Francisco 2025, Ramiro Berrelleza, founder of Okteto, draws surprising parallels between one of gaming’s most iconic series and the world of platform engineering. Learn how building familiar, reliable foundations, just like Super Mario, can empower developers, foster innovation, and avoid constantly reinventing the wheel.

[Webinar] Drift Happens! 3 Kubernetes Drift Scenarios & How to Overcome Them

Many organizations don’t realize the impact of Kubernetes drift until they’re managing multiple clusters and face skyrocketing costs, delayed fixes, and downtime. Drift can lead to inefficiencies and even critical security risks. That’s why it’s crucial to understand and proactively address drift at scale.

Going Green with GPUs: The Role of Sustainability in Cloud Computing

As businesses increasingly transition to cloud-based services, the demand for data center resources has surged, leading to higher electricity consumption. Currently, data centers contribute to about 3% of global carbon emissions and account for roughly 1-1.5% of global electricity demand. As a result, the need for sustainable solutions in cloud computing has never been more urgent.

The Top 4 Kubernetes Misconfigurations You Can Avoid on Cycle

Most cloud infrastructure and deployment misconfigurations start innocently enough: a dev under pressure to ship quickly tweaks a configuration file or adjusts a permission setting to make something work. It's not malicious and it might even be well thought out, but these small changes can cause a cascade of reactions that bring down production in seconds.

How to find Kubernetes reliability risks with Gremlin

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Most Kubernetes clusters have reliability risks lurking just below the surface. You could spend hours or even days manually finding these risks, but what if someone could find them for you? With Detected Risks, Gremlin automates the work involved in finding and tracking reliability risks across your Kubernetes clusters. Surface failed Pods, mismatched image versions, missing resource definitions, and single points of failure, all without having to run a single test.

Introducing relaxAI in India: Your Trusted AI Assistant

We're excited to announce that relaxAI, the AI assistant designed with a strong focus on privacy and data sovereignty, is now available in India. In a world where AI tools are becoming increasingly popular, concerns about data usage and security are at an all-time high. relaxAI is here to change that experience completely.

Meet relaxAI: India's Affordable & Secure AI Assistant

Get ready to experience the power of AI in India with relaxAI! Our AI assistant is designed with a strong focus on data sovereignty, ensuring that your data stays confidential and under your control. With relaxAI, you can enjoy 100% Indian data sovereignty, compliance with Indian data protection laws (DPDPA), and complete control over your data. Learn more about relaxAI's features, pricing, and how it can help Indian businesses and individuals achieve their goals.

A Closer Look at Docker Build Logs for Troubleshooting

In the world of containerization, understanding what's happening under the hood during image builds can mean the difference between smooth deployments and frustrating debugging sessions. Docker build logs are your window into this process, offering crucial insights that help you optimize builds, troubleshoot errors, and maintain robust container infrastructure.

What is Agentic AI? Understanding the Next Evolution of AI

In the ever-evolving world of artificial intelligence, a new frontier is emerging—Agentic AI. This revolutionary concept goes beyond the traditional models of AI that we’ve grown accustomed to. Instead of simply following explicit instructions, agentic AI systems are designed to act autonomously, make decisions, and adapt dynamically. In other words, they can “think” independently to achieve specific goals.

How to get started with Calico Observability features

Kubernetes, by default, adopts a permissive networking model where all pods can freely communicate unless explicitly restricted using network policies. While this simplifies application deployment, it introduces significant security risks. Unrestricted network traffic allows workloads to interact with unauthorized destinations, increasing the potential for cyberattacks such as Remote Code Execution (RCE), DNS spoofing, and privilege escalation.

Managing EKS deployments with CircleCI deploys

Development teams managing Kubernetes-based applications face challenges in maintaining visibility and control over their deployment processes. Without a centralized interface, teams struggle to track, monitor, and manage releases across their Kubernetes clusters, leading to potential deployment errors and difficulties in maintaining consistent deployment workflows.

Navigating container monitoring: Key challenges and practical solutions

It’s no secret: Containers have fundamentally reshaped application deployment, driving agility and scalability. However, they’ve also introduced a new set of complexities in container monitoring that often outpace traditional methodologies. In this blog, we’ll explore the core challenges in container observability and outline pragmatic strategies for ensuring a robust and performance-driven containerized environment.

#040 - Beyond Mere Penguins: Crafting Engaging Developer Communities with Jono Bacon (Stateshift)

In this special KubeCon episode, Itiel sits down with Jono Bacon of CNCF fame to talk about his career in building developer and user engagement for open-source technologies. Jono shares his early experiences with Linux and how they sparked a passion for community building. They discuss his current coaching company, Stateshift, which helps various tech companies, including those in the CNCF ecosystem, improve their community outreach, brand building, and user adoption. Examples of successful community building, like GitLab and Dagger, are mentioned.

#041 - Virtualizing Kubernetes with Lukas Gentele (Loft Labs)

In this special KubeCon edition episode of the Kubernetes for Humans podcast, Itiel meets with Lukas Gentele, CEO and co-founder of Loft Labs. Discover why Lukas believes multi-tenancy is a major obstacle in Kubernetes adoption and how Loft Labs is tackling this challenge through innovative projects like vCluster, a "super famous project" for running virtual Kubernetes clusters.

Examining Network Architectures: Kubernetes and Cycle

In a world of managed services, details can often be skipped, overlooked, ignored, or just plain avoided. And in many cases, that's fine. But if you're here, reading this, then I will take it for granted that these things interest you, and I welcome you to join me on this journey of exploration, looking under the hood of two prominent container orchestration platforms on the market: Cycle and Kubernetes.

The Coming Decentralization of Cloud

This quote resonates deeply when considering the pendulum swings in technology. We’ve seen boom-and-bust cycles with various trends, from blockchain to AI. Some trends have more staying power than others, but the pendulum swings one way, only to swing back—sometimes with a vengeance, correcting the overreach of the previous swing. One of the most significant pendulum swings of the last few decades was the shift to cloud computing.

Calico Open Source 3.30: Exploring the Goldmane API for custom Kubernetes Network Observability

Kubernetes is built on the foundation of APIs and abstraction, and Calico leverages its extensibility to deliver network security and observability in both its commercial and open source versions. APIs are the special sauce that helps automate and operationalize your Kubernetes platforms as part of a CI/CD pipeline and other GitOps workflows. Calico OSS 3.30 introduces numerous battle-tested observability and security tools from our commercial editions. This includes the following key features.

The DevOps secret to 99.9% uptime: The ultimate Kubernetes monitoring guide

Monitoring your Kubernetes clusters is critical for maintaining reliable applications. But with so many metrics to track and tools to choose from, setting up effective monitoring can feel overwhelming. The Cloud Native Computing Foundation (CNCF) highlights record Kubernetes adoption, underscoring the growing need for robust monitoring solutions. Search for "Kubernetes monitoring" and you'll find a sea of contradicting information, countless tools, and complex setups.

Meta's Llama 4 models now on relaxAI

Here at Civo, we’re proud to announce that we have become the first UK company to successfully host and operationalize Meta’s new Llama 4 model family on relaxAI, our AI assistant. This breakthrough positions relaxAI at the forefront of sovereign AI development, combining world-class capabilities with uncompromising data protection standards that UK businesses can trust.

Top 7 Kubernetes Alternatives in 2025

Gone are the days when scaling meant downtime and breaking fixes in production were commonplace. In today's age of software development, companies are adapting and quick to iterate, though we are still learning. The truth is Kubernetes has played a huge part in revolutionizing software development, but it is not as straightforward as 'adopt K8s and your organization will suddenly move in the right direction'.

Kubernetes 1.33 - What you need to know

Kubernetes 1.33 is right around the corner, and there are quite a lot of changes to unpack! Excluding enhancements with the status of “Deferred” or “Removed from Milestone”, the official tracker lists 64 enhancements in all. So, what’s new in 1.33? Kubernetes 1.33 brings a whole bunch of useful enhancements, including 35 changes tracked as ‘Graduating’ in this Kubernetes release.

What's New in Qovery Q1 2025: Faster Deployments, Smarter Scaling, and More Control

Over the last three months, we’ve focused on solving three core challenges our users face: delivering faster, improving resiliency, and gaining tighter control over cloud infrastructure. Today, we’re excited to share the new features we rolled out in Q1 2025 - all built to help teams ship faster, with more confidence, and lower operational overhead.

Pod Memory Usage: Tracking, Commands & Troubleshooting

Your containers are running, and your clusters seem fine, but then you get that dreaded alert – memory pressure. Whether you're scaling up your infrastructure or just trying to keep things running smoothly, understanding pod memory usage isn't just nice to have – it's essential knowledge for any DevOps engineer worth their salt. Let's cut through the noise and get straight to what matters: practical ways to track, analyze, and fix memory issues in your Kubernetes pods.

How to Configure ContainerPort in Kubernetes (The Easy Way)

This guide covers container port configurations in Kubernetes, explaining key concepts and practical setups. If you're setting up ports for the first time or troubleshooting connectivity issues, you'll find clear explanations and useful examples to help you navigate container networking effectively.

How to Master Log Management with Logrotate in Docker Containers

Docker containers continuously generate logs during operation, and without proper management, these logs can consume significant disk space, impact system performance, and create operational issues. Logrotate offers an effective solution for managing these logs in containerized environments. This guide covers the implementation of logrotate in Docker containers – from initial setup through advanced configurations that ensure stable, maintainable container deployments.

The future of Kubernetes networking: Cilium and other CNIs with Canonical Kubernetes

Choosing the right Container Network Interface (CNI) for Kubernetes is critical to achieving optimal performance, security, and scalability. With the launch of Canonical Kubernetes LTS (long-term support) last month, Canonical decided to integrate Cilium as the default CNI in order to reflect our commitment to delivering a modern, security-maintained, high-performance Kubernetes experience.

Calico Whisker, Your New Ally in Network Observability

With the upcoming release of Calico v3.30 on the horizon, we are excited to introduce Calico Whisker, a simple yet powerful User Interface (UI) designed to enhance network observability and policy debugging. If you’ve ever struggled to make sense of network flow logs or troubleshoot policies in a complex Kubernetes cluster, Whisker is your friend!

Understanding Docker monitoring: A comprehensive list of key Docker metrics

In today’s fast-paced development landscape, containerization has become a cornerstone for deploying scalable and efficient applications. Docker, as one of the most popular container platforms, offers a robust environment for building and running containers. However, with great power comes the need for greater scrutiny, i.e., Docker monitoring or observability. Understanding Docker metrics is key to maintaining optimal performance and ensuring your containerized applications run smoothly.

SUSE and RKE2 are introducing KubeSleep: Smart Kubernetes Scaling Based on Developer Inactivity

We’re excited to announce Kubesleep, a smart Kubernetes operator developed by SUSE that optimizes cluster efficiency and significantly reduces infrastructure costs. Kubesleep automatically scales workloads based on actual developer activity, intelligently detecting periods of inactivity and scaling down resources to save energy and expenses. Best of all, your clusters smoothly scale back up before developers even notice.

Ending the IngressNightmare: How SUSE Secures Your Kubernetes Clusters from External and Internal Threats

In March 2025, Wiz researchers disclosed a set of critical vulnerabilities in the popular ingress-nginx controller for Kubernetes. Collectively referred to as IngressNightmare, these issues (CVE-2025-1097, CVE-2025-1098, CVE-2025-24513, CVE-2025-24514, and CVE-2025-1974) allow unauthenticated attackers to exploit the Ingress admission controller, potentially achieving remote code execution or escalating privileges in the cluster.

Back to the Metal

Bare metal is BACK! For years virtualization has absolutely dominated the cloud market. The market for virtualization is still 10x larger than bare metal ($8B USD vs $100B USD). But now consumers are demanding MORE for their workloads. … and the signal from the data suggests that this trend isn't going away anytime soon. If we look a bit deeper, we might see another story enabling the avalanche of (re)adoption in bare metal.

Kubernetes Monitoring: One view for observing all your storage volumes

If you want to observe your entire Kubernetes environment, you need visibility into all of your resources, including storage volumes. But monitoring Kubernetes storage hasn’t always been easy, especially if you wanted to see how it related to other parts of your infrastructure.

9 Best Container Monitoring Tools You Should Know in 2025

In a world where containers power everything from startup MVPs to enterprise applications, keeping tabs on your containerized environment isn't just good practice—it's survival. Container environments are notoriously dynamic and ephemeral, creating unique monitoring challenges that traditional tools simply can't handle. We've sorted through the noise to bring you the nine tools that deliver.