
The latest news and information on containers, Kubernetes, Docker, and related technologies.

Kubernetes Incident Management Best Practices

Standing up just any infrastructure on Kubernetes is not enough. Many basic configurations will get your application running for the time being, and they might work just fine. But incident response won’t always remain 100% reliable. You will run into new potholes, and that’s okay.

How to save on container costs efficiently using Kubernetes cost reporting in CloudSpend

In the current era of cloud computing, it is essential for businesses to streamline costs. As containerization and Kubernetes become increasingly popular, efficiently managing the costs of Amazon Elastic Kubernetes Service (EKS) and Azure Kubernetes Service (AKS) is crucial for maintaining a successful infrastructure.

Using Helm Dashboard and Intents-Based Access Control for Pain-Free Network Segmentation

Helm Dashboard is an open-source project that graphically shows installed Helm charts, their revisions, and changes to their Kubernetes resources. The intents operator is an open-source Kubernetes operator that makes it possible to roll out network policies in a Kubernetes cluster chart by chart, gradually achieving zero trust or network segmentation.

Securing Access to Cloud Native Resources with Certificates - Civo Navigate NA 2023

In this talk, Alan Vailliencourt, a Senior Solutions Engineer with Teleport, discusses the importance of moving away from passwords and securing access to cloud-native resources using short-lived certificates. He highlights the risks associated with passwords and showcases the benefits of identity-native access, incorporating proof of presence, mutual authentication, and device security. The talk provides practical steps for adopting certificate-based authentication and improving security posture for Kubernetes, databases, and other cloud resources.

Kubernetes Troubleshooting Reimagined: Operators and Auto-Tracing

Kubernetes operators help to simplify, streamline, and automate application tasks beyond the conventional Kubernetes offerings. In this webinar, AWS Developer Advocate for Kubernetes, Lukonde Mwila, will delve into the remarkable capabilities of Kubernetes operators and how to leverage them in your applications. You’ll also learn how Lumigo built a Kubernetes operator for seamless distributed tracing leveraging OpenTelemetry. We will also demonstrate how our operator transforms complex processes into a single command, promising an unmatched user experience and exceptional app health insights.

Solving the Never Ending Requirements of Authorization - Civo Navigate NA 2023

In this talk, Alex Olivier shares their personal experience with the challenges of constantly changing authorization requirements in software systems. They discuss the drawbacks of traditional if-else statement-based authorization logic and propose a more efficient and scalable solution using an authorization service called Cerbos. The talk explores the benefits of decoupling authorization logic into policies, providing a centralized and maintainable approach with a clear audit trail.

Exploring AKS networking options

At KubeCon 2023 in Amsterdam, Azure made several announcements and introduced a range of updates and new options for Azure-CNI (Azure Container Networking Interface). These changes will help Azure Kubernetes Service (AKS) users solve some of the pain points they faced in previous iterations of Azure-CNI, such as IP exhaustion and large cluster deployments with custom IP address management (IPAM).

Enable and use GKE Control plane logs

Are you having issues with the control plane components in your GKE cluster? Are you interested in gaining visibility into the control plane side of the cluster so you can troubleshoot issues yourself? Then GKE control plane logs are a great way to gain insight into what's going on with your cluster. In this video, we provide a quick overview of control plane components and logs, and show how to enable control plane logs on new and existing GKE clusters. Watch this video to learn how to use control plane logs to troubleshoot webhook and control plane latency issues in GKE clusters.

Kubernetes Troubleshooting with Operators and Auto-Tracing

Kubernetes has revolutionized the way we manage and deploy applications, but as with any system, troubleshooting can be a daunting task. Even with the multitude of features and services Kubernetes provides, when something goes awry, tracking down the cause can feel like searching for a needle in a haystack. This is where Kubernetes Operators and Auto-Tracing come into play, aiming to simplify the troubleshooting process.

Monitoring MLOps Workflows with Flyte-powered Grafana Dashboards - Civo Navigate NA 2023

Learn how to monitor MLOps workflows effectively with Flyte-powered Grafana dashboards in this talk from Navigate NA 2023. Shivay Lamba discusses the importance of monitoring in the MLOps journey, highlighting the unique challenges in monitoring machine learning models compared to standard software development. Discover how Flyte, an open-source project, can help manage and monitor ML tasks efficiently, and see a live demonstration of setting up Grafana dashboards to visualize system metrics like CPU and GPU utilization. Take advantage of this opportunity to enhance your MLOps monitoring skills and optimize your machine learning workflows.