Kubernetes Troubleshooting Reimagined: Operators and Auto-Tracing

Kubernetes operators simplify, streamline, and automate application tasks beyond what Kubernetes offers out of the box. In this webinar, Lukonde Mwila, AWS Developer Advocate for Kubernetes, explores what operators can do and how to use them in your applications. You’ll also learn how Lumigo built a Kubernetes operator for distributed tracing with OpenTelemetry, and see a demonstration of how that operator reduces a complex setup process to a single command and surfaces application health insights.
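For readers new to the operator pattern, the core idea is a reconcile loop that compares desired state with observed state and works out what to change. The sketch below is illustrative only: `DesiredState` and `reconcile` are hypothetical names, no Kubernetes API is called, and it is not Lumigo's operator.

```python
from dataclasses import dataclass


@dataclass
class DesiredState:
    """Hypothetical stand-in for the spec of a custom resource."""
    replicas: int


def reconcile(desired: DesiredState, observed_replicas: int) -> list[str]:
    """Return the actions needed to move observed state toward desired state."""
    diff = desired.replicas - observed_replicas
    if diff > 0:
        # Too few replicas running: ask for the missing ones.
        return ["create pod"] * diff
    if diff < 0:
        # Too many replicas running: remove the surplus.
        return ["delete pod"] * (-diff)
    # Observed state already matches; nothing to do.
    return []
```

A real operator would run this kind of comparison each time the watched resource changes and then apply the resulting actions through the Kubernetes API.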

Unleashing the Power of IT Event Management - Infraon

Are you tired of reactive IT approaches that leave you firefighting issues? IT event management is your proactive solution! By actively monitoring and managing events, you gain invaluable insights into system performance, availability, and overall health, enabling you to pre-emptively tackle potential disruptions. Say goodbye to downtime and hello to seamless operations! So, are you ready to revolutionize your IT service delivery and achieve new levels of success? Don't miss out on the opportunity to harness the true potential of IT event management!

The broader approach on Azure monitoring

This episode of the Azure On Air podcast tackles the challenges of monitoring IT infrastructure and of moving from on-premises to the cloud. Pedro Sousa, Microsoft Azure MVP, advocates a shift from traditional monitoring to a holistic observability approach that starts with an understanding of business needs and works down to infrastructure details. He also offers advice on migrating from on-premises to Azure, emphasizing that observability principles stay consistent across environments.

Securing Access to Cloud Native Resources with Certificates - Civo Navigate NA 2023

In this talk, Alan Vailliencourt, a Senior Solutions Engineer with Teleport, discusses the importance of moving away from passwords and securing access to cloud-native resources using short-lived certificates. He highlights the risks associated with passwords and showcases the benefits of identity-native access, incorporating proof of presence, mutual authentication, and device security. The talk provides practical steps for adopting certificate-based authentication and improving security posture for Kubernetes, databases, and other cloud resources.

Dive Deeper into your Trace and Logs Data with Query Builder - Community Call Aug 1

In this week's community call we show our new Trace explorer, which has a GUI for creating queries, custom dashboards, and alert thresholds. Thank you to everyone in the community who took part. SigNoz is an open-source alternative to DataDog, New Relic, etc., backed by Y Combinator. It helps developers monitor applications and troubleshoot problems in their deployed applications, using distributed tracing to gain visibility into the software stack.

How to monitor your Python app performance with Site24x7

What is Site24x7 APM Insight? How prevalent and important is Python code in application design, and how do you monitor its performance? Site24x7 helps monitor Python app performance with its agent-based APM Insight. Site24x7 APM provides metrics such as response time, throughput, database operations, and errors for your Python applications.
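To make metrics such as response time, call volume, and errors concrete, here is a minimal standard-library sketch of collecting them for a single function. It is not the Site24x7 agent or its API: `monitored`, `summary`, and `handle_request` are hypothetical names used only for illustration.

```python
import time
from collections import defaultdict
from functools import wraps

# Hypothetical in-process metrics store; a real APM agent would export these.
_stats = defaultdict(lambda: {"calls": 0, "errors": 0, "total_seconds": 0.0})


def monitored(func):
    """Record call count, error count, and elapsed time for each call."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        entry = _stats[func.__name__]
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        except Exception:
            entry["errors"] += 1
            raise
        finally:
            # Runs on both success and failure, so every call is counted.
            entry["calls"] += 1
            entry["total_seconds"] += time.perf_counter() - start
    return wrapper


def summary(name):
    """Return call count, error count, and average response time for one function."""
    entry = _stats[name]
    calls = entry["calls"]
    avg = entry["total_seconds"] / calls if calls else 0.0
    return {"calls": calls, "errors": entry["errors"], "avg_response_seconds": avg}


@monitored
def handle_request(payload):
    """Hypothetical request handler used only to exercise the decorator."""
    if payload is None:
        raise ValueError("empty payload")
    return payload.upper()
```

Dividing the call count by the length of an observation window would give a rough throughput figure; an agent-based tool gathers comparable data without hand-written decorators.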

Enable and use GKE Control plane logs

Are you having issues with the control plane components in your GKE cluster? Do you want visibility into the control plane side of the cluster so you can troubleshoot those issues yourself? GKE control plane logs are a great way to gain insight into what's going on with your cluster. In this video, we give a quick overview of control plane components and logs, and show how to enable control plane logs on new and existing GKE clusters. Watch this video to learn how to use control plane logs to troubleshoot webhook and control plane latency issues in GKE clusters.