Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Application Performance Monitoring and related technologies.

Monitoring Claude Code Usage with OpenTelemetry and SigNoz

In this video, we’ll walk you through how to monitor Claude code activity using OpenTelemetry and SigNoz. You’ll learn how to instrument your usage, capture telemetry data, and visualize it with SigNoz to get better insights into your system performance. Whether you’re exploring observability for AI workloads or looking for an open-source solution to monitor your llm activity, this guide will help you get started.

Full Session Simulation - Simulate Anything, Everything, Anywhere

Full Session Simulation is a powerful troubleshooting strategy. Have you ever been in a situation where everything on your dashboards looks green, but users are still encountering issues and raising support tickets?The cliche of “everything is fine on our side” moment is not just frustrating for everyone. It’s risky! Because when you can’t replicate what the user is experiencing, you’re flying blind.

What is APM Tracing?

APM tracing records the complete execution path of a request as it travels through your system, including database queries, external API calls, cache lookups, message queue events, and inter-service requests. Each step is captured with precise start and end timestamps, duration, and context such as service name, operation name, and relevant attributes. This lets you pinpoint where latency or errors originate without piecing together metrics and logs manually.

Cost Controls and so Much More: Issue Detection Through Usage Analysis

Keeping tabs on cloud spending across multiple organizations and vendors, including Datadog, can be tough and costly. If you're not tracking expenses, you're also missing other critical insights. The Flight Centre Travel Group (FCTG) faced this when moving to Datadog, needing to monitor costs across numerous organizations and over 180 Azure subscriptions. After a rapid migration, new cost reports quickly revealed more than just financial benefits. Unusual spending patterns often highlighted incidents, bugs, or security issues, offering early warnings about internal system problems.

Bridging the Gap: Legacy Systems and Modern Observability

Technology moves quickly and while the spotlight has shifted to dynamic, cloud-based systems, many organizations have legacy applications and infrastructure that they must maintain. In this fireside chat, Datadog’s Matt Moore (Principal Observability Strategist) will host James Flores (Enterprise Systems Engineer) at Australian Community Media to discuss their journey of modernization and bridging legacy systems with the cloud using a bit of ingenuity and observability.

Bringing Observability to Claude Code: OpenTelemetry in Action

AI coding assistants like Claude Code are becoming core parts of modern development workflows. But as with any powerful tool, the question quickly arises: how do we measure and monitor its usage? Without proper visibility, it’s hard to understand adoption, performance, and the real value Claude brings to engineering teams. For leaders and platform engineers, that lack of observability can mean flying blind when it comes to understanding ROI, productivity gains, or system reliability.

Azure Data Factory Monitoring Integration

Microsoft Azure Data Factory is a cloud-based data integration service provided by Microsoft Azure. It enables you to create, manage, and automate data workflows that move and transform data from different sources to various destinations. Essentially, ADF allows you to design, orchestrate, and manage data pipelines, making it easier to work with large volumes of data across on-premises and cloud environments.

kubectl logs: How to View & Tail Kubernetes Pod Logs

When debugging containerized applications in Kubernetes, kubectl logs serves as your primary command-line tool for accessing container logs directly. Understanding how to effectively retrieve, filter, and analyze logs becomes essential for maintaining application health and resolving issues quickly, especially in multi-container environments where correlation across services can make or break your troubleshooting efforts.