Operations | Monitoring | ITSM | DevOps | Cloud

Metrics for Apache Kafka with Datadog and Aiven | Ryan Martin (Aiven)

Using managed services is all very well, but how do you get the data you need from the different services into Datadog so you can see it all in one place? This session will walk through the configuration for bringing your Aiven-managed Apache Kafka service metrics into your Datadog explorer. You’ll see how to filter the metrics to focus on specific topics or consumer groups, and how to use the Aiven client to create a repeatable, scriptable setup. This session is recommended for anyone living in the as-a-Service world who cares about data and is interested in using metrics to optimize their Kafka clusters.

Monitoring Open Source Success in Arduino | Silvano Cerza (Arduino)

Arduino is an open-source hardware and software company, project, and user community that designs and manufactures single-board microcontrollers and microcontroller kits for building digital devices. In the course of developing software downloaded and used by millions around the world, we have found it vitally important to be aware of the quality and performance of our software.

This Month in Datadog: September 2021 (Episode 5)

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. This month we put the Spotlight on Session Replay, go around the Water Bowl with Decalog, introduce a bunch of new features, and give you more information about our flagship conference Dash..

This Month in Datadog: August 2021 (Episode 4)

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. For the August 2021 episode, we take you behind the scenes to our NYC office, and sit down with two people from our Product leadership team.

Datadog on Chaos Engineering

As you scale your applications, remaining resilient to underlying network failures, resource constraints introduced by other applications, or spikes in traffic can become exponentially more complex, even with very thorough testing and processes. Chaos engineering is a discipline that encourages experimenting in production and injecting controlled failures into the system to understand how the system will react in such conditions and to improve its reliability.