Operations | Monitoring | ITSM | DevOps | Cloud

Ameet Talwalkar on Building the AI Research Lab

"We're doing cutting-edge AI, focused on real translational impact: getting our research over the wall and into production." Ameet Talwalkar, Datadog's Chief Scientist, shares what it took to build the AI Research Lab from the ground up — and what makes DAIR different from traditional research teams. At Datadog, research ships. Recent work from the lab includes Toto 2.0, open-weights time series forecasting models ranked on leading benchmarks, and ARFBench, a new benchmark for evaluating AI on real incident data.

Instant Java Client SDK, no spec required!

Learn how to generate a client SDK for a production service when you have no documentation, no OpenAPI spec, and no remaining team knowledge of the original Ruby code. This demo shows you how to capture real production data from a running app and transform it into a functional Java client library in minutes. Visit proxymock.io OR speedscale.com to learn more.

Search Azure Blob data in-place with BYOS for Cribl Lake

See how Bring Your Own Storage (BYOS) in Cribl Lake allows teams to connect directly to Azure Blob Storage and instantly search data in place — without moving, duplicating, or rehydrating telemetry. In this demo, Cribl Product Manager Risk Salsa walks through setup, dataset creation, and how to run fast investigations across your Azure-hosted data using Cribl Search.

AI Might Break Open Source Differently Than You Think

AI coding agents may not replace open source libraries overnight. But Adam Arellano, Field CTO at Harness, thinks models like Mythos could expose a bigger problem: finding bugs, vulnerabilities, and edge cases faster than maintainers can keep up. That might be the real threat to tools and libraries.

Lessons From a CI/CD Supply Chain Attack at Grafana Labs

When a compromised GitHub Actions workflow targets your CI/CD pipeline, how do you respond — and what do you change so it never happens again? Nick and David from Grafana Security walk through a real supply chain incident triggered by a pull_request_target misconfiguration, showing exactly what broke, what tools caught it, and what the team rebuilt afterward.

Getting Started with gcx: A CLI for AI Agents and Grafana Telemetry | Demo

AI agents are only as useful as the context they can access. With gcx, your coding agents can connect to Grafana and query real-time production telemetry from your Cloud, Enterprise, or OSS environment. The best part: it avoids the upfront context bloat that can come with loading tools before you even send a prompt. gcx uses a CLI approach, so there’s zero token cost until your agent actually needs to run a query.

Best Practices in the Slack Experience

PagerDuty’s slack experience is evolving to help your teams organize better and resolve incidents faster. Use Triage Channels to collect telemetry and updates from your systems. Create dedicated Incident Channels for coordination and resolution. Give stakeholders the updates they need in Announcements Channels. Everyone in your organization can get the information they need easily.

Inside the Grafana AI Team Weekly: Guards for AI Observability (May 5, 2026)

This is an excerpt from a real AI team weekly meeting where we talk about the stuff we build and occasionally also demo them! In this one, Principal Software Engineer Sven Großmann shows a new feature he's working on for AI Observability, called "guards". We're showing parts of our team meetings to build in public in some small way and give you a sneak preview of what's to come. But not all features we show may make it to production! You've been warned. :)