Operations | Monitoring | ITSM | DevOps | Cloud

OpenTelemetry Agents - The Complete Beginner's Guide (2025)

If you search for “OpenTelemetry Agent”, you will likely encounter two completely different definitions. This ambiguity often leads to confusion between infrastructure teams and application developers. SREs and DevOps engineers would describe it as a component deployed as a sidecar, whereas application developers would understand it as a language-specific library. Let’s break it down in the next section.

Building Trust in AI-Powered Kubernetes Ops: Why "Good Enough" Is a Production Killer

The air in the operations world is thick with AI and LLMs. EVERY vendor is rushing to slap an “AI-powered” badge on their product. But here’s the uncomfortable truth: In high-stakes Kubernetes operations, one bad AI recommendation can destroy months of trust-building in an instant. We aren’t building a chatbot to suggest recipes. We are building systems that, armed with kubectl permissions, have the potential to take down production with a single, wrong command.

The year in AI at Grafana Labs

2025 was the year we at Grafana Labs went all-in on AI—and boy, what a year it was. Not only did we establish and start to execute our overarching strategy (build actually useful AI), we also took one of our most exciting new features (Grafana Assistant) from idea to general availability in just nine months! Yes, there's no shortage of articles singing the praises of AI these days, but let's dispense with the hyperbole and focus on some actually useful content.

[Workshop] Building and Monitoring AI Agents and MCP servers

​See how Agent Monitoring gives you a better look at all things model usage, call duration, prompting, and more ​Go under the hood with MCP Monitoring - and learn how to debug client connection issues, tool call performance, transports, and all things MCP ​When things start breaking, use Seer, Sentry's AI Debugging Agent to troubleshoot those vague issues that are crashing and get help from a team of robots using Sentry’s AI PR Review.

Intelligent Systems Powering the Next Generation of Online Retail

Online retail is no longer driven only by attractive storefronts and competitive pricing. Behind every smooth shopping experience sits a complex network of decisions that must happen instantly and at scale. From predicting demand to responding to customer behavior in real time, cognitive AI agents are becoming a foundational layer that helps ecommerce businesses operate with speed, accuracy and consistency.

Struggling With Customer Drop-Off? AI Insights Can Help You Fix It Fast

Are you noticing more customers slipping away than sticking around? It's frustrating, right? Customer drop-off can feel like a mystery, but the good news is-it doesn't have to stay that way. Thanks to smart AI insights, you can quickly spot where things are going wrong and fix them before it's too late. Imagine having a clear map showing exactly why customers leave and what you can do to keep them coming back.

No-Code AI Tools That Are Changing Digital Marketing Forever

Artificial intelligence is no longer limited to data scientists or enterprise teams with large development budgets. Over the past few years, a new wave of no-code AI tools has emerged, allowing marketers to automate tasks, generate insights, and optimize campaigns-without writing a single line of code. For digital marketers, this shift is transformational. No-code AI tools reduce execution time, lower costs, and empower teams to focus on strategy rather than manual work. More importantly, they level the playing field, allowing small and mid-sized businesses to compete with larger brands.

Training Foundation Models on a Trillion Data Points with Apache Iceberg

Training an AI foundation model on over a trillion data points sounds impossible without hitting your production systems. Here's how Datadog did it with Apache Iceberg for their time series forecasting model TOTO. The key challenge: extracting massive historical observability data (metrics spanning years) and running incremental preprocessing pipelines without overwhelming production services. Iceberg solved this by providing schema governance, consistency guarantees, and seamless integration with ML tools like Ray and PyTorch.

Leading Open Source Teams w/ Daniel Roe

In this episode, Daniel Roe, Lead Maintainer of the Nuxt framework, discusses his journey from studying law and theology to leading a major open-source framework. He explains Nuxt's unique governance and how Nuxt manages contributions through volunteer-driven work, LLM-powered issue triage, and creating welcoming spaces for newcomers to open source. This week, our chat touches on a variety of topics including.