%term

Infinite Cardinality Metrics: Custom metrics built for modern systems

Jun 9, 2026 By Josh Mirchin In Datadog

Every technology shift adds new context you need to measure. Cloud computing added regions and services. Kubernetes added containers and pods. Multi-tenant applications added users and tenants. AI systems add models, prompts, agents, and execution paths. The result is that metrics are becoming dramatically more dimensional, faster than ever before. Over time, engineers are forced to make tradeoffs.

Read Post

Datadog

Read more about Infinite Cardinality Metrics: Custom metrics built for modern systems

Search and act across Datadog to resolve issues faster with Bits Chat

Jun 8, 2026 By Nicole Parisi In Datadog

Finding the right information across dashboards, monitors, and telemetry sources takes time, even for experienced engineers. When something breaks, it often means figuring out where to start, rebuilding queries, and jumping between metrics, logs, and traces before you can take action. The challenge isn’t a lack of data but the effort required to surface the right information at the right moment.

Read Post

Datadog

Read more about Search and act across Datadog to resolve issues faster with Bits Chat

Give your AI agents live Datadog access from the command line

Jun 5, 2026 By Cody Lee In Datadog

AI agents are becoming a standard part of how engineers write, deploy, and troubleshoot software. Getting observability data into those workflows, securely and without manual intervention, remains the harder problem.

Read Post

Datadog

Read more about Give your AI agents live Datadog access from the command line

Introducing Bits Agent Builder: Build agentic workflows for alert response and remediation

Jun 4, 2026 By Amber Tunnell In Datadog

Building automated workflows that adapt to real-world complexity can be a challenge. As systems scale and scenarios multiply, teams often end up hardcoding endless logic branches just to handle every potential outcome. That’s why we’re introducing Bits Agent Builder, a powerful new tool that lets you create custom AI agents that are fully hosted by Datadog.

Read Post

Datadog

Read more about Introducing Bits Agent Builder: Build agentic workflows for alert response and remediation

Migrate to Azure Managed Redis with Datadog and Eden

Jun 1, 2026 By Michael Cronk In Datadog

Azure Managed Redis is a Microsoft first-party, fully managed in-memory data store, replacing Azure Cache for Redis tiers. It includes Redis Enterprise features such as RediSearch for vector search and full-text search, in addition to RedisJSON, RedisTimeSeries, and Active Geo-Replication. As Azure Cache for Redis reaches end of life, more teams are planning migrations to Azure Managed Redis in search of better performance, lower cost, and modern capabilities for AI and real-time workloads.

Read Post

Datadog

Read more about Migrate to Azure Managed Redis with Datadog and Eden

How we cut Spark compute costs by 44% with agentic AI and Datadog Jobs Monitoring

Jun 1, 2026 By Charles Yu In Datadog

Spark jobs only get more expensive and harder to debug as they scale. It’s a problem we’ve run into ourselves. Our Referential Data Platform team builds and maintains the knowledge graph that maps relationships between customers’ observability entities. ServiceQueryEdge is at the center of that graph, mapping service entities to their associated metric and log queries.

Read Post

Datadog

Read more about How we cut Spark compute costs by 44% with agentic AI and Datadog Jobs Monitoring

A deep dive into AWS data perimeter misconfigurations

Jun 1, 2026 By Mallory Mooney In Datadog

In AWS environments, a data perimeter is a set of preventative controls that help ensure that your trusted cloud identities (principals or AWS services acting on your behalf) are accessing trusted resources from authorized networks. You can apply these controls at various levels of your infrastructure, such as per resource or across all resources in your AWS account.

Read Post

Datadog

Read more about A deep dive into AWS data perimeter misconfigurations

Monitor LLM routing with the Kubernetes Inference Extension

May 29, 2026 By David Lentz In Datadog

If you serve LLMs on Kubernetes without inference-aware routing, your load balancer is likely wasting inference capacity. Generic HTTP traffic management blindly routes requests, assuming the backends in your cluster are interchangeable. But your model-serving backends are stateful and unevenly prepared to handle any given request. As a result, requests are often routed to the backend that’s not the one best suited to respond.

Read Post

Datadog

Read more about Monitor LLM routing with the Kubernetes Inference Extension

How a unified data model improves feature flag rollout decisions

May 29, 2026 By Bridgitte Kwong In Datadog

Consolidation is reshaping the experimentation and feature management landscape. Tools are merging, and partnerships are being repackaged as platforms. But marketing a unified experience is not the same as building one. Right now, engineering leaders and product managers are reassessing whether the tools they depend on are built for the long term. It’s irrelevant which vendor has the most products.

Read Post

Datadog

Read more about How a unified data model improves feature flag rollout decisions

Monitor Azure Managed Redis with Datadog

May 28, 2026 By Michael Cronk In Datadog

Azure Managed Redis is Microsoft’s fully managed, enterprise-tier in-memory data store. It is designed for the low-latency caching, session storage, and real-time data needs of modern applications, including AI workloads that depend on fast vector and embedding lookups. Because user-facing applications often query Redis directly, even small regressions in latency, hit rate, or memory pressure can degrade the user experience.

Read Post

Datadog

Read more about Monitor Azure Managed Redis with Datadog

Operations | Monitoring | ITSM | DevOps | Cloud

Infinite Cardinality Metrics: Custom metrics built for modern systems

Search and act across Datadog to resolve issues faster with Bits Chat

Give your AI agents live Datadog access from the command line

Introducing Bits Agent Builder: Build agentic workflows for alert response and remediation

Migrate to Azure Managed Redis with Datadog and Eden

How we cut Spark compute costs by 44% with agentic AI and Datadog Jobs Monitoring

A deep dive into AWS data perimeter misconfigurations

Monitor LLM routing with the Kubernetes Inference Extension

How a unified data model improves feature flag rollout decisions

Monitor Azure Managed Redis with Datadog

Monthly Archive

Follow Us