Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on APIs, Mobile, AI, Machine Learning, IoT, Open Source and more!

How to Build Reliable Data Collection Workflows

When your business depends on form data, even small issues can turn into big problems. Missing submissions, incomplete responses, or delayed updates can break your workflow without you even noticing. As your operations grow, these gaps become harder to track and fix. Let's break down how to build a data collection workflow that stays reliable as you scale.

The 4 Golden Signals of Monitoring Explained

As a team, we have spent many years troubleshooting performance problems in production systems. Applications have become so complex that you need a standard methodology to understand performance. Our approach to this problem is called the Golden Signals. By measuring these signals and paying very close attention to these four key metrics, providers can simplify even the most complex systems into an understandable corpus of services and systems.

AI Cost Management: How To Track, Allocate And Optimize AI Spend

AI cost management is the practice of tracking, allocating, and optimizing the cloud infrastructure costs tied to building, running, and scaling AI workloads. It differs from traditional cloud cost optimization because AI infrastructure behaves differently at every layer of the stack. The biggest problem isn’t overspending. It’s that most organizations can’t see where their AI spending is going.

Observability and Security for the AI Era

Datadog has always been driven by a broader vision of helping teams understand and operate complex systems. In this session, you’ll hear from Yrieix Garnier, VP of Product, and Hugo Kaczmarek, Senior Director of Product, as they share the latest updates across the Datadog product suite and discuss how that vision continues to shape the platform’s evolution and support the next generation of AI-driven applications.

Winning in the AI Era: How Top Teams are Driving Their Velocity Gains with Alloy & Chime

While most teams struggle with the complexity of AI-generated code, Alloy and Chime have built internal cultures and processes that enable them to scale their development while maintaining quality. Join CircleCI’s CTO, Rob Zuber, in conversation with Maciej Makowski, Senior Software Developer at Chime, and Sunny Singh, Senior Software Engineer at Alloy, as they explore the dynamics that set their teams apart. They'll talk through the culture and delivery practices that actually moved the needle.

How Much Does It Cost To Keep Up With The AI Joneses?

I’ve been an engineering leader for over a decade, and I’ve spent most of those years in private Slack groups with other engineering leaders, comparing strategies and kvetching about Kubernetes. Of the hundreds of threads I’ve taken part in, the one that got the most engagement the fastest was a recent one around AI adoption. “Where are you on this continuum?”, it read. “A. You don’t really care how people use AI; B. You push people to use AI; or C.

Transform ticket hell into smooth operations #ITSM #AI

Infraon ITSM uses advanced "ai" capabilities to manage operational noise, significantly boosting "business efficiency". It features a robust "ticketing system" and "sla" management for prompt resolutions, alongside self-service portals and a comprehensive "knowledge base" to enhance the "service desk" experience.

Enhancing our API for better agentic consumption

AI coding agents like Claude Code and Codex are becoming a real part of developer workflows. They don't just write code, they call APIs, interpret responses, and take action based on what they find. That means the quality of your API responses directly affects how useful an agent can be. We've shipped a series of improvements to the Oh Dear API with this in mind. Every change helps humans too, but we specifically optimized for how agents consume and reason about data.