The latest News and Information on APIs, Mobile, AI, Machine Learning, IoT, Open Source and more!

last9-genai: Closing the Conversation Gap in LLM Observability

OpenTelemetry's GenAI instrumentation gives you spans and token counts. It does not give you conversations, workflow cost rollups, or prompts visible in your dashboard. last9-genai is an OTel extension that fills those three gaps without replacing your existing observability stack. Prathamesh works as an evangelist at Last9, runs SRE stories, where SRE and DevOps folks share their experiences, and maintains o11y.wiki, a glossary of observability terms.

Automate your critical workflows with AI agents in 5 steps

Many teams remain bogged down by operational chaos and manual drudgery, even with access to a variety of automation solutions. These tools often operate in silos, creating disconnected islands of automation that require significant human effort to bridge. Agentic AI offers a path forward, creating a cohesive system that can intelligently and autonomously handle complex operational workflows.

Your CEO Wants You To Ramp AI Usage Without Breaking Budgets. Here's How You Can Do It

Notes from a finance leader whose job this is. A few weeks ago, I traveled to Philadelphia for a conversation with a prospective CloudZero customer. We’d been working with the prospect’s engineering team for some weeks, demoing our platform in view of the RFP they’d drawn up. This stage had gone well, and so the next step was talking it over with the prospect’s CFO. We expected a conversation centered around the key criteria in the RFP.

Your AWS Kiro Agent Can Now Query CloudZero. Here's What To Ask It

CloudZero's new AWS Kiro integration puts cost intelligence directly in your agentic IDE. Ask plain-language questions about spend, attribution, and cost-per-serve without leaving your development workflow. We see a similar pattern playing out across engineering teams running agentic development tools: code gets shipped fast, something moves in the cost data, and understanding why still requires leaving your environment entirely.

Why Runtime Visualization Is the Missing Link in Teaching Real-Time Systems

Guest blog by Florent Goutailler, Associate Professor, Télécom Saint-Etienne, France. Teaching real-time embedded systems has always involved a fundamental challenge: the most critical behaviors – task scheduling, timing, and concurrency – are largely invisible at runtime. When students begin working with a real-time operating system such as FreeRTOS, they are introduced to concepts like scheduling, task prioritization, semaphores, and inter-task communication.

Stop watching the looms: why the AI era belongs to infrastructure

I live in Manchester, England now. I moved here from Texas last summer (which is its own story), but the thing I wasn't prepared for is how the Industrial Revolution isn't history here. It's the city itself. And if you're American like me, you might need to hear this: the Industrial Revolution didn't start in the US. It started here. Manchester is where the modern world was born. You see it everywhere. The old cotton mills converted into apartments.

LogicMonitor Advances Autonomous IT with No Blind Spots, Trusted AI, and Closed-Loop Action

LogicMonitor’s latest innovations span the entire platform to deliver the operational foundation enterprises need for Autonomous IT—complete visibility from infrastructure to end user, AI that reasons in full context, and closed-loop automation that moves from detection to resolution. Over 90% of organizations rely on at least two to three monitoring solutions—and many enterprises operate five or more.

Context-Driven AI You Can Trust: How Edwin AI Earns Confidence in Production

Most legacy AIOps investments underdeliver because the AI lacks context, not capability. LogicMonitor’s latest innovations expand Edwin AI’s contextual intelligence across every dimension, so recommendations are accurate, explainable, and trusted by the teams that need to act on them. Reduce incident resolution time with AI that understands your environment—not just your alerts.

Who's on call? How Claude helped us calculate this 2,500x faster

Schedules are a core part of any on-call system. In ours, they define who to page and when. But people use them in lots of other ways too: checking their next shift, asking for cover while at the gym, keeping a Slack user group up to date, or updating a Linear triage responsibility. For many of our customers, they’re one of the main ways they interact with our product, and as they’re such a foundational part of On-call, it’s very important they work well.

When Offshore Software Development Makes Sense (and When It Doesn't)

Offshore development isn't a universal solution, and treating it like one is how companies end up with cautionary tales instead of successful products. The decision to go offshore should be strategic - based on your specific circumstances rather than the generic promise of "same quality at lower cost" that every vendor website offers. This article provides an honest framework for deciding whether offshore development fits your situation - and equally important, when it doesn't.