Operations | Monitoring | ITSM | DevOps | Cloud

AI in Software Delivery: Engineering Excellence or Just Market Hype? | Harness Blog

AWS re:Invent 2025 made one thing very clear: enterprise interest in AI is no longer theoretical. The conversation has moved beyond curiosity. Teams are actively experimenting, leaders are looking for production-ready use cases, and engineering organizations are trying to figure out where AI can create real leverage across software delivery, security, platform engineering, and operations.

Faster incident investigation with BigPanda and ServiceNow Now Assist

When an incident occurs, an L2/3 engineer or SRE can spend 20–30 minutes investigating across alert consoles, combing through change records, and pinging teams on Slack or Microsoft Teams. When you multiply that time spent across thousands of incidents per year by the cost of an IT outage at $14,056 per minute, the cost is staggering. Enterprises can’t afford to waste time searching across disparate tools.

The most debated DORA metric (even Google debates this)

What's the most debated DORA metric? Nathen H from Google's DORA team breaks down the change lead time debate — and why even the experts can't fully agree on when a change is "committed." Is it at commit? After merge? The answer matters more than you think. Subscribe for more DevEx and DORA insights from our Web Summit series.

AI Supply Chain Attacks Are Here. And Most Organizations Aren't Ready

When I read about the Vercel breach tied to a Context AI compromise, I wasn’t surprised. I’ve been talking with customers for a while now about how AI was going to introduce a new kind of supply chain risk. This is exactly what that looks like. What stands out to me is how familiar the pattern is. We saw it with open source, then again with SaaS, and again with cloud.

Why Does MTTD Stay High Despite Observability Tools Running?

Monitoring coverage, anomaly detection, and SLO-based alerting have significantly narrowed detection windows for most failure types, but MTTD remains stubbornly high for a specific silent failure. This blog covers why type mismatches, swallowed exceptions, and values that pass validation without occurring without triggering errors, and what changes when your monitoring stack can generate those signals without waiting for a failure to surface them.

How the Coralogix CLI Adds Production Intelligence to Any Agent for Any Use Case

The new interface into production telemetry is a tool call, made from whichever agent runtime the operator happens to be using at that moment. A finance lead in Claude Code, a product manager in Cursor, an engineer in Codex. Three different jobs, three different agents, three different reasoning loops. The thing they have in common is the data layer underneath.

AI Enablement for Dev Teams: The 6-Pillar Flywheel

AI adoption is already happening on your team, whether you have a strategy or not. Tracy Lee (CEO of This Dot Labs, Microsoft MVP, Google Developer Expert) breaks down the AI Enablement Flywheel — a 6-pillar framework used by successful engineering organizations to move from scattered experimentation to scalable, ROI-positive AI workflows.

Federated Search | From Silos to Insight | Unified Datasets in AWS S3 with Ingest Processor

Are storage costs and data silos slowing down your investigations? In this video, we dive into the Unified Dataset Experience to show you how to search data where it lives. Learn how to use the Splunk Ingest Processor to route high volume logs directly to AWS S3 while maintaining instant visibility via Federated Search. No more re-hydrating data, just fast cost-effective insights.