Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Streamline Incident Management with the New Netdata-ServiceNow Integration

When a critical alert fires at 2 AM, the last thing your on-call engineer should be doing is manual administrative work. Yet, for many teams, that’s exactly what happens. You see the alert in your monitoring tool, then you have to switch contexts, open a new browser tab, log into your ITSM platform, and manually create an incident—all while your systems are failing.

Reliability lessons from the 2025 AWS DynamoDB outage

On October 19th and 20th, 2025, the AWS region US-EAST-1 suffered a massive outage. What started with a 3-hour Amazon DynamoDB outage from a DNS issue led to an Amazon EC2 outage that lasted an additional 12 hours before normal service was restored. Over the course of the outage, there were over 17 million outage reports as companies like Snapchat, Roblox, Amazon, Reddit, Venmo, and more were impacted.

New Feature Friday: AI Readiness and AI Maturity

Everyone wants to move faster with AI. But are you ready for it? In this Feature Friday, Jeff from Cortex shares how working with AI tools like Claude helped him write better code — and why true AI maturity starts with solid engineering hygiene. You’ll learn: “With great power comes great responsibility… and better tests.".

From rollouts to results: Unlocking the value of Feature Management and Experimentation

Unlock Faster, Safer Releases with Feature Management and Experimentation Learn how top engineering and product teams use Harness Feature Management & Experimentation (FME) to accelerate innovation, reduce release risks, and continuously deliver value. In this on-demand webinar, Harness experts Alex Bock and Iram Khan share how to go beyond feature flags to achieve smarter, data-driven releases. Discover how to.

3 Ways to Embed Digital Strategy into DevOps and IT Operations

Let's be honest, in most companies, the people who handle "digital strategy" and the ones who keep the systems running barely speak the same language. The strategy folks are talking about growth, engagement, customer journeys. The ops teams? They're buried in uptime reports, patch schedules, and incident tickets. Somewhere in the middle, the actual connection between the two gets lost.

Speedscale Proxymock: Freely testing cloud native apps alongside AI code assistants

We’ll always remember 2025 as the year AI code assistants went big. Copilot, Cursor, Claude, Windsurf, whatever. Developers went from mistrusting these tools, to being expected to turn over much of their coding labor to them. Even if, according to an extensive Stack Overflow survey, only 3 percent of professional developers say they ‘highly trust’ AI coding tools.

How to Optimize Azure Costs and Improve Cloud Efficiency with FinOps

In this episode of the FinOps on Azure podcast, Dustin Mullenix from KPMG talks about his role leading a FinOps team that handles Azure spend for KPMG's audit business. He shares how internal FinOps teams work with consulting groups, the challenges of Azure's service tiers like managed disks and SQL options, and the trade-offs between cost and performance. Dustin discusses how to handle cost changes, manage knowledge in a big company, this talk is useful for anyone dealing with cloud costs in a large setup.

You're Late to the OpenTofu Party. Here's Why That's a Problem.

OpenTofu has emerged as the true open successor to Terraform, restoring transparency and community ownership after Terraform’s shift to a restrictive BSL license. With features like OCI registries, encryption at rest, and a public RFC process, it’s already outpacing Terraform’s innovation.