First Look at the Next-Generation OnPage Enterprise Web Management Console

Get a first look at the next-generation OnPage Enterprise Web Management Console, a modernized platform designed to help critical response and operations teams across IT, Healthcare, and other industries improve visibility, streamline communication workflows, and respond faster from one centralized interface.

AI Asked Our General Counsel Anything. She Didn't Hold Back.

What happens when AI interviews a tech leader? You get unexpectedly honest answers. Harness General Counsel Hanna Steinbach sat down with ChatGPT — and skipped the corporate script. From the realities of parenting while leading a legal team at a high-growth startup, to the daily habits that keep her grounded, this is the kind of candid leadership perspective you rarely see. Oh, and she's definitely the person sprinting to the gate right as boarding starts.

The AI Agent Accountability Crisis: Why Governance Isn't Keeping Up With Deployment

Every enterprise is building AI agents. Marketing has one summarizing campaign performance. Engineering has one triaging incidents. Customer support has one resolving tickets. Finance has one processing invoices. Each was built by a different team, using a different framework, with different assumptions about security. Now those agents are talking to each other through agent-to-agent (A2A) communication. The incident-triage agent calls the customer-support agent to check affected accounts.

KubeVirt Live Migration Done Right: What it Takes to Run VMs on Kubernetes

Running VMs in Kubernetes sounds like a crazy workaround for avoiding vendor lock-in and for standardizing legacy applications and newer containerized workloads on one control plane, with one set of security policies to govern them all. It is, however, a rapidly growing pattern, and KubeVirt live migration (moving running VMs between nodes without downtime) is increasingly central to platform engineering use cases that require full VMs, like on-demand CI/CD pipelines.

Automated Release Management: From CABs to Continuous Delivery | Harness Blog

The thing with Change Advisory Boards is that the intent was always good. Get smart people in a room, look at the evidence, and make sure nothing catastrophic goes out the door. In theory, that's hard to argue with. In practice, it doesn't scale. Things happen between meetings. Teams rush to hit the window. The CAB meeting may not catch every risky deployment, but at least everyone can feel good about the process before the incident happens. Automated release management asks a different question entirely.

New ways to agentically build and edit dashboards

The traditional dashboard workflow, teams slowly handcrafting visualizations to track critical KPIs, is dying in a world of AI agents. A few years ago, in a pre-agentic-everything world, we tried to make it easier for developers to monitor critical experiences. We introduced Insights pages, which were pre-configured dashboards any Sentry user could adopt instantly that surfaced common health signals, like Web and Mobile Vitals.

90 Days Isn't Enough Notice: What Predictive Churn Warning Actually Looks Like

Your customer started their renewal evaluation on a Tuesday in March. You did not know about it. Their CFO had asked the procurement lead to "look at alternatives" during a quarterly budget review. Three weeks later, a competitor's SDR was on a discovery call with their head of operations. By the time your CS platform's health score turned amber, six weeks had passed inside their building. This is what most CS leaders miss when they evaluate early warning systems.

Simplify micro-frontend observability with Datadog RUM

Micro-frontend architectures, where independent teams build and deploy separate parts of a frontend application, introduce an observability challenge: Telemetry data is fragmented across services, making it difficult to determine which micro-frontend caused a performance degradation or error spike.

Attribute AI costs across providers with Datadog Cloud Cost Management

AI adoption is accelerating across organizations, and spending often follows a similar pattern: rapid growth, multiple providers, and limited visibility into where costs originate. Each provider exposes billing data differently, with distinct schemas, dimensions, and interfaces. FinOps and engineering teams often spend significant time consolidating fragmented data, only to end up with partial attribution and limited context about who or what generated the AI spending.

Improvements to our status pages as we tackle a DDoS

The uptime and availability of our status pages haven't been great these past few days. The root cause is a persistent and pretty aggressive DDoS attack targeted at our own status page, status.ohdear.app. As a result, the overload on our systems also affected all other status pages we host for clients. We're not yet at GitHub or Claude levels of uptime sadness, but this isn't acceptable to us. In this post, I'll share what's happening and what steps we've already taken.