%term

Build an SRE Agent Harness for AIOps Without Context Blowout

Jul 9, 2026 By Mezmo In Mezmo

An agent harness for AIOps is the runtime layer that coding agents like Claude Code were never built to provide: context isolation, decision traceability, and gated execution for tools that touch production. Aura is Mezmo's open-source (Apache 2.0) agent harness, purpose-built for operations work rather than software development.

View Video

Mezmo

Read more about Build an SRE Agent Harness for AIOps Without Context Blowout

Builder in the loop: Tony Rogers on stress-testing AURA before production

Jun 22, 2026 By Mezmo In Mezmo

Builder in the loop is a Mezmo interview series focused on the engineers, product leaders, and operators shaping AURA, an open-source, MCP-native agent harness for production operations. This installment features Tony Rogers, whose work on AURA is less about building new features and more about trying to break them before users can.

Read Post

Mezmo

Read more about Builder in the loop: Tony Rogers on stress-testing AURA before production

Builder in the loop: Eric Lake on making AURA smarter after every incident

May 27, 2026 By Mezmo In Mezmo

Builder in the Loop is a Mezmo interview series focused on the engineers, product leaders, and operators shaping AURA, an open-source, MCP-native agent harness for production operations. The goal is to get past the polished product layer and talk through the decisions that matter when AI starts interacting with real systems. Key questions include: What should agents be allowed to do? How do they get better over time? Where should humans stay in the loop?

Read Post

Mezmo

Read more about Builder in the loop: Eric Lake on making AURA smarter after every incident

Why SRE agents need orchestration, not just more tools

May 19, 2026 By Mezmo In Mezmo

Single agents are a useful starting point for SRE workflows. They are not where the architecture should end. The first version is simple enough: connect an LLM to a few tools, give it a system prompt, and point it at your infrastructure. It can summarize an alert, pull logs, answer questions, and draft a useful next step. Then the workflow gets real. You add GitHub for runbooks, Kubernetes for cluster state, PagerDuty for incident context, Prometheus for metrics, and Mezmo for telemetry.

Read Post

Mezmo

Read more about Why SRE agents need orchestration, not just more tools

When your agents hallucinate at 2 am, it is not a model problem

May 15, 2026 By Mezmo In Mezmo

The first time an AI assistant suggests "restart the service" during a live incident and nobody on the bridge can tell whether that suggestion came from a current runbook, a stale wiki page, or thin air, you stop caring about model benchmarks. You start caring about what the agent actually knew, where that knowledge came from, and whether you can trust the chain of reasoning behind it.

Read Post

Mezmo

Read more about When your agents hallucinate at 2 am, it is not a model problem

Builder in the loop: Henry Andrews on building AURA like production software

May 13, 2026 By Mezmo In Mezmo

An interview series with the people building Mezmo’s open-source agentic harness for production operations. Builder in the loop is a Mezmo interview series focused on the engineers, product leaders, and operators shaping AURA, our open-source, MCP-native agentic harness for production operations. The goal is to get past the polished product layer and talk through the decisions that matter when AI starts interacting with real systems. What should agents be allowed to do?

Read Post

Mezmo

Read more about Builder in the loop: Henry Andrews on building AURA like production software

AURA in Practice: Mezmo's SRE bot, demo walkthrough

May 11, 2026 By Mezmo In Mezmo

A walkthrough of the Slack-based SRE bot Mezmo's engineering team built on AURA, the open-source agent harness, running against Mezmo's own production tooling. Adrian Furlong shows the bot answering questions in a DM with tool calls visible inline, then in a shared channel where it reads the conversation before responding. He opens a fresh PagerDuty incident on camera. The webhook fires AURA, and within seconds, the agent posts a triage note back on the incident and a structured analysis in the dedicated incident channel.

View Video

Mezmo

Read more about AURA in Practice: Mezmo's SRE bot, demo walkthrough

The Journey to Production AI: Five Steps for SRE and Platform Teams

May 8, 2026 By Mezmo In Mezmo

In a recent webinar, The Journey to Production AI, Andre Elizondo walked through what separates a working agent demo from an agent worth trusting on a 2 a.m. page. Live polls during the session put numbers behind a pattern most platform teams already feel. ‍ ‍ Most teams are early. The ones who are further along did not get there by shipping a flashier demo. They got there by treating production AI as a platform problem.

Read Post

Mezmo

Read more about The Journey to Production AI: Five Steps for SRE and Platform Teams

LiveTail: Real-Time Visibility for Active Telemetry

Apr 29, 2026 By Mezmo In Mezmo

See how Mezmo LiveTail helps teams move from passive log search to active, real-time investigation. In this demo, you'll watch live telemetry stream across services and environments, identify emerging issues as they happen, and use real-time context to troubleshoot faster before signals are delayed, buried, or lost in the noise. LiveTail is part of Mezmo's Active Telemetry platform — built for platform engineers and SREs who need immediate visibility into what's happening across their stack right now, not after the fact.

View Video

Mezmo

Read more about LiveTail: Real-Time Visibility for Active Telemetry

How Mezmo Uses Active Telemetry for Faster AI Root Cause Analysis

Apr 29, 2026 By Mezmo In Mezmo

AI-powered root cause analysis only works when the data going into the model is clean, relevant, and structured. In this demo, we show how Mezmo's Active Telemetry approach helps engineers and SREs move from noisy application errors to immediate clarity. Using a restaurant ordering application running in Kubernetes, we trigger a database connection pool exhaustion issue and walk through two ways to investigate it with Mezmo.

View Video

Mezmo

Read more about How Mezmo Uses Active Telemetry for Faster AI Root Cause Analysis

Operations | Monitoring | ITSM | DevOps | Cloud

Build an SRE Agent Harness for AIOps Without Context Blowout

Builder in the loop: Tony Rogers on stress-testing AURA before production

Builder in the loop: Eric Lake on making AURA smarter after every incident

Why SRE agents need orchestration, not just more tools

When your agents hallucinate at 2 am, it is not a model problem

Builder in the loop: Henry Andrews on building AURA like production software

AURA in Practice: Mezmo's SRE bot, demo walkthrough

The Journey to Production AI: Five Steps for SRE and Platform Teams

LiveTail: Real-Time Visibility for Active Telemetry

How Mezmo Uses Active Telemetry for Faster AI Root Cause Analysis

Monthly Archive

Follow Us