The latest news and information on AIOps, alerting in complex systems, and related technologies.

From Signal Corps to Space: Building Networks That Can't Fail with Troy MacDonald

What does it take to succeed in networking when complexity is constantly increasing, and change never slows down? In this episode of Next-Gen Network Heroes, host Bob Slevin sits down with Troy (David) MacDonald, a network engineer at Blue Origin and former U.S. Army Chief Warrant Officer, to explore a career that spans from infantry beginnings to designing and managing large-scale, mission-critical networks.

Resolve Reels - Ep. 4 - Agent Lab

Episode 4 of Resolve Reels is live! See how Agent Builder helps teams create purpose-built AI agents with the right guardrails, routing logic, and orchestration for enterprise operations. In this episode: build specialized agents with defined responsibilities; improve routing with conversation starters and guardrails; and test and operationalize agentic AI at scale. This is how enterprises move toward Autonomous Operations and Zero Ticket IT.

The New Compliance Crisis: AI Is Outrunning Its Controls

Enterprises have spent decades refining compliance frameworks around workflows that were linear, predictable, and well-documented. These frameworks were built for systems that executed actions deterministically and for human operators who made decisions slowly enough for oversight to keep up. In that environment, compliance could function as a retrospective discipline because the evidence required to validate behavior generally existed in complete, stable form.

Why Network Operations Needs Data-Centric AI

The discussion around AI in infrastructure and operations has become increasingly model-centric. Teams want to know what model a platform uses, how current it is, how much reasoning capacity it has, and how quickly it can be updated as the model landscape shifts. Those are reasonable questions, but they tend to arrive too early. In production operations, the more consequential question is what happens to the data before any model is asked to interpret it.

How to Monitor Applications and End User Experiences

In this video, see how Skylar One helps you understand the impact of changes on application performance and the end-user experience. By tracking service-level metrics across an e-commerce environment, you can quickly identify when performance degrades and how it affects user behavior. With Skylar One, teams can quickly connect performance changes to real user impact, helping ensure a consistent and reliable digital experience.

Enhancing Your Search Skills with Liang Chen

What does it take to reinvent network visibility from the ground up? In this episode of Next-Gen Network Heroes, Bob sits down with Liang Chen, Senior Network Architect at Texas Children’s Hospital and creator of a next-generation network traffic analyzer built for real-time, packet-level visibility. Liang shares how he built a platform capable of analyzing traffic at up to 200 Gbps with zero packet loss, unlocking deeper network forensics and faster troubleshooting in mission-critical environments.

True Visibility: How Liang Chen is Rethinking Network Monitoring

What happens when deep networking expertise meets low-level programming and a passion for invention? In this episode of Next-Gen Network Heroes, host Bob Slevin sits down with Liang Chen, Senior Network Architect at Texas Children's Hospital and a true innovator in network performance and visibility. With more than 25 years of experience in networking, plus advanced expertise in programming languages like C and Assembly, Liang has built his own next-generation traffic analysis platform from the ground up—designed to provide real-time, packet-level visibility at massive scale.