Operations | Monitoring | ITSM | DevOps | Cloud

Taking AI Apps From Prototype to Production

At this year’s AWS Summit in New York, agentic AI took center stage with Amazon’s launch of Bedrock AgentCore — a powerful step toward turning AI prototypes into scalable, production-ready applications. From low-code workflows to turnkey infrastructure, a new generation of tools is enabling teams of all skill levels to build, deploy, and monitor AI agents faster than ever.

Real-Time Alerting for AI-Optimized Data Centers

Kentik transforms real-time network telemetry into actionable alerts for AI-optimized data centers. By converting database queries into custom alerts, engineers can detect issues like elephant flows, idle links, and packet loss before performance suffers and triggers alerts in systems like ServiceNow or PagerDuty.

Identifying Idle Paths in a Data Center Leaf-Spine Fabric

In a perfect leaf-spine network, traffic evenly spreads across all links. But reality is often different, leaving costly, idle paths hidden in your data center fabric. Kentik's Phil Gervasi demonstrates how Kentik's network intelligence platform helps engineers quickly identify and address these underutilized paths. With powerful visualizations, detailed telemetry analysis, and customizable alerts integrated into your ticketing systems, Kentik makes it easy to spot persistent traffic imbalances, troubleshoot ECMP issues, and optimize your infrastructure.

Elephant Flows: The Hidden Heavyweights of AI Data Center Networks

Elephant flows are no longer rare. They’re foundational to AI workloads. In today’s GPU-heavy data centers, long-lived, high-volume flows can distort ECMP, overflow buffers, and rack up unexpected cloud bills. Kentik helps you see and tame these elephants with real-time flow analytics, automated alerting, and predictive capacity planning.

Introducing Cause Analysis: Instant Triage for Traffic Changes with Kentik AI

Introducing Cause Analysis from Kentik, designed to simplify network traffic analysis and rapidly identify the root cause of issues. Learn how this exciting new feature streamlines troubleshooting, makes complex insights accessible, and boosts team efficiency for all users.

AutoCon3: Network Automation's Premier Conference

AutoCon3 in Prague offered important takeaways on network automation’s evolution, from hands-on learning and design principles to the impact of AI and the power of community. Read Justin Ryburn’s recap to learn about key insights from the event, showing why network automation is now a core competency you’ll want to understand.