Operations | Monitoring | ITSM | DevOps | Cloud

Introducing Kentik Traffic Costs: Real-Time Network Cost Intelligence

Introducing Kentik Traffic Costs, an industry-first automated workflow delivering instant cost estimates for network traffic slices. Learn how this exciting new feature gives network, financial, and sales teams actionable insights to optimize spend, improve margins, and drive revenue.

Kentik Traffic Costs Workflow Demo

Learn how Kentik's automated traffic cost workflow provides instant visibility into network traffic costs, enabling you to optimize spend, improve margins, and make smarter business decisions. In this demo, you'll see practical examples like evaluating costs by AS group and downstream customer, helping network, finance, and commercial teams take immediate, actionable steps to reduce costs and boost efficiency.

(ServiceNow + Kentik) From Reactive to Proactive: The Rise of Agentic Networks

Agentic AI is not just hype—it’s a force multiplier that enables infrastructure and operations teams to do more, with less effort, in less time. Importantly, it helps IT teams compress time to resolution and even proactively detect and respond to issues, before they escalate.

Real-time Alerting for Data Center Networks

Kentik’s Phil Gervasi shows how modern data centers—especially those powering AI workloads—can spot and fix problems before they impact performance or budgets. See how Kentik’s Data Explorer helps you identify disruptive flows, reclaim wasted network capacity, and turn insights into real-time alerts. With monitor-only mode and integrations with systems like PagerDuty and ServiceNow, your network becomes its own early warning system—driving uptime, cost savings, and better AI performance.

Why (Enriched) Flow Data Belongs in Every Network Operator's Daily Toolbox

Flow data has always held immense potential, but was often inaccessible because it lacked context and speed. Kentik removes that friction by automatically enriching flow with human-readable context, making it a daily driver for everyone, not just specialists.

The Starlink Outage and Its Impact on Community Gateways

Last month, Starlink suffered its largest outage in years, arguably its biggest since becoming a major internet provider. In addition to the millions of individual customers around the world, the outage disconnected the Community Gateways, customers of Starlink’s new transit service. In this post, we delve into the outage and its impact on these far-flung networks.

Data Center VXLAN Overlay Visibility at Scale

VXLAN overlays bring flexibility to modern data centers, but they also hide what operators most need to see: true host-to-host and service-to-service traffic. Kentik restores that visibility by decoding VXLAN from sFlow, exposing both overlay endpoints and underlay paths in a single view without the cost and complexity of pervasive packet capture — the result: faster troubleshooting, smarter capacity planning, and confident operations at scale.

Cloudflare's DNS Downtime: Why BGP Hijacks Were Never to Blame

On July 14, Cloudflare’s popular public DNS service (known as 1.1.1.1) suffered an outage lasting over two hours. As rumors swirled about the cause, we were the first to push back on the theory that a BGP hijack had caused the outage. In fact, the hijack was actually a consequence. How did we know this so early when other internet watchers did not? We’ll discuss in this post.

Kentik Cause Analysis in 60 Seconds

In a world where network traffic can suddenly spike, manually sifting through flow data is often a daunting task. Kentik AI's new Cause Analysis simplifies troubleshooting by quickly identifying changes in traffic by application, IP, ASN, or service. With just a few clicks, Cause Analysis helps you compare time periods, understand traffic shifts, and detect changes in your network. Kentik: Take the hard work out of running your network.

The Network Impact on Job Completion Time in AI Model Training

In large-scale AI model training, network performance is no longer a supporting actor — it’s center stage. Job Completion Time (JCT), the key metric for measuring training efficiency, is heavily influenced by the network interconnecting thousands of GPUs. In this post, learn why JCT matters, how microbursts and GPU synchronization delays inflate it, and how platforms like Kentik give network engineers the visibility and intelligence they need to keep training jobs on schedule.