Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Network Documentation Best Practices: What to Create & Why

Everybody agrees network documentation is extremely important, but there tends not to be a lot of agreement on what that documentation should include. The short answer is that it should include everything that’s relevant—but what that means varies between networks. For example, in a really small network with one switch and a firewall, and perhaps a single wireless access point, there isn’t much to document. It might be enough to put everything in a single diagram.

"Managing OpenTelemetry Through the OpAMP Protocol" by Mike Kelly, observIQ

Managing thousands of data collection Agents across just as many servers can overwhelm DevOps teams. Open Agent Management Protocol (OpAMP) is a new network protocol from the OpenTelemetry Project that enables remote management of OpenTelemetry collectors, allowing them to report their status to and receive configuration from a Server and to receive agent package updates from the server. This eliminates the need to create new custom distributions and redeploy, drastically simplifying Agent management.

Diagnose Any Microsoft Teams Problem in 3 Clicks or Less!

In this video, we demonstrate how to diagnose Microsoft Teams problems with hop-by-hop analysis and insights in real time - All in three clicks or less. Whether your users work from home, the office, or anywhere in between, a superb call-quality experience is a must. How can IT operations staff ensure that? Using a combination of synthetics and real user monitoring (RUM), support teams can now get comprehensive visibility into Teams performance and use those insights for optimization.

The 3 Keys To A Successful Microsoft Teams Meeting Room Strategy

A Microsoft Teams rooms strategy might seem like more effort than it’s worth on the surface, but you’d be surprised how much impact it can make. With many businesses now mixing in-person and remote Teams meeting participation, planning and optimizing how it works just makes sense.

Touching Grass With SLOs

One of the things that struck me upon joining Honeycomb was the seemingly laissez-faire approach we took towards internal SLOs. From my own research (beginning with the classic SRE book, following Google’s example), I came to these conclusions: If you read the original SRE book when it was released, before the workbook came out, these conclusions all made sense.

Grafana Agent 0.29.0 release: New OpenTelemetry components

Today the Grafana Agent team is excited to announce the release of Grafana Agent v0.29.0. This September, we introduced a new way to easily run and configure Grafana Agent called Grafana Agent Flow, our new dynamic configuration runtime built on components. Within Flow, we are also embracing Grafana Labs’ big tent philosophy by introducing OpenTelemetry (OTel) Collector components and converters for traces, metrics, and logs in Agent v0.29.0.

Experience at 35,000 Feet w/ Derek Whisenhunt (Southwest Airlines)

This week we bring you another special “live from the road” episode of the DEX Show – as we sat down with Southwest Airlines’ Derek Whisenhunt ahead of his amazing talk at Experience Everywhere in New York City! If you’ve ever wondered what separates a best-in-class airline from the rest of the pack, this episode’s for you.

Cribl's Fall Launch: Beyond the Pipeline

What's new in Cribl's Fall release? Stream 4.0: A UX refresh, new DB collector, and a Pipeline profiling capability for better visibility and reduced time to resolution. Cribl.Cloud 4.0: BYO IdP, cloud-hosted queueing for sources and destinations, and the ability to purchase a Cribl.Cloud subscription directly from the AWS Marketplace. Edge 4.0: The addition of fleet management, AppScope Edge integration, enhanced Kubernetes support, and the power to handle up to 15k Edge nodes for even more visibility, at scale.