Operations | Monitoring | ITSM | DevOps | Cloud

January 2022

What is Network Observability and Why Should You Care

If you haven’t heard of network observability yet, you will very soon. And you’ll be hearing it a lot. Some say it is just marketing hype. Some say networks have always been observable. But network observability is the most important new concept to hit the network performance monitoring space in years. Join Kevin Woods, Kentik director of product marketing, to learn.

From Kálmán to Kubernetes: A History of Observability in IT

You know that observability plays a crucial role in helping to manage today’s distributed, cloud-native, microservices-based applications. But you may be surprised to learn that – despite its close association with modern applications – observability as a concept was born more than a half-century ago. Its origins stretch all the way back to the late 1950s, long before anyone was talking about microservices and the cloud.

Model driven observability with Prometheus, Alertmanager, Grafana and Loki

The end-to-end monitoring of complex software systems is difficult, toil-intensive and error-prone. Developers, SREs and Platform teams must continuously invest effort in setting up and maintaining the monitoring setups that underpin the observability of their systems, or accept the risk of being unaware of ongoing issues and their impact on end users. Enter model-driven observability powered by Juju!

Techstrong Predict 2022: Future of Observability

In a customer-centric world, observability is mission-critical for delivering great digital experiences. Join the "The Future of Observability" panel discussion to learn: What drivers create the need for observability What steps you should take to reach your observability goals What elements are necessary for an effective observability strategy What the future of observability looks like Panelist: Mitch Ashley - Techstrong Research Lodewijk Bogaards - StackState Brian Dawson - Dawson and Dawson, Inc. Cyrille Le Clerc - Elastic

What is Observability? Benefits, Use Cases & More

The year is over, and the word ‘Observability’ has been one of the buzzwords that kept everyone checking throughout the year for deserving reasons. The organizations do not want to leave any stone unturned to maintain performance and offer robust services from ‘monitoring’ practices to ‘observability’, ‘telemetry’, and visibility capacities. So let’s get into the meaning of each term and understand how they are vital for business growth.

The Five Tenets of Observability

A new year is a chance to have a new start, and one thing that it’s a great opportunity to think about is the monitoring and observability platform you’re using for your applications. If you’ve been using a legacy monitoring system, you’ve probably heard about observability all over the ‘net and want to figure out if this is really something you need to care about.

Make the most of your observability data with the Data Volume app

As a DevOps, SecOps, or IT operations manager, you're surrounded by all the technology for the systems running the entire organization. This means legacy infrastructure, multi-cloud environments, services, tools, and applications. All of these components generate data—a huge amount of data—some of which you need to leverage for full-stack observability to ensure those systems supporting the business are running efficiently.

Tanzu Observability Brings Full-Stack Monitoring of Kubernetes Clusters for OpenShift

Red Hat OpenShift is an enterprise Kubernetes platform that provides users with a unified cloud experience wherever it’s deployed. VMware Tanzu Observability by Wavefront offers observability and analytics for multi-cloud Kubernetes environments. Now these two products work even better together.

Observability Pipelines for Dummies

How do you get the data out of your infrastructure and applications in order to properly observe, monitor, and secure their running states while minimizing overlap, wasted resources, and cost? Many business folks need a broad category of tools in all their environments to solve challenges such as up and down monitoring, metrics, a time series database (TSDB), log analytics, event streaming, security information and event management (SIEM), user behavior analytics (UBA), and data lakes. The answer to the proposed question to solve these hurdles is using an observability pipeline.

How Reliability and Product Teams Collaborate at Booking.com

With more than 1.5M room nights booked per day, Booking.com requires a solid infrastructure that’s constantly monitored. And indeed, Booking.com now has a footprint of 50,000+ physical servers running across four data centers and six additional points of presence. The sheer size of this server fleet makes it viable for Booking.com to have dedicated teams specializing into looking only at the reliability of those servers.

The Observability Pipeline

Today’s systems are more distributed, dynamic, and complex than ever before – plus, users have more expectations. Also, the historical reliance on an operations team to monitor, triage, and/or resolve issues has become untenable as the number of services increased. This means that many of the tools that were well-suited before might no longer be adequate.

Ask Miss O11y: Long-Running Requests

You need not fear a long-lived streaming workload. A few simple tricks can transform a request that may not ever terminate for hours or days into something you can get regular health and status updates on. We in fact have one of those continuous processing services—Beagle, our Service Level Objective stream processor—which we’ve instrumented in this fashion.

The Business Case for Observability and Site Reliability Engineering

Unlike traditional IT Ops, the role of the SRE isn’t simply focused on finding and solving technical problems. The big win for today’s SREs is supporting the organization’s strategic innovation initiatives. With the appropriate observability capabilities, it’s possible to quantify the value that software infrastructure contributes to this innovation effort.

Wisdom of the Crowds: The Value of User Sentiment Observability

What’s the first thing most people do when they’re unhappy with a business? Take to social media to complain about it. Observing those comments – otherwise known as “user sentiment observability” – gives you a head’s up as to when problems become big enough to impact user experience. How can you monitor that voice of the customer? And why is it important to do so? Let’s take a deeper look at the issues.

ICYMI: Honeycomb Developer Week: The Partner Ecosystem

We know that you value collaboration. That’s why we share incident reviews and learnings—because we believe the entire community benefits by working together transparently. In the spirit of working better together, we invited ecosystem partners from ApolloGraph, Cloudflare, LaunchDarkly, and PagerDuty to present at Honeycomb Developer Week, a three-day event filled with snackable, time-efficient learning sessions to help you uplevel your observability skills.

Operations Analytics - The Next Big Thing

Cloud Data Warehouses (CDW) were designed to support business intelligence use cases focused on historical data analysis, but less so on “what is happening now?” class of queries. We think operational analytics are the next big focus and we want to discuss the space and how enterprises will connect their operational data to these new tools to get results right now instead of next week.

The Importance of Network Insights in Achieving Full End-To-End Observability

When we talk about observability, we tend to focus first and foremost on the metrics, logs, and traces that you can collect from applications – such as request rates, error rates, and request duration. Infrastructure-level metrics, like CPU and memory utilization, might factor into the discussion as well. Here’s a third category of critical observability insights that teams tend to overlook: the network.

Why cloud native requires a holistic approach to security and observability

Like any great technology, the interest in and adoption of Kubernetes (an excellent way to orchestrate your workloads, by the way) took off as cloud native and containerization grew in popularity. With that came a lot of confusion. Everyone was using Kubernetes to move their workloads, but as they went through their journey to deployment, they weren’t thinking about security until they got to production.

The Importance of Observability for the SRE

The term Site Reliability Engineer (SRE) first appeared in Google in the early 2000s. In Google’s 2016 SRE Book, Benjamin Treynor Sloss wrote that, generally speaking, “an SRE team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s).” This means that the SRE teams at Google decide how a system should run in production as well as how to make it run that way.

Observability - Software & Tools.

A developer's viewpoint is distinct. It can be difficult to keep track of operations and detect the fault that is causing the software to malfunction when handling numerous sectors. What if you could detect the issue ahead of time and fix it as soon as possible? The tactics that we concentrate on and put into action are those that assist us in properly managing our tasks. Knowing about observability makes this possible. Let's take a closer look at it in this blog.

Accelerating software delivery through observability at two very different organizations

Delivering value to customers quickly and efficiently is critical to the success of modern businesses. Understanding the process and timeframe during which an organization generates new ideas and then designs, develops and deploys them is vital to success. At the recent Illuminate User Conference, Drew Horn, Director of Business Development, held a discussion with Clara Ko, Director of Engineering at Sauce Labs, and Bryan Veselka, Director and Product Owner for Cloud and Automation at Vizient.

Ask Miss O11y: How Can I Add o11y to Databases?

Oh goody, I’m so tickled to get this one. *rubs hands gleefully* Funny story, back in 2016–2017 we thought we were building Honeycomb primarily for DB use cases. The use cases are that killer. I’ve never seen another tool do the kinds of things you can do on the fly with Honeycomb and databases.

Don't Settle for Observability. Strive for Actionability

You’ve heard of observability, which has fast become one of the IT industry’s buzzwords du jour. But what about actionability, or the ability to translate observability into meaningful action? The latter term may not be a trending buzzword (not yet) – indeed, “actionability” perhaps sounds almost boring – but it’s just as essential as observability in managing complex, cloud-native environments.

Sponsored Post

Speedscale Launches CLI: Free API Observability Tool

We are excited to announce the launch of Speedscale CLI, a free observability tool that inspects, detects and maps API calls on local applications or containers. The offering underscores the importance of continued and proactive API testing to quickly detect and debug defects within a shifting array of upstream and downstream interdependencies.

Detecting Log4J/Log4Shell exploits with LogStream

Shortly before the December holidays, a vulnerability in the ubiquitous Log4J library arrived like the Grinch, Scrooge, and Krampus rolled into one monstrous bundle of Christmas misery. Log4J maintainers went to work patching the exploit, and security teams scrambled to protect millions of exposed applications before they got owned. At Cribl, we put together multiple resources to help security teams detect and prevent the Log4J vulnerability using LogStream.

Why "AIOps vs. Observability" Is a False Dilemma

What comes first – observability or AIOps? Can you achieve observability without AIOps? Do you need AIOps if you already have an observability solution in place? These are all questions that any team considering AIOps will want to answer in order to determine the real-world value that AIOps tools stand to offer.

Exoprise 2021 Year in Review

Happy New Year 2022! In 2021, Exoprise’s critical focus was on improving its product for monitoring digital experiences and mobilizing internal teams to improve customer adoption and SaaS/network experiences everywhere. As Covid continues to dominate the world, IT and business teams are increasingly looking for solutions like Exoprise Digital Experience Monitoring (DEM) to ensure end-users are productive with a seamless work-from-home experience.

Why Intuitive Troubleshooting Has Stopped Working for You

It’s harder to understand and operate production systems in 2021 than it was in 2001. Why is that? Shouldn’t we have gotten better at this in the past two decades? There are valid reasons why it’s harder: The architecture of our systems has gotten a lot more sophisticated and complex over the past 20 years. We’re not running monoliths on a few beefy servers these days.

Big Data Observability and Continuous Tuning at Scale

Increasingly, many organizations find that their current legacy monitoring solutions are no longer adequate in today's modern IT world. These enterprises find themselves struggling to manage and understand unprecedented amounts of data. With such large amounts of data needing to be dealt with, it is no wonder why it's a struggle for enterprises to leverage it for business success. Not to mention that optimizing performance and keeping costs in line is a technical challenge they must face at the same time.