Operations | Monitoring | ITSM | DevOps | Cloud

Observability

The latest News and Information on Observabilty for complex systems and related technologies.

Incident Resolution: Do You Remember, the Twenty Fires of September?

From September to early October, Honeycomb declared five public incidents. Internally, the whole month was part of a broader operational burden, where over 20 different issues interrupted normal work. A fraction of them had noticeable public impact, but most of the operational work was invisible. Because we’re all about helping everyone learn from our experiences, we decided to share the behind-the-scenes look of what happened.

Introducing Grafana Enterprise Traces, joining metrics and logs in the Grafana Enterprise Stack observability solution

Today, we are launching a new Grafana Labs product, Grafana Enterprise Traces. Powered by Grafana Tempo, our open source distributed tracing backend,.and built by the maintainers of the project, this offering is an exciting addition to our growing self-managed observability stack tailored for enterprises.

Observability for Banks and Financial Services Organizations: Top 4 Challenges and How Monitoring Can Help

The Covid-19 pandemic has disrupted how most businesses operate. In fact, 87% of global IT decision makers agree that the Covid-19 pandemic will cause organizations to accelerate their migration to the cloud. While digital transformation allows for a more agile online customer experience is a focus for every business, it’s an even bigger priority for banking and FSI organizations because they previously relied on in-person trade and office-based operations.

What's Wrong With Observability Pricing?

There’s something wrong with the pricing of observability services. Not just because it costs a lot – it certainly does – but also because it’s almost impossible to discern, in many cases, exactly how the costs are calculated. The service itself, the number of users, the number of sources, the analytics, the retention period, and extended data retention, and the engineers on staff who maintain the whole system are all relevant factors that feed into the final expense.

Game Launches Should Be Exciting for Your Players, Not for Your LiveOps Team

The moment of launching something new at a game studio (titles, experiences, features, subscriptions) is a blockbuster moment that hangs in the balance. The architecture—distributed and complex, designed by a multitude of teams, to be played across a variety of devices in every corner of the world—is about to meet a frenzy of audience anticipation, along with the sky-high expectations of players, executives, and investors.

Observability in Practice

After years of helping developers monitor and debug their production systems, we couldn’t help but notice a pattern across many of them: they roughly know that metrics and traces should help them get the answers they need, but they are unfamiliar with how metrics and traces work, and how they fit into the bigger observability world. This post is an introduction to how we see observability in practice, and a loose roadmap for exploring observability concepts in the posts to come.

Mario vs. Steve: What Video Games Can Teach Us about Monitoring vs. Observability

Credit: Unsplash What is monitoring? What is observability? Monitoring shows you how a Kubernetes environment and all of its layers are operating. Observability, on the other hand, is a measure of how well internal states of a system can be inferred from knowledge of its external outputs.