Operations | Monitoring | ITSM | DevOps | Cloud

November 2021

Observability and SaaS Providers

SaaS is exploding and so it should; it takes commoditized work and infrastructure away from tech teams so that they can focus on differentiating features. But what happens when it goes wrong? How do SaaS platforms make sure they aren't letting their customers down and in turn, letting their customers down? Observability, bolstered with AI gives all the partners the best chance to optimize availability and customer experience. Here's how.

User experience is a focus of Sumo Logic Observability innovations

Technology environments are rapidly evolving as organizations look to remain competitive, accelerate innovation and make themselves more agile. But in the process, many of the observers, i.e., stakeholders who track infrastructure and application metrics, are falling behind, unable to monitor and manage modern, cloud-native apps and multi-cloud environments due to the complexity that comes with them.

OpenTelemetry Browser Instrumentation

One of the most common questions we get at Honeycomb is “What insights can you get in the browser?” Browser-based code has become orders of magnitude more complex than it used to be. There are many different patterns, and, with the rise of Single Page App frameworks, a lot of the code that is traditionally done in a backend or middle layer is now being pushed up to the browser. Instead, the questions should be: What insights do frontend engineers want?

Dashbird explained

Dashbird is an observability, debugging, and intelligence platform designed specifically to help serverless developers build, operate, improve, and scale their modern cloud applications on AWS environment fast, securely, and with ease. It’s free to use for up to 1M invocations and doesn’t require any code changes. Dashbird fills the gaps left by CloudWatch and other traditional monitoring tools by offering enhanced out-of-the-box monitoring, operations, and actionable insights tools for architectural improvements, all in one place.

Kubernetes Master Class Security & Observability feat. Tigera

In this RKE-focused workshop for networking, security, and observability on containers, Kubernetes, and Calico, you will work with a Calico and RKE expert to learn how to design, deploy, and observe security and networking policies in an RKE environment. This 90-minute hands-on lab comes with your own provisioned Calico Cloud environment, designed to provide more complete knowledge on how to implement:– Workload access controls– Compliance and reporting– Run-time visualization of traffic flow and security policies– Service-level observability– Anomaly detection and live troubleshooting You will come away from this workshop with an understanding of how others in your industry are doing Kubernetes, container and cloud security, and observability in RKE, and how you can implement it in your own organization.

How AWS & xMatters Drive Monitoring and Observability Forward - xMatters Demo

Join Tiberiu Oprisiu, Solution Architect at AWS, Eric Maxwell, Solution Architect at xMatters, and Rutuja Rajwade, Partner Marketing Manager at xMatters, as they highlight and demo the benefits that come from pairing AWS with xMatters. Learn from Tiberiu which business imperatives drive observability, and what, why, and how AWS can do just this. And, stick around to see Eric dive deep into matters Flow Designer to see how these workflows can be set up with ease!

Save 85% in Time Spent on Root Cause Analysis with Topology-based Observability | StackState demo

During the 2021 Gartner IT Infrastructure, Operations and Cloud Strategies conference, our senior solution engineer Mark Arts showed you how you can use topology-based observability - based on our 4T Data Model - to save 85% in time spent on root cause analysis. Useful links.

Observability vs. monitoring debate: An irreverent view

In the past few years, the word “observability” has steadily gained traction in the discussions around monitoring, DevOps, and, especially, cloud-native computing. However, there is significant confusion about the overlap or difference between observability and monitoring.

How Snyk, TripAdvisor, and Citibank use Grafana to effectively scale observability

It’s one thing to set up an observability strategy. But what’s it like to introduce and scale observability effectively across an organization? In a wide-ranging conversation at ObservabilityCON 2021, three technical pros from Snyk, TripAdvisor, and Citibank joined Grafana Labs VP Global Solutions Engineering Steve Mayzak and — with more than 75 years experience between them — they shared the triumphs and turbulence in their respective observability journeys.

How to Make Splunk Run 100x Faster With Cribl LogStream

Enterprises leveraging Splunk for data ingestion and analytics need an observability solution that scales well with their business requirements and provides a cost-effective way to retain data long-term. Cribl LogStream is an essential part of observability, providing a pipeline that works with all tooling, keeps costs down, and scales with any business – making it the perfect complement to Splunk.

Ask Miss O11y: Mapping Out Your Observability Journey

Dear Trapped, Thanks for asking the question! Approaching observability as an all-or-nothing problem often leads to the project feeling daunting. But that’s not specific to observability—any project can be overwhelming if you think it needs to be done all at once, perfectly. Such as, erm, writing an entire book on observability! *looks around worriedly*

Logz.io Anomaly Detection: Shedding Light on "Unknown Unknowns"

Moving beyond traditional monitoring to embrace full stack observability offers a seemingly endless range of benefits. Beyond unifying logs, metrics, and traces in a single platform, the opportunity to enlist advanced analytics and engage a more predictive approach represents another huge step forward.

Monitoring & Observability for Sales, Marketing and Business ops teams with StackMoxie and PagerDuty

Before Stack Moxie, every business ops team needed PagerDuty, but finding and pushing errors was a manual process. With Stack Moxie + PagerDuty, every business op professional can manage their sales, marketing, HR or customer success stack with the same quality engineers bring to code.

Implementing SLAs, SLIs, and SLOs in an observability suite

Implementing SLAs, SLIs, and SLOs in an observability suite is now business-critical. Over time, a company’s decision-makers can add a burdensome number of KPIs that force servers and other IT assets to devote excessive processing time to business intelligence. Eventually, the burden becomes so great that employees, managers, and executives start to complain about the system’s sluggishness. Developers know that they need to strike a balance between business needs and IT processes.

Observability - An Ultimate Guide

A developers perspective is different. While managing various sectors in a software, sometimes it would be difficult to monitor the activities and identify the bug that is disrupting the functions. What if you can spot the error beforehand, and resolve it at the earliest? The strategies that we focus on, and implement are the ones that help us effectively manage our tasks. That is possible by knowing about Observability. Let's learn in detail about it through this blog. TABLE OF CONTENTS.

Serverless Observability: It's easier than you think!

Observability is a measure of how well the internal state of a system can be inferred from its external outputs. It helps us understand what is happening in our application and troubleshoot problems when they arise. It’s an essential part of running production workloads and providing a reliable service that attracts and retains satisfied customers.

Observability into Your FinOps: Taking Distributed Tracing Beyond Monitoring

Distributed tracing has been growing in popularity as a primary tool for investigating performance issues in microservices systems. Our recent DevOps Pulse survey shows a 38% increase year-over-year in organizations’ tracing use. Furthermore, 64% of those respondents who are not yet using tracing indicated plans to adopt it in the next two years. However, many organizations have yet to realize just how much potential distributed tracing holds.

Incident Resolution: Do You Remember, the Twenty Fires of September?

From September to early October, Honeycomb declared five public incidents. Internally, the whole month was part of a broader operational burden, where over 20 different issues interrupted normal work. A fraction of them had noticeable public impact, but most of the operational work was invisible. Because we’re all about helping everyone learn from our experiences, we decided to share the behind-the-scenes look of what happened.

Introducing Grafana Enterprise Traces, joining metrics and logs in the Grafana Enterprise Stack observability solution

Today, we are launching a new Grafana Labs product, Grafana Enterprise Traces. Powered by Grafana Tempo, our open source distributed tracing backend,.and built by the maintainers of the project, this offering is an exciting addition to our growing self-managed observability stack tailored for enterprises.

What's Wrong With Observability Pricing?

There’s something wrong with the pricing of observability services. Not just because it costs a lot – it certainly does – but also because it’s almost impossible to discern, in many cases, exactly how the costs are calculated. The service itself, the number of users, the number of sources, the analytics, the retention period, and extended data retention, and the engineers on staff who maintain the whole system are all relevant factors that feed into the final expense.

Game Launches Should Be Exciting for Your Players, Not for Your LiveOps Team

The moment of launching something new at a game studio (titles, experiences, features, subscriptions) is a blockbuster moment that hangs in the balance. The architecture—distributed and complex, designed by a multitude of teams, to be played across a variety of devices in every corner of the world—is about to meet a frenzy of audience anticipation, along with the sky-high expectations of players, executives, and investors.

Observability in Practice

After years of helping developers monitor and debug their production systems, we couldn’t help but notice a pattern across many of them: they roughly know that metrics and traces should help them get the answers they need, but they are unfamiliar with how metrics and traces work, and how they fit into the bigger observability world. This post is an introduction to how we see observability in practice, and a loose roadmap for exploring observability concepts in the posts to come.

Mario vs. Steve: What Video Games Can Teach Us about Monitoring vs. Observability

Credit: Unsplash What is monitoring? What is observability? Monitoring shows you how a Kubernetes environment and all of its layers are operating. Observability, on the other hand, is a measure of how well internal states of a system can be inferred from knowledge of its external outputs.

Sponsored Post

Hybrid Multi-Cloud Demands Holistic Observability

As I said before, Speed is King. Business requirements for applications and architecture change all the time, driven by changes in customer needs, competition, and innovation and this only seems to be accelerating. Application developers must not be the blocker to business. We need business changes at the speed of life, not at the speed of software development.

Open Source for Better Observability

Monitoring cloud-native systems is hard. You’ve got highly distributed apps spanning tens and hundreds of nodes, services and instances. You’ve got additional layers and dimensions—not just bare metal and OS, but also node, pod, namespace, deployment version, Kubernetes’ control plane and more. To make things more interesting, any typical system these days uses many third-party frameworks, whether open source or cloud services.

How Secure Tenancy Keeps Your Secrets Secret

The best way to be sure that you keep a secret is not to know it in the first place. Managing secrets is a notoriously difficult engineering problem. Across our industry, secrets are stored in a bewildering variety of secure (and sometimes notoriously insecure) systems of varying complexity. Engineers are often trying to balance the least worst set of tradeoffs. At Honeycomb, we asked: What if we didn’t need to know your secrets to begin with?

New: Optimize Slow Queries with Enhanced Database Visibility in Splunk Observability

Databases have always been the backbone of applications – both web and enterprise. Now, more than ever before, you need to know not just overall statistics about your database, but you must identify how database performance interacts with the network, operating system, servers, configuration, and even third party dependencies.

Introducing Log Observability for Microservices

Two popular deployment architectures exist in software: the out-of-favor monolithic architecture and the newly popular microservices architecture. Monolithic architectures were quite popular in the past, with almost all companies adopting them. As time went on, the drawbacks of these systems drove companies to rework entire systems to use microservices instead.

Observable Web Applications

Users don’t see your distributed services, cloud architecture, or instrumentation—they only see how the web app is working. Understanding their experience in the client-side is the first step towards understanding the rest of the system. We’ll explore how to make your client-side applications more observable through error tracking, web performance, and usage analytics. With better understanding of real-user experience, you’ll better understand the real behavior of your systems.

How Freshly is Scaling Business Metrics Observability with AI

Anodot recently took part in the 2021 Data Agility Day, an event dedicated to examining how organizations are extracting value from data. CEO and Co-Founder David Drai was joined by David Ashirov, VP of Data at Freshly, where he has worked to build a data stack that departments across the company could leverage to drive business. Ashirov is a senior executive with two decades of experience in data engineering, business intelligence, and marketing.