Operations | Monitoring | ITSM | DevOps | Cloud

July 2021

Remote Debugging & Device Observability: How Memfault & Diamond Kinetics fix firmware bugs together

With their original products in the market, Diamond Kinetics relied on customer reports with little detail to fix issues in the field. They soon realized that this process of “psychic debugging” sensors in the wild was not enough and even delayed release dates as they tried perfecting each device before shipment. When they decided to launch a new generation of their product in 2020, they knew they needed a more efficient process.

Tale of the Beagle (Or It Doesn't Scale-Except When It Does)

If there’s one thing folks working in internet services love saying, it’s: "Yeah, sure, but that won’t scale." It’s an easy complaint to make, but in this post, we’ll walk through building a service using an approach that doesn’t scale in order to learn more about the problem. (And in the process, discovering that it actually did scale much longer than one would expect.)

PD Summit21: Transforming Infrastructure Teams Through Observability

What is this ""observability"" thing that everyone is talking about? Observability allows you to navigate the dark unknowns with echolocation while others attempt to fly blindly without it. Are your dashboards all green, but you still have an issue brewing? Do you need instant feedback based on the Core Analysis loop? Are your engineers tired of waking up at 3 AM for the expected issues? Is there a lack of time for experimentation? Generate your own answers and create a meaningful course of action with observability.

Detect any issue with Splunk APM before it turns into a customer problem

With 100% of spans and traces captured, Splunk APM meets any necessary business KPI’s and SLO metrics while investigating and troubleshooting transaction errors related to a backend application. Easily construct error budgets that measure performance of services today - learn how with this free trial Splunk Observability Cloud.

PD Summit21: MUX: Video Observability: Operational Alerting for Responding to Issues In Real-time

Streaming video accounts for the majority of internet traffic and your applications and infrastructure almost certainly include video. Mux Data allows you to easily monitor the real-time quality of experience delivered to your video viewers and integrating with PagerDuty you can automate a response and reduce the time to resolution when something goes wrong. We will cover the basics of video monitoring and how integrating with PagerDuty can ensure a great experience for viewers.

How Vanguard used Observability to Accelerate and De-risk their Cloud Migration

Rich Anakor, chief solutions architect at Vanguard, is on a small team with a big goal: Give Vanguard customers a better experience by enabling internal engineering teams to better understand their massively complex production environment—and to do that quickly across the entire organization, in the notoriously slow-moving financial services industry. They also had a big problem: The production environment itself.

What's the Difference between Observability and Monitoring?

Wondering what the difference is between observability and monitoring? In this post, we explain how they are related, why they are important, and some suggested tools that can help. The difference between observability and monitoring is that observability is the ability to understand a system’s state from its outputs, often referred to as understanding the “unknown unknowns”.

Five Problems Your Current Network Monitoring Can't Solve but Network Observability Can

Public and hybrid cloud has led to a new era of agility, scale and performance, particularly for the networking that underlies enterprise applications. Yet, more than 80% say their network monitoring hasn’t kept up with major problems that need to be solved. A new approach is required – it’s network observability. Join Kentik co-founder and CEO Avi Freedman as he discusses how to reduce networking issues and risks while continuing to allow your organization to innovate at the speed of cloud.

SRE's Guide to Chaos & Observability

Today’s distributed, cloud-based environments are incredibly complex. Not only does each component depend on many others, but modern systems are also highly dynamic—changing frequently as teams push new code or make updates to infrastructure. Taming this complexity to ensure reliability requires end-to-end observability to understand how components depend on each other. Additionally, proactive Chaos Engineering combined with AI-driven observability lets you uncover “unknown unknowns” that impact how your system will respond to different failure scenarios.

Observability with Zero Code Instrumentation? Meet eBPF

Current observability practice is largely based on manual instrumentation, which requires adding code in relevant points in the user’s business logic code to generate telemetry data. This can become quite burdensome and create a barrier to entry for many wishing to implement observability in their environment. This is especially true in Kubernetes environments and microservices architecture.

Improving Our Typography to Optimize the Honeycomb User Experience

This is the second post in our series about Lattice, Honeycomb’s new design system and how we’re applying a user-centric design philosophy to our product. Lattice begin! At Honeycomb, we understand that our users are often under a great deal of pressure when troubleshooting complicated issues in their applications.

Coffee Break Webinar Series: "Intelligent Observability - What the Analysts Say"

We know commitment issues are the real deal, especially when it comes to significant and costly tech investments. Understanding how the market is performing and what’s up ahead is critical for investing in AIOps. Our crew is here to help you through the challenging decision-making days and offer up the best analyst guidance.

The State of Observability 2021: Mature Teams Ship Better Code Faster and You Can Too

The 2021 Observability Maturity Community Research report is the first year-over-year look at the observability landscape and how practices are evolving. And they are 3X more likely to deliver high customer satisfaction. Which practices make all the difference when it comes to advancing the impact of your observability practice? Honeycomb's VP of Engineering, Emily Nakashima leads a discussion with Redmonk Co-founder James Governor, Honeycomb CTO and Co-founder Charity Majors, and Eaze's Sr. Software Engineer, Joe Thackery.

How to Optimize Your Cloud Spend Using Observability

The rise of public cloud services has enabled businesses to innovate faster, scale effortlessly, and adopt more advanced technologies easier than ever before. However, there’s a dark side to using public cloud services: complexity and cost. Public cloud services can scale to handle almost any workload, but in doing so, they can quickly generate unpredictable costs for your business.