Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

o11ycon Keynote

presented at o11ycon+hnycon, June 9-10, 2021 Nora Jones, CEO @ Jeli, Charity Majors, CTO & Co-founder@ Honeycomb o11ycon Keynote Nora Jones and Charity Majors will share their experiences leading major movements shaping the future of shipping software. Nora Jones is CEO of Jeli, and former engineer at Netflix and Slack will share her research and experience with Chaos Engineering, human factors, and site reliability. Charity Majors is Honeycomb's CTO and co-founder, who pioneered Observability as a software practice for modern teams.

Performance analysis for supported modules with Honeycomb

The Infrastructure Automation Content (IAC) team noticed some supported modules tests were taking significantly more time than others. David Schmitt, Principal Software Engineer on the IAC team, explains how Puppet utilises Honeycomb to debug our supported modules for potential performance bottlenecks.

Module development failure analysis with Honeycomb

Writing modules for yourself is easy, but writing modules for other people to use? Not so much. Failures in modules can have major repercussions, and our IAC team in Puppet takes that very seriously. Listen as David Schmitt and Daniel Carabas walk you through how we utilise Honeycomb for failure analysis with Github Actions during module development.

OpenTelemetry, Not Just for Production Troubleshooting

OpenTelemetry, Not Just for Production Troubleshooting: How to Prevent Downtime as Early as Local Dev OpenTelemetry is a great tool for observability and debugging in production. It provides you with data that empowers understanding of what is slow or broken, as well as what you can do to fix problems that occur in production. But what if you could leverage those same OpenTelemetry capabilities in pre-production? What if you could use those capabilities during development and testing phases to proactively prevent downtime in production?

Conditional Distributed Tracing

Distributed tracing is generally a binary affair—it's off or on. Either a trace is sampled or, according to a flag, it's not. Span placement is also assumed to be an "always-on" system where spans are always added if the trace is active. For general availability and service-level objectives, this is usually good enough. But when we encounter problems, we need more. In this talk, I'll show you how to "turn up the dial" with detailed diagnostic spans and span events that are inserted using dynamic conditions.

Observability is More Fun With Friends: Stories From OpenTelemetry Collaboration

Panel Guests: Amy Tobey | Equinix Metal, Andrew Hayworth | GitHub, Liz Fong-Jones | Honeycomb, Ted Young | Lightstep The modern open source landscape is hard enough, given the (sometimes) conflicting interests of commercial partners, end-users, and project maintainers. It takes a real, intentional effort to build collaborative relationships across these groups in order to make improvements to projects. In this panel, we'll share stories about what's worked from our involvement in OpenTelemetry as maintainers, community representatives, and end-users.

How To Implement Cloud Observability Like A Pro | Pepperdata

Do traditional on-prem observability techniques translate to the cloud? Many big data enterprises lack observability and thus struggle to manage and understand unprecedented amounts of data in the cloud. A monitoring solution may alert to a problem, but it can’t pinpoint the issue or quickly get to the root cause.