Operations | Monitoring | ITSM | DevOps | Cloud

February 2023

Sweet Trick! Mastering O11y for Cloud-Native Technologies

Using the cloud can be challenging. Everyone likes to tell stories about a surprise cloud bill they received and laugh, but let’s be honest: it’s happened to all of us. When dealing with complex cloud environments and cloud-native architectures, gaining visibility into the state of the environment is critical. Today, developers are looking to monitor how things are working and performing as part of every step in the software development life cycle.

2023 is When More FinOps Practices will Shift Left and Cost Optimization around Logging will Get Central Stage

Effective troubleshooting and resolution of critical production issues require DevOps and R&D teams to utilize logging and observability. However, selecting the right logging solution can be challenging, given the wide range of available options and associated costs. Additionally, the strategy for logging usage should be tailored to the needs of different personas and use cases, such as DevOps engineers versus developers.

Using Cribl Search for Anomaly Detection: Finding Statistical Outliers in Host CPU Busy Percentage

In this video, we'll demonstrate how to use Cribl Search for anomaly detection by finding statistical outliers in host CPU usage. By monitoring the "CPU Busy" metric, we can identify unusual spikes that may indicate malware penetration or high load/limiting conditions on customer-facing hosts. The best part? This simple but powerful analytic is easily adaptable to other metrics, making it a versatile tool for any data-driven organization.

How the All in One Worker Group Fits Into the Cribl Stream Reference Architecture

Join Ed Bailey and Eugene Katz as they go into more detail about the Cribl Stream Reference Architecture, designed to help observability admins achieve faster and more valuable stream deployment. In this live stream discussion, Ed and Eugene will explain guidelines for deploying all in one worker group. They will also share different use cases and talk about the pros and cons for using the all in one worker group.

How CCP Games Used Honeycomb to Modernize and Migrate its Codebase

Imagine a universe in which a massively multiplayer online role-playing game (MMORPG) sets Guinness World Records for the size of its online space battles—and that game is built on 20-year-old code. Well, imagine no more. Welcome to the world of EVE Online, where hundreds of thousands of players interact across 7,800+ star systems and participate in more than one million daily market transactions.

FinOps Observability: Monitoring Kubernetes Cost

With the current financial climate, cost reduction is top of mind for everyone. IT is one of the biggest cost centers in organizations, and understanding what drives those costs is critical. Many simply don’t understand the cost of their Kubernetes workloads, or even have observability into basic units of cost. This is where FinOps comes into play, and organizations are beginning to implement those best practice standards to understand their cost.

SumUp Uses Honeycomb to Improve Service Quality and Strengthen Customer Loyalty

Growing pains can be a natural consequence of meteoric success. We were reminded of that in our recent panel discussion with SumUp’s observability engineering lead, Blake Irvin, and senior software engineer Matouš Dzivjak. They shared how SumUp’s rapid growth spurt compelled them to change their resolution process—both logistically and culturally—to ensure a service level quality that reflects their customer obsession.

Deciding Whether to Buy or Build an Observability Pipeline

In today's digital landscape, organizations rely on software applications to meet the demands of their customers. To ensure the performance and reliability of these applications, observability pipelines play a crucial role. These pipelines gather, process, and analyze real-time data on software system behavior, helping organizations detect and solve issues before they become more significant problems. The result is a data-driven decision-making process that provides a competitive edge.

Fixing Security's Data Problem: Strategies and Solutions with Cribl and CDW

Cribl's Ed Bailey and CDW's Brenden Morgenthaler discuss a foundational issue with many security programs that lack the right data to detect issues and make fast decisions. Data drives every facet of security and bad data/incomplete data weakens your overall program. Ed and Brenden will discuss common issues and strategies for solving security's data problem.

How Database Observability Increases Operational Reliability

Early on in my career as a DBA, I began to realize I wasn’t a Database Administrator as much as the Default Blame Acceptor for nearly any application problem. Let’s face it—the first thing blamed when there’s an issue is typically going to be the database, and it’s a “guilty until proven otherwise” situation.

Bracing for Impact: Why a Robust Observability Pipeline is Critical for Security Professionals in 2023

2023 is well underway and now more than ever it’s important to stay ahead of data trends and security concerns that are ever mounting. With the cost of catastrophic cyber attacks estimated to be ten times that of all other disasters combined, businesses need to take proactive measures to implement a security data pipeline to protect their data and comply with security and retention requirements.

Introducing OpenTelemetry Support: Take Action on Your Observability Data

As an open source company that grew out of a side project in 2008 to an application and performance monitoring platform (APM) used by over 3.5 million developers, Sentry is committed to open source and the community of developers maintaining and building in the open. Similarly, we take a public approach to building our software, which is why it’s a natural extension of our values to announce our support for OpenTelemetry (or OTel), the leading open standard for observability.

Sponsored Post

Discovering Efficiency Through 2 Steps Synthetic Monitoring for Splunk

You're probably familiar with Splunk. It's one of the most popular big data solutions organisations worldwide use to monitor their systems in real-time. But you may not know that Splunk also offers synthetic monitoring solutions via 2 Steps. 2 Steps Synthetic Monitoring for Splunk is a powerful tool that can help you speed up your application troubleshooting process. Today we'll take a closer look at what it is and how it can benefit your organisation.

Understand user journeys with AppDynamics Business iQ

Understand where in their journey, do your users abandon your application. It could be due to latency or other similar types of performance issues, or perhaps when trying to communicate with a 3rd party endpoint, such as a payment processing service. Learn how observing key metrics provides the insights you need to ensure your customers have the best-of-class experience.

How We Manage Incident Response at Honeycomb

When I joined Honeycomb two years ago, we were entering a phase of growth where we could no longer expect to have the time to prevent or fix all issues before things got bad. All the early parts of the system needed to scale, but we would not have the bandwidth to tackle some of them graciously. We’d have to choose some fires to fight, and some to let burn.

Introducing the Cribl Stream Reference Architecture

Join Ed Bailey and Eugene Katz as they unveil the first Cribl Stream Reference Architecture, designed to help observability admins achieve faster and more valuable stream deployment. In this live stream discussion, Ed and Eugene will explain the importance of a quality reference architecture in successful software deployment, and guide viewers on how to begin with the Cribl Stream Reference Architecture by first establishing end-state goals. They will also share different use cases and help viewers identify which parts of the reference architecture are applicable to their specific situation.

Iterating Our Way Toward a Service Map

For a long time at Honeycomb, we envisioned using the tracing data you send us to generate a service map. If you’re unfamiliar, a service map is a graph-like visualization of your system architecture that shows all of its components and dependencies. We didn’t want it to be a static service map, though—the kind you’d view once before going “huh, neat”—and then never looking at it again.

Symantec Edge SWG (formerly ProxySG) Performance Monitoring: Gain Full Observability with DX NetOps and AppNeta

For teams running secure web gateways (SWGs), also referred to as proxies, in today’s complex, dynamic network environments, extensive observability is a must have. Symantec offers a range of flexible deployment options for its SWGs, offering support for cloud, edge, and hybrid approaches. This blog explores a Broadcom solution that provides comprehensive observability for the Symantec edge offering, Symantec Edge SWG (formerly ProxySG).

Webinar Recap: Observability Data Orchestration

Today, businesses are generating more data than ever before. However, with this data explosion comes a new set of challenges, including increased complexity, higher costs, and difficulty extracting value. With this in mind, how can organizations effectively manage this data to extract value and solve the challenges of the modern data stack?

"I can now sleep at night." How Corevist Achieved Single-Pane-of-Glass Observability

In October 2022, we released SolarWinds® Observability, our cloud-native SaaS observability solution. For companies like Corevist, the solution provided them with the ability to define customized monitoring specifically configured to the Corevist instances within each customer deployment. For them, this was a major game changer.

Empowering SecOps Admins: Getting the Most Value from CrowdStrike FDR Data with Cribl Stream

Join Ed Bailey and Sidd Shah as they discuss how Cribl Stream can empower Security Operations Admins to make the most of their CrowdStrike FDR data. During the discussion, Ed and Sidd will address the challenges faced by CrowdStrike customers who generate a vast amount of valuable data each day but struggle to leverage it fully due to complexity and size. They will explain how Cribl Stream can help SecOps admins extract the right data for their SIEM, while moving the rest to their Security Data Lake, enabling them to get the maximum value from their data and be cost-effective at the same time.

Observability vs Monitoring - The difference explained with an example

Observability vs monitoring has been a common topic in DevOps recently. There has been a lot of debate, and I have learned a lot from them when I started my observability journey. Most literature on observability is associated with a particular product or shares a textbook definition. In this blog post, I want to give you a practical understanding of observability and the differences between observability and monitoring with different scenarios and examples. We will cover the following topics here.

Watch: How to pair Grafana Faro and Grafana k6 for frontend observability

Grafana Faro and xk6-browser are both new tools within the Grafana Labs open source ecosystem, but the pairing is already showing a lot of potential in terms of frontend monitoring and performance testing. Faro, which was announced last November, includes a highly configurable SDK that instruments web apps to capture observability signals that can then be correlated with backend and infrastructure data.

It's time for government to move beyond monitoring and into observability

When thinking about holistic end-to-end observability, it can help to start with what you already have. Many government agencies are already strategically ingesting and storing logs — a key component of observability. More than a year and a half after the release of M-21-31, US government agencies continue to work through the logging maturity models outlined in the memorandum.

The Evolution of Applications and Current Trends To Know

IT industry veterans have seen significant changes in the way applications are not only developed but also used. This evolution has been driven by several factors, including the ever-growing need for mobility, vast ecosystems, and the increasing demand for faster and more efficient ways to get work done. Let's take a closer look at how apps have evolved over time.

How Security Engineers Use Observability Pipelines

In data management, numerous roles rely on and regularly use telemetry data. The security engineer is one of these roles. Security engineers are the vigilant sentries, working diligently to identify and address vulnerabilities in the software applications and systems we use and enjoy today. Whether it’s by building an entirely new system or applying current best practices to enhance an existing one, security engineers ensure that your systems and data are always protected.

A Snapshot of our IT Ops Predictions for 2023

Today executives and customers expect IT and digital services to be available and performant at all times; compromised availability or performance is no longer tolerable. Think about it; when was the last time a digital service was unavailable and it didn’t make the news or social media? When was the last time you visited a website that was unavailable and you waited for the outage to be over, rather than finding an alternative in the moment?

Communicating Context Across Splunk Products With Splunk Observability Events

When an IT or Security issue impacts a development team’s software how are they notified? Is your organization still relying on mass emails that lack context and most engineers have probably already filtered out of their inbox? Communicating between siloed tools and teams can be difficult. How would you like to put IT, Security, legacy processes, and business notifications specific to development teams right into one of their most important tools? Now you can!

Connecting OpenTelemetry to AWS Fargate

OpenTelemetry is an open-source observability framework that provides a vendor-neutral and language-agnostic way to collect and analyze telemetry data. This tutorial will show you how to integrate OpenTelemetry with Amazon AWS Fargate, a container orchestration service that allows you to run and scale containerized applications without managing the underlying infrastructure.

Root cause log analysis with Elastic Observability and machine learning

With more and more applications moving to the cloud, an increasing amount of telemetry data (logs, metrics, traces) is being collected, which can help improve application performance, operational efficiencies, and business KPIs. However, analyzing this data is extremely tedious and time consuming given the tremendous amounts of data being generated. Traditional methods of alerting and simple pattern matching (visual or simple searching etc) are not sufficient for IT Operations teams and SREs.

In a Toxic Relationship with Your Current Observability Search Tool? There's Other Fish in the Sea

IT tools are similar to romantic relationships. Over time, you tend to fall into the same old dull routines, like Rupert Holme’s song Escape (The Piña Colada Song). That routine — collect dataset, route, ingest ($$) and then search, collect dataset, route, ingest, then search, … this approach is not only breaking your heart but your budget too.

Get the Big Picture: Learn How to Visually Debug Your Systems with Service Map-Now Available in Sandbox

Honeycomb recently announced the launch of Service Map, a new feature that gives users the ability to quickly unravel and make sense of the interconnectivity between services in highly complex and intricate environments.

Cribl's Zachary Kilpatrick Awarded 2023 Channel Chief Award from CRN for Second Consecutive Year

The Cribl Partner Program is designed to be a comprehensive solution for organizations looking to grow their customer relationships and revenue streams, while also enabling a fast deployment of observability solutions to serve customers. Our partners receive extensive training, tools, and support to unlock the full potential of observability data for their customers.

Profiling: Buzzword or Critical Observability Tool? | Snack of the Week

Profiling may seem like the latest buzzword in the monitoring and observability world, but profiling tools have actually been in use for decades. I’m going to quickly explain what profiling is and why modern profilers are getting so much attention lately.

Cyber Resilience: The Key to Security in an Unpredictable World

Join Ed Bailey and Jackie McGuire as they delve into the topic of cyber resilience and its growing significance in today's digital landscape. In this informative video, you will learn what cyber resilience means, why it's important, and how to manage and improve it in an increasingly unpredictable world. With cyber threats becoming more sophisticated and frequent, cyber resilience has become a critical aspect of protecting personal and business assets. This discussion is perfect for anyone looking to better understand the importance of cyber resilience and how to safeguard against potential threats.

Autocatalytic Adoption: Harnessing Patterns to Promote Honeycomb in Your Organization

When an organization signs up for Honeycomb at the Enterprise account level, part of their support package is an assigned Technical Customer Success Manager. As one of these TCSMs, part of my responsibilities is helping a central observability team develop a strategy to help their colleagues learn how to make use of the product.

Sponsored Post

Complete observability & monitoring of your integration infrastructure

Integration is a fundamental part of any IT infrastructure. It allows organizations to connect different systems and applications together in order to share data and information. As organizations become more complex and interconnected, they need to ensure they have complete observability and monitoring of their integration architecture. This is essential in order to discover, understand and fix any issues that can arise.

SolarWinds Observability: Helping to Accelerate Application Development

Observability is the practice of equipping software and infrastructure with tools capable of gathering actionable data showing not only when an application error or issue occurs but why it occurred. Most traditional monitoring tools gather information passively; observability practices are different. They focus on actively gathering relevant data, especially factors driving operational decisions and actions.

Best Practices for Enriching Network Telemetry to Support Network Observability

Network observability is critical. You need the ability to answer any question about your network—across clouds, on-prem, edge locations, and user devices—quickly and easily. But network observability is not always easy. To be successful, you need to collect network telemetry, and that telemetry needs to be extensive and diverse. And once you have that raw telemetry data, you need to interpret it.