Operations | Monitoring | ITSM | DevOps | Cloud

November 2023

Sponsored Post

Buyer Beware! Three Challenges with Elasticsearch and OpenSearch

Elasticsearch and OpenSearch are powerful enterprise search and analytics engines that have become popular in the world of data management and telemetry analysis. Their ability to swiftly search, analyze, and visualize data has made them indispensable for businesses and organizations. However, in this blog, we will explore a few key challenges faced by companies using Elasticsearch and OpenSearch, shedding light on important considerations when selecting the right tool for your needs.

What is Cardinality? Cardinality Metrics for Monitoring and Observability

The transition to cloud-native architectures has led to an explosion in metrics data, both in volume and cardinality. This necessitates the development of monitoring systems capable of managing large-scale, high-cardinality data to achieve effective observability in these environments . In this blog post, we’ll explore the important role of cardinality in monitoring and observability.

Metrics to Monitor for AWS (ELB) Elastic Load Balancing

Amazon Elastic Load Balancing (ELB) allows websites and web services to serve more requests from users by adding more servers based on need. There are several challenges to operating load balancers, as discussed in a previous blog post: Microservices Load Balancing: Navigating the Waves of Modern Architecture. An unhealthy ELB can cause your website to go offline or slow to a crawl.

Istio Roadmap, Ambient Mesh, and the Service Mesh Landscape: KubeCon 2023 Updates

In the dynamic landscape of microservices and cloud-native architectures, the role of service meshes has become increasingly crucial. These programmable frameworks empower users to seamlessly connect, secure, and observe their microservices, relieving them of the complexities associated with these critical tasks within their applications. Istio, a leading service mesh project, has been at the forefront of this evolution since its inception in 2017.

Syslog-NG: The Sandbox That Taught Me to Appreciate Cribl Even More

Recently, we launched a new Sandbox focused on handling syslog at scale with Cribl. The marketing messaging behind the Sandbox has been done a couple times already; therefore I wanted to let y’all see what we as Cribl Technical Marketing Engineers(TMEs) actually do in our daily lives. I’ll try to keep it engaging, with tales of danger and subterfuge, but I can only take so much artistic license. What’s in a Sandbox and how the Sandbox platform functions (i.e.

Splunk SOAR 6.2 Introduces New Automation Features, Workload Migration, and Firewall Integrations

The Splunk team is proud to announce the release of Splunk SOAR 6.2 (Security Orchestration Automation and Response). We’ve been hard at work developing the latest and greatest features for this update, several of which have come from requests and suggestions from our users over on Splunk Ideas.

Paving the way for modern search workflows and generative AI apps

Elastic’s innovative investments to support an open ecosystem and a simpler developer experience In this blog, we want to share the investments that Elastic® is making to simplify your experience as you build AI applications. We know that developers have to stay nimble in today’s fast-evolving AI environment. Yet, common challenges make building generative AI applications needlessly rigid and complicated. To name just a few.

How Generative AI Makes Observability Accessible for Everyone

We are pleased to share a sneak peek of Query Assistant, our latest innovation that bridges the world of declarative querying with Generative AI. Leveraging our large language models (LLMs), Coralogix’s Query Assistant translates your natural language request for insights into data queries. This delivers deep visibility into all your data for everyone in your organization.

What's IT Monitoring? IT Systems Monitoring Explained

Whether on the cloud or on-premises, visibility into the inner workings of our IT services and infrastructure is an essential ingredient of a well working IT system. The drive for digital transformation as a core strategic objective for most modern enterprises has meant that ensuring IT systems are working well, secured and delivering value for money is a critical endeavor.

Logit.io Unveils Exciting Enhancements: Integrating OpenSearch 2.10.0

We're thrilled to share an exciting update from Logit.io. As part of our ongoing commitment to providing cutting-edge observability solutions to our users, we've integrated OpenSearch 2.10.0 into our platform, bringing a host of advanced features to enhance your experience. Let's dive into what's new and how these changes can benefit your observability workflows.

Announcing Service Map: Troubleshoot With Context and Confidence

Logz.io is excited to announce Service Map, a new way to visualize the data flow, dependencies, and critical performance metrics throughout your microservices architecture, which makes it easy to gather critical troubleshooting context as you investigate production issues.

Enhanced Linux Visibility with Sumo Logic

In the continually evolving digital landscape, the importance of effective and efficient logging cannot be overstated. When we journey into the realm of Linux, this rings particularly true. Today, we'll delve into why Linux logging is vital, the challenges customers commonly encounter with it, and how Sumo Logic has emerged as a market leader in providing unparalleled SIEM solutions.

Coralogix named as AWS Rising Star Partner for 2023

Amazon Web Services, Inc. (AWS), an Amazon.com company, today announced the 2023 AWS Partner Award winners, recognizing leaders around the globe playing a key role helping customers drive innovation and build solutions on AWS. Announced during re:Invent 2023, AWS Partner Awards recognize our Top Partners of the Year and Rising Star Partners of the Year, whose business models have embraced specialization, innovation, and collaboration over the past year.

Using the Cribl Redux Stats Pack

Cribl’s internal metrics are very handy for seeing what Cribl is doing. And while there are many data points related to input vs output volumes, sometimes you need more control over what you’re tracking. This pack allows you to route arbitrarily defined traffic through a stats tracker to capture changes in event count and volume. Perhaps you are onboarding a new host, or trialing a new Pipeline.

Lightning-fast troubleshooting for AWS: How to find the root cause fast with Sumo Logic

It’s time to stop firefighting. With Sumo Logic’s AWS Observability, companies like Snoop have been able to simplify data collection, achieve unified visibility across AWS accounts and regions and leverage machine learning to troubleshoot — fast. This re:Invent, we’re excited to showcase how our capabilities for AWS have evolved.

Using the Cribl API Part II: The Replay

Our previous post was all about dipping your toes into the wonderful world of API interaction. By leveraging Cribl’s API you can automate many parts of your event pipeline management and tasks. So we got that goin’ for us. Which is nice. One of the common use cases for the API I hear about is kicking off data collection automatically. Use cases include: Cribl gives you the tools to collect data when you want, from where you want, and to where you want.

Simplify Kubernetes with Cribl Edge on EKS Add-on

Let’s be honest, working with Kubernetes (K8s) has never been the easiest tech to work with. As a seasoned Kubernetes professional, I find myself constantly looking for ways to set up collecting data from my clusters, only to find out that there is a new, more complicated way to get the data I’m looking for.

Jaeger vs Zipkin: The Complete Comparison Guide

To monitor and troubleshoot the performance of microservice-based applications, Jaeger and Zipkin are examples of the most commonly used open-source distributed tracing systems. They both supply users with insight into the flow of requests through various components of a system, which can be utilized to find latency bottlenecks, errors, and performance problems in the system.

Monitoring Microsoft SQL Server login audit events in Graylog

One of the most important events you should be monitoring on your network is failed and successful logon events. What comes to most people’s minds when they think of authentication auditing is OS level login events, but you should be logging all authentication events regardless of application or platform. Not only should we monitor these events across our network, but we should also normalize this data so that we can correlate events between these platforms.

A Simplified Guide to Kubernetes Monitoring

The open-source Kubernetes platform has become the de facto standard for deploying, managing, and scaling containerized services and workloads. In fact, 83% of DevOps teams are using Kubernetes to deploy containerized applications in production, taking advantage of its workload orchestration and automation capabilities to optimize the software development process and reduce web server provisioning costs.

How to Collect .NET Application Logs with OpenTelemetry

In the realm of modern software development, achieving true observability is paramount for understanding application behavior and performance. This demonstration focuses on a.NET application that harnesses the capabilities of OpenTelemetry to seamlessly integrate logging and tracing functionalities. OpenTelemetry, a key player in the Cloud Native Computing Foundation, provides a unified framework for comprehensive observability.

Micro Lesson: Monitoring and Troubleshooting with AWS Observability Solution

This video introduces Sumo Logic's AWS Observability solution, which is an all-in-one approach to give visibility into the important elements of the cloud infrastructure and assist in troubleshooting complex issues. This video further describes the features of the observability solution such as pre-built dashboards, prepackaged log searches, and the out-of-the-box alerts that help in monitoring and troubleshooting.

Large Enterprise Cuts Elasticsearch and SIEM Costs by 40% with Observo.ai

A large, global Data Management and AI software company with over 5,000 customers across more than 100 countries had seen unprecedented growth (more than 30% year over year) in telemetry data from their multi-cloud infrastructure being sent to the Elasticsearch Observability and SIEM Platform. The growth of this data contributed to a multi-million dollar price tag for Elasticsearch.

Observo.ai Enables Global E-Commerce Giant to Slash Splunk Costs by 50%

A Global 1000 E-commerce company struggled with the rapid growth in telemetry data that their security team analyzes with Splunk, Grafana, and other Observability tools in the cloud. Specifically, the increase in VPC Flow log and Firewall log volumes caused a spike in Splunk costs on certain data sets and triggered daily indexing limit overage fees. As this deluge of data began piling up in block storage within their Splunk index, the team saw corresponding spikes in storage costs.

How to create log sinks

Are you wondering how you can route your Google Cloud logs to your desired destination? Then check out this video, where we introduce you to log sinks which can be used to route logs to various supported destinations, walk you through how it works and the list of supported destinations to which logs can be routed. It covers the different use cases and scenarios, where the logs sinks can be very useful. We’ll also demonstrate how to create and configure an aggregated log sink that sends all VPC flow logs to BigQuery.

Key Value Parser Delivers Useful Information Fast

Parsers make it easier to dig deep into your data to get every byte of useful information you need to support the business. They tell Graylog how to decode the log messages that come in from a source, which is anything in your infrastructure that generates log messages (e.g., a router, switch, web firewall, security device, Linux server, windows server, an application, telephone system and so on).

Elastic Observability monitors metrics for Google Cloud in just minutes

Developers and SREs choose to host their applications on Google Cloud Platform (GCP) for its reliability, speed, and ease of use. On Google Cloud, development teams are finding additional value in migrating to Kubernetes on GKE, leveraging the latest serverless options like Cloud Run, and improving traditional, tiered applications with managed services. Elastic Observability offers 16 out-of-the-box integrations for Google Cloud services with more on the way.

Understanding Log Levels

In this video, we will discuss what log levels are, how to use them in your application, and how to monitor your logs with Sematext. We break down the intricacies of log levels, guiding you through their significance and practical implementation. Elevate your DevOps game with insights on proactive issue detection and rapid problem resolution. With a centralized logging solution like Sematext Cloud, you can enhance collaboration, minimize downtime, and boost overall system performance.

Solving Complexity Challenges with Kubernetes 360

Here at Logz.io, we realize Kubernetes is the most common infrastructure component that organizations are running on to keep their applications going. In return, we’ve made a big investment to support Kubernetes properly and give customers the tools they need to investigate and troubleshoot any issues that arise.

C-suite insights on the transformative power of generative AI

Generative AI is revolutionizing the way businesses operate, from improving operational resilience to mitigating security risks and enhancing customer experiences. In a recent roundup of c-suite insights from three IT leaders — Matt Minetola, CIO, Mandy Andress, CISO, and Rick Laner, chief customer officer — we gain a comprehensive understanding of how generative AI is being used to improve business outcomes across organizations.

Splunk Edge Hub: Physical Data, Sensing and Monitoring on the Edge

Splunk Edge Hub device is a multi-component solution that includes a hardware device coupled with the Splunk platform and solutions that our partners build on top of both. It is a powerful tool that can help collect, distribute and act on data from edge devices and sensors, making it easier to capture and act on data that can be difficult to access physically or digitally.

How SpyCloud Architected Its Cribl Stream Deployment

In this livestream, I talked to Ryan Saunders – Manager of Security Operations at SpyCloud, about how he used the Cribl Reference Architecture to build a scalable deployment. He explained how this approach enabled SpyCloud to grow alongside its evolving needs without requiring significant rework. The reference architecture also facilitated a repeatable data-onboarding process, reducing administrative time and allowing the team to focus on critical security and data analysis tasks.

Optimizing VPC Flow Logs - Part 1

Amazon Web Services (AWS) VPC Flow Logs is a feature designed to capture and provide information about the IP traffic that flows to and from network interfaces within your Virtual Private Cloud (VPC). This data can be published to various destinations, including AWS CloudWatch Logs, AWS S3, or AWS Kinesis Data Firehose. Flow logs serve several important purposes, such as diagnosing security group rule issues, monitoring incoming and outgoing traffic, and determining traffic directions.

Active vs. Passive Monitoring: What's The Difference?

Today, it’s perfectly normal for businesses to continuously monitor software applications and IT infrastructure to ensure uninterrupted customer service. Active and passive monitoring are the two popular methods enterprises use for infrastructure and application performance monitoring (APM). As the names indicate, these two approaches to monitoring are very different.

The Leading Jaeger Dashboard Examples

Unlocking the full potential of observability and tracing in modern software ecosystems has become imperative for businesses striving to deliver improved reliability and user experience. In this comprehensive roundup, we will dive into the world of Jaeger-incorporated observability and tracing dashboards, offering a curated selection of the best use cases that empower DevOps teams, engineers, and developers to gain unparalleled insights into the inner workings of their applications.

Manage metrics & logging costs with Grafana Cloud + Log Volume Explorer demo | ObservabilityCON

Are your SRE and platform teams under pressure to ingest fewer metrics and logs in the name of cost savings? Reducing costs does not have to mean reduced observability. This recording walks through the cost management features in Grafana Cloud that allow you to analyze, attribute, monitor, and optimize your metrics and logs usage – and lower costs – without compromising your observability strategy.

SIEM Implementation Guide: A How-To Guide

In an era where cybersecurity threats are not just frequent but increasingly sophisticated (and becoming more costly), the need for robust defense mechanisms has never been more critical. Security Information and Event Management (SIEM) emerges as a cornerstone in this complex data environment. It’s not just another tool in your cybersecurity toolkit; it’s a solution designed to elevate your organization’s security posture.

Generative AI & Enterprise IT: Overhyped or Radically Under Estimated?

Join Cribl’s Jackie McGuire and Ed Bailey as they discuss AI's current and future state. They will discuss the many challenges and vast promise of this promising way to increase productivity and solve problems. In addition, Jackie and Ed will also comment on SolarWinds’ response to the SEC charges alleging Solarwinds and its CISO defrauded investors by repeatedly misleading them about its cybersecurity posture. Please join us for a great conversation.

Announcing the Splunk Add-on for OpenTelemetry Collector

The Splunk Add-on for OpenTelemetry Collector is a variation of the Splunk Distribution of the OpenTelemetry Collector that simplifies metrics and traces data collection, configuration and management. Since it is an add-on, users can deploy it alongside Universal Forwarders using tools like Deployment Server to start collecting high-fidelity metrics and traces from 1000s of their hosts easily. We’re happy to announce that the Add-On is now generally available in Splunkbase.

Deployment Frequency (DF) Explained

Technical teams use various metrics and indicators to track performance and success. For DevOps teams, among the most important metrics is deployment frequency. Deployment frequency can help you evaluate the software delivery performance of teams that develop software and apps. In this article, I’ll look at using this metric to calculate deployment rate, the importance and best practices for improving your deployment rate and setting your DevOps team up for success.

Mastering Firewall Logs - Part 1

A firewall is a network security device or software that is used to monitor and control incoming and outgoing network traffic based on predetermined security rules. Firewall Logs contain valuable information about network and security events. These logs are essential for security and infrastructure monitoring for enterprises. While this data is critical to securing enterprise networks, they are also one of the most voluminous data types security teams use to monitor and secure their networks.

5 Elasticsearch Disadvantages You Should Know

Since its initial release in 2010, Elasticsearch has grown into the most popular enterprise search engine with use cases that range from web crawling and website search to application performance monitoring and security log analytics. But despite its widespread adoption and success, Elasticsearch does have some notable disadvantages that you should consider - especially if you’re envisioning a high-scale deployment with a large amount of daily ingestion.

The future of Sumo Logic begins at the atomic level of logs

This time of year, complete with Thanksgiving, re:Invent and December holidays around the world, ends up feeling like a natural moment to pause, reflect, and plan for what’s ahead. This is especially true this year, as it also marks my half-year anniversary as CEO of Sumo Logic. I have a strong sense of why I joined, what I’ve learned since leading the incredible team of Sumos, and where I see us going in the future.

Infrastructure Management & Lifecycle Explained

IT infrastructure must meet enterprise needs for effective service delivery while also providing value for money. This is a critical undertaking. Massive data growth, increased complexity of hybrid cloud environments, and emphasis on digital-first strategies are just some of the challenges. This requires an advanced approach to how infrastructure is configured and controlled — infrastructure management.

Modernize Your SIEM Architecture

In this Livestream conversation, I spoke with John Alves from CyberOne Security about the struggles teams face in modernizing a SIEM, controlling costs, and extracting optimal value from their systems. We delve into the issues around single system-of-analysis solutions that attempt to solve detection and analytics use cases within the same tool.

Aggregating Logs From Microservices-Best Practices

Depending on where you are on your journey with microservices, you may have noticed visibility into the system can be a bit tricky at times. Well, there’s good news. Not knowing what’s going on in the system is a solvable problem. One of the first things you can do is get your logs in order. And one of the best ways of doing so is aggregating your logs into a single logging service.

Introducing Responsive Pipelines from Mezmo

The ability to swiftly resolve incidents is central to SREs responsible for a service's reliability and its users' satisfaction. Mezmo has recognized this need and, at Kubecon, unveiled an innovative solution: Mezmo Responsive Pipelines. Responsive Pipelines enable users to pre-configure a Pipeline to respond automatically in the case of an incident.

Recapping KubeCon North America 2023

If you missed KubeCon North America 2023 in Chicago, or you were there and spent more time in the “hallway tracks,” you may have missed some of the big news that came out of the show. We covered the big happenings in the open source cloud native and observability realm in the latest episode of OpenObservability Talks!

Managing Cisco Switch Logs with Kiwi Syslog Server

Network management, particularly the effective handling of system logs, is crucial in maintaining a high-performance and secure IT infrastructure. Log files, or simply logs, are generated by network devices such as switches and routers, serving as valuable resources to understand the intricacies of network performance, spot anomalies, and even comply with regulatory requirements.

From Data Deluge to Strategic Advantage: Cribl and Elastic Chart the Future of Flexible Data Management and Operationalization

In an era where industry standards are as dynamic as the data they govern, Cribl’s core value of putting ‘Customers First, Always’ drives us to stay ahead of the curve. It’s with immense pride and excitement that we announce our strategic partnership with Elastic. This alliance isn’t just a meeting of minds; it’s a bold stride towards a future where flexibility in data management isn’t just a luxury – it’s the standard.

My First Kubecon - Tales of the K8's community, DE&I, sustainability, and OTel

I went to my first Kubecon ever this last week. If you’re not familiar with Kubecon, it is a convention that is around Kubernetes, a Cloud Native Community Foundation (CNCF) open source project. With this being my first Kubecon ever, it was an adventure all around building community, education, kindness, and of course, a love for Kubernetes technology.

Live Render Log Monitoring with Papertrail

Cloud platforms like Render have made developers’ lives easier by handling many of the underlying infrastructure concerns. You can deploy web services, spin up databases, and schedule cron jobs without ever setting up a server manually. However, this convenience comes with a challenge: Accessing logs across these disparate services takes time and effort. To overcome this challenge, many developers centralize their logs with a log management service.

How To Investigate a Reported Problem

Getting to the root cause of a problem in cloud-native environments requires engineers to navigate through immense complexity within a distributed system. Oftentimes, you didn’t write the code and you lack the background and context to quickly understand what’s going on when a problem occurs. The stakes are even higher when a problem is reported - meaning it’s already started to impact the business and the executives and your customers are not pleased.

Evolution of Workplace Search: Search your private data with Elasticsearch

Workplace Search functionality will merge with Elastic Search in the future. Here’s what you need to know. Recent advancements in generative AI technologies have opened up a wave of possibilities with search. As developers build new experiences, users are adopting new ways of using search — from search queries written in natural language, to searching by uploading images or voice samples.

Uncovering Business Insights from Logs

In the world of modern business, data drives decision-making. Every interaction, every transaction, and every click generates a series of data in the form of logs. These logs, often seen as plain text records, have the potential to unlock valuable business insights when analyzed correctly. In How to Create Log-Based Metrics to Improve Application Observability, we described the process of creating log-based metrics to improve application observability using Sematext Cloud.

What is Multicloud? An Introduction

Simply defined, multicloud (or multi-cloud) describes a computing environment that relies on multiple SaaS or cloud services for different workloads within a single architecture. In a multicloud approach, organizations may use public cloud providers such as Amazon Web Services (AWS) for infrastructure, Microsoft Azure for platform, and Google Cloud Platform for development.

The Internet of Medical Things (IoMT): A Brief Introduction

The Internet of Medical Things (IoMT), a subset of Internet of Things (IoT) technologies, comprises inter-networked devices and applications used in medical and healthcare information technology applications. IoMT devices connect patients, doctors and medical devices — including hospital equipment, diagnostic gear, and wearable technology — by transmitting information over a secure network.

Azure Monitoring: What it is and why you need it

Even before the push to the cloud, your company was a Microsoft shop. From workstations to servers, you’ve invested heavily in the Microsoft ecosystem because it gave your business all the technologies necessary for success. As part of your organization’s digital transformation strategy, Azure offered the easiest onboarding experience.

Grabbing the Datadog by the Tail

Datadog is a monitoring and analytics tool for information technology (IT) and DevOps teams that can be used to determine performance metrics as well as event monitoring for infrastructure and cloud services. The software can monitor services such as servers, databases, tools, and applications. Cribl Stream makes it easy to move data from anywhere, to anywhere. We take the saying to heart, and we also allow you to send our Cribl application metrics anywhere.

Building Dashboard and Dashboard Inputs in Cribl Search

This video demonstrates how to create “inputs” to Cribl Search dashboards. An Input is a control widget that we can add to our Dashboards to control how they execute. They allow the user to supply a range of inputs to customize one or many of the Searches in each of the panels on a given dashboard.

How to Create Log-Based Metrics to Improve Application Observability

As a Site Reliability Engineer (SRE) or DevOps professional, you are well aware of the importance of observability in ensuring the smooth functioning and performance of your applications. Observing and monitoring your applications can help you identify and resolve issues in real-time, resulting in increased reliability and improved user experience. Logs play a crucial role in this process as they provide detailed information about the activity and behavior of your applications.

Improvements to DSDL Container Build Process

We’re happy to announce that with the upcoming release of Splunk App for Data Science and Deep Learning (DSDL) 5.1.1 we’re significantly overhauling the build process for containers in DSDL. More and more customers are adopting DSDL for some of their most complex and advanced workloads. In this newest release, we’re making the process of deploying, building and maintaining containers for DSDL more modular, more secure, more robust, and more scalable as well as adding some new features!

Distributed Tracing: Your Ultimate Guide

When all your IT systems, your apps and software, and your people are spread out, you need a way to see what’s happening in all these minute and separate interactions. That’s exactly what distributed tracing does. Distributed tracing is a way to tracking requests in applications and how those requests move from users and frontend devices through to backend services and databases.

6 Reasons Your Data Lake Isn't Working Out

Since the data lake concept emerged more than a decade ago, data lakes have been pitched as the solution to many of the woes surrounding traditional data management solutions, like databases and data warehouses. Data lakes, we have been told, are more scalable, better able to accommodate widely varying types of data, cheaper to build and so on. Much of that is true, at least theoretically.

Enrich Kubernetes with New Deployment Tracking Capability

When things go wrong, we’d all love the ability to go back in time, return things to the way they were, and fix whatever issues pop up at the start so they never happen in the first place. This is no different when maintaining complex microservices-based architectures. With any complex system, things are bound to go wrong from time to time.

The Importance of Microservices

What are microservices? Microservices are a software approach that creates applications as a loose coupling of specific services or functions, rather than as a single, “monolithic” program. A microservice architecture increases the speed and reliability with which large, complex applications are delivered. What makes a service a microservice? Microservices are defined not by how they’re coded, but by how they fit into a broader system or solution.

How to Use Tags to Speed Up Troubleshooting

Maybe as a kid, you pretended to have a magic wand. You would say something like, “Show me the answer to this long division question” then wave your magic wand and wait for the answer. Sadly, mine never seemed to work – for math questions or to make magical snacks appear. Now, imagine if you had a magic wand for your application stack where you could ask it a question about your data and it would give you immediate insights.

Building Dashboard and Dashboard Inputs in Cribl Search

This blog demonstrates how to create “inputs” to Cribl Search dashboards. An Input is a control widget that we can add to our Dashboards to control how they execute. They allow the user to supply a range of inputs to customize one or many of the Searches in each of the panels on a given dashboard. Currently, there are four types of inputs: a time picker, a dropdown, a string, and a number. This blog shows how to create all four types of Inputs on a dashboard using built-in sample data.

Observability Shifts Right

Observability first emerged as a focal point of interest in the DevOps community in the 2017 time frame. Aware that business was demanding highly adaptable digital environments, DevOps professionals realised that high adaptability required a new approach to IT architecture. Whereas historically, digital stacks were monolithic or, at best, coarsely grained, the new stacks would have to be highly modular, dynamic, ephemeral at the component level, and spread over multiple cloud-based services.

How to Quickly Find What's Broken in Your Complex, Cloud Environment

With the rapid adoption of cloud, distributed systems and microservices are standard, resulting in increasingly complex environments. Once straightforward troubleshooting workflows have become chaotic, frustrating, and time-consuming. When something breaks, multiple teams are called to the table to prove they’re “not it”; each with their singular view of the problem.

Introducing Three Powerful Commands in Cribl Search: .show objects, .show queries, and .cancel

Empty spaces, what are we searching for? Abandon queries, but do you know the score? On and on, Does anybody know what we are looking for? … Inspired by “The Show Must Go On”, Queen. Since we launched Cribl Search back in late 2022, we’ve been hard at work on adding features and functionality that continue to empower data engineers to do more with their data without needing to collect it first.

Officially Worldwide: Cribl.Cloud and Cribl Search are now available in EMEA!

At Cribl, we give the people what they want. And what they want is to keep their data close to their sources and destinations. The less data has to travel, the better — lower latency and fewer security risks. This commitment to data locality is even more pronounced among our valued customers in the EMEA region, who are enthusiastically embracing cloud-first strategies.

Tackling Staffing, Funding, and Data Challenges Head-On with TAQA

Join Ed Bailey and TAQA Group's Andrew Ochse as they discuss the diverse services that TAQA offers, look at the challenges with scaling and staffing, and explore in great detail the solutions to classic problems such as insufficient funding, poor data quality, and slow connections linking global sites to their Security Operations Center (SOC).

Quantifying the value of AI-powered observability

Organizations saw a 243% ROI and $1.2 million in savings over three years In today’s complex and distributed IT environments, traditional monitoring falls short. Legacy tools often provide limited visibility across an organization’s tech stack and often at a high cost, resulting in selective monitoring. Many companies are therefore realizing the need for true, affordable end-to-end observability, which eliminates blind spots and improves visibility across their ecosystem.

Value Stream Management: A Brief Explainer

Simply put, value stream management (VSM) is the practice of measuring and improving the flow of business value created by an organization’s software delivery efforts.By monitoring the software delivery life cycle end-to-end, organizations can better identify processes that add value and eliminate those that create waste to optimize the flow of work. Ultimately, this enables teams to move away from activities that don’t directly contribute to customer value and focus more on those that do.

A Data Engineers Journey to Modernizing with Cribl

Terry Mulligan, is a Splunk consultant with Discovered Intelligence (and Notre Dame’s biggest fan)— a data intelligence services and solutions provider that specializes in data observability and security platforms. He shares what Cribl has brought to the table for his organization and his clients, and how it’s changed their processes and the role of the Splunk data engineer.

Setting up better logging in Azure Functions

We have been using Azure Functions for years. Being able to easily deploy and run code on both Azure App Services and real serverless has been a killer feature for all of our asynchronous jobs and services. Unfortunately, the logging approach provided as part of the default template is not ideal. In this post, I'll introduce you to the first steps we take in all of our existing and new function apps to improve logging. A quick note about the Azure Functions runtime.

Why public sector needs AI-powered observability: Cost savings, ROI, and analyst efficiency

Elastic Observability customers saw 243% ROI and $1.2 million in savings over 3 years For government and education organizations around the world, facilitating an efficient, reliable customer experience is essential when providing critical services and building trust with stakeholders. As technology infrastructure expands and the IT landscape becomes a complex mix of private cloud, public cloud, and air-gapped environments, the ability to see across all systems and data is challenging yet critical.

Elasticsearch and LangChain collaborate on production-ready RAG templates

For the past few months, we’ve been working closely with the LangChain team as they made progress on launching LangServe and LangChain Templates! LangChain Templates is a set of reference architectures to build production-ready generative AI applications. You can read more about the launch here.

Elastic Observability ES|QL Demo

Elevate Your Data Game with Elastic Observability and ES|QL! Discover the future of data querying with Elastic’s groundbreaking new feature: ES|QL! In this video you'll deep dive into how ES|QL revolutionizes the way you interact with complex, distributed data, ensuring seamless and efficient data analysis. Who Is This For? Whether you are a data analyst eager to optimize your query writing skills, or a business leader looking to democratize data insights across your organization, this video is tailor-made for you!

Best Practices for Using Git in Your Cribl Workflows

In this conversation, Sanjay Shrestha, Principal Detection Engineer at Bayer, and Raanan Dagan, Principal Sales Engineer from Cribl, talk about the integration of Git in Cribl Stream. They discuss how to manage configuration files and pipelines as code, simplifying their deployment. They also share a demo and give best practices for optimizing your GitOps workflow. In the 10+ years that Bayer has worked with Splunk, they’ve gone from processing just 80 GB/day to more than 13 TB/day.

Data Platforms Explained: Features, Benefits & Getting Started

A data platform is a comprehensive end-to-end solution for all your data. A true data platform can ingest, process, analyze and present data generated by all the systems and infrastructures within your organization. In this topic, there’s a lot of things to understand and consider. So, let’s take a deep look at data platforms, including the definition and related terms, the benefits and use cases, and how to start building your data strategy.

ELT: Extract Load Transform, Explained

Businesses today rely on analytics and insights derived from different data types for gaining competitive advantages. These data often come from different sources and in different formats. Without a unified solution, aggregating those data and performing analytics tasks is challenging. ELT has been invented to solve the complexities associated with processing data from multiple sources while retaining the raw data as it is.

Customer Data Analytics: An Introduction

Simply put, customer analytics (or customer data analytics) is the process of using information about customer preferences and behavior to improve sales, marketing and product development. You can think of customer analytics as the type of customer behavior where buyers are doing internet research before making a purchase. There is now a vast amount of information available for nearly every product category online.

SolarWinds Kiwi Syslog Server Overview

SolarWinds® Kiwi Syslog® Server is an affordable on-premises solution designed to help you manage syslog messages, SNMP traps, and Windows event logs. It centralizes and simplifies log message management across network devices and servers. Kiwi Syslog Server lets you collect, filter, alert, react to, and forward syslog messages and SNMP traps, and it helps you adhere to regulatory compliance. Learn how to simplify syslog and SNMP trap management with SolarWinds Kiwi Syslog Server.

Using Cribl Edge to Collect Metrics from Prometheus Targets in Kubernetes

We continue our exploration of the fascinating world of Kubernetes, logs, and metrics. In our previous installment, we delved into the intricate tale of Cribl Edge and its role in unraveling the mysteries of logging and metrics in Kubernetes environments with the Cribl Edge native sources for Kubernetes Metrics and Logs. Today, we’re picking up where we left off, shining a spotlight on a new and powerful tool that has the potential to demystify this complex ecosystem further.

SEC Charges on SolarWinds: A Wake-Up Call for Cybersecurity and Risk Management

Cribl’s Ed Bailey and Jackie McGuire look into the recent SEC fraud charges leveled against SolarWinds and its CISO, concerning alleged fraud and internal control failures tied to known cybersecurity risks and vulnerabilities. These charges carry long-term implications for corporate handling of cybersecurity and risk management. Tune into the live stream for an engaging conversation, and come prepared with your questions and insights on the future of cybersecurity.

Quick Demo of Logs Pipelines in SigNoz

Log pipeline allows you to preprocess your logs for enrichment, transformation, and attribute extraction before they get indexed. Here's a quick demo of using the Logs pipeline feature in SigNoz to parse Nginx logs. More about SigNoz: SigNoz - Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc. Backed by Y Combinator.

System Operators: Unlock Log Management Mastery with systemd-journal and Netdata

System operators know the drill: as the complexity of systems scales, so does the deluge of logs. Traditionally, taming this relentless tide demands a concoction of costly tools and laborious configurations—until now. The dynamic duo of systemd-journal and Netdata is revolutionizing log management, turning what was once a Herculean task into a streamlined, powerful, and surprisingly straightforward process.

What is IT Asset Management (ITAM)?

Organizations collect technologies like kids collecting baseball cards. As a company’s IT strategy matures, it adds new technologies to supplement previously existing ones, just like kids add new rookie cards to their collections of classics. While kids can leave their baseball cards randomly piled in a shoebox, organizations need to carefully identify and track their IT assets so that they can appropriately manage digital performance and cybersecurity.

Enhance your cloud security with MITRE ATT&CK and Sumo Logic Cloud SIEM

As cloud applications and services gain prominence amongst organizations, adversaries are evolving their toolset to target these cloud networks. The surge in remote work and teleconferencing presents unprecedented opportunities for nefarious activities. Enter the MITRE ATT&CK Framework, also known as a MITRE ATT&CK Matrix—a treasure trove for defending cloud infrastructure and on-premises infrastructure against the newest adversary tactics, techniques, and procedures (TTPs).

What is AIOps? AIOps Explained

What is AIOps? Simply put, AIOps uses big data, analytics and machine learning to automate and improve IT operations (ITOps). AI is particularly important in ITOps functions such as anomaly detection and event correlation, as it has the ability to analyze large volumes of network and machine data to find patterns, identify the cause of existing problems and find ways to forecast and prevent future issues.

What Is OpenTelemetry? A Complete Introduction

What is OpenTelemetry? Simply put, OpenTelemetry is an open source observability framework. It offers vendor-agnostic or vendor-neutral APIs, software development kits (SDKs) and other tools for collecting telemetry data from cloud-native applications and their supporting infrastructure to understand their performance and health. Managing performance in today’s complex, distributed environment is extremely difficult.

Okta evolving situation: Am I impacted?

Cybersecurity is never boring. In recent months, we’ve seen major cyberattacks on Las Vegas casinos and expanded SEC cybersecurity disclosure rules are top of mind. Is it any wonder we consistently recommend taking a proactive approach to secure your environment with a defense-in-depth strategy and appropriate monitoring? News outlets reported the recent compromise at the Identity and Authentication (IAM) firm, Okta.

How To Recover a Cribl Stream Instance Without GitOps/GitHub

When Cribl Stream becomes the center of your data universe, your individual settings, routes, pipelines, and packs become a critical aspect of your work. What happens if you lose access to the UI? If you are on a licensed version of Cribl Stream backing up the work that you are in Sources, Destinations, Routes, Pipelines, and Packs would be done easily using the GitOps remote repo.

Observability for Sustainability

For the past 20 years, the various stakeholder communities that together constitute the IT industry have attempted to address sustainability. The original efforts grew out of the realisation that even as far back as 2005, the hardware and software that underlay the digital world were responsible for approximately 5% of overall energy consumption and that both the percentage and absolute amounts of energy required were growing in the double digits.

What is the Purpose of Syslog Monitoring in Enterprise Software Companies?

Baseball fans know about the various in-game statistics and actions requiring someone to keep them as records. From a player's overall performance at-bat to a game's final score at the bottom of the ninth, dozens (possibly hundreds) of different statistics are happening throughout a season. In Major League Baseball, these records are essential for the team owner, front office workers and coaches to figure out strategies on the diamond or how to distribute fair pay.

APM vs Tracing vs Observability

Application Performance Monitoring (APM), tracing, and observability are fundamental software development and system management approaches. Each of these three concepts uniquely ensures that your applications operate, efficiently, smoothly, and reliably. Your organisation will more than likely already adopt one of these approaches, or even two, potentially all three.

What is Infrastructure Monitoring?

Infrastructure Monitoring can be a powerful tool for engineers to analyze, visualize and comprehend if a backend is affecting users, by collecting health and performance data from containers, servers, databases, virtual machines, and other backend components in a tech stack. Within this article, we will outline what Infrastructure Monitoring is, how it works, what Infrastructure Monitoring as a Service is, and some benefits of the solution.