Operations | Monitoring | ITSM | DevOps | Cloud

October 2023

Create a logs app plugin with Grafana Scenes and Grafana Loki

Grafana’s plugin tools help developers extend Grafana’s core functionality and create plugins faster, with a modern build setup and zero configuration. Grafana Scenes, meanwhile, is a new front-end library, introduced with Grafana 10, that enables developers to create dashboard-like experiences — such as querying and transformations, dynamic panel rendering, and time ranges — directly within Grafana application plugins.

Using Cribl Search to Aid in Threat Hunting by Enriching Data in Motion

Cribl Search is reshaping the data search paradigm, empowering users to uncover and analyze data directly from its source. Cribl Search can easily reach out and query data already collected in Amazon S3 (or S3 compatible), Amazon Security Lake, Azure Blob, Google Cloud Storage, and more. By searching data where it lives, you can dramatically speed up your search process by avoiding the need to move data before analyzing it.

Achieving observability in Heroku applications with Sumo Logic

Are you one of the many companies harnessing the power of Heroku to build, deliver and scale your applications seamlessly? If so, you're likely aware of the need for robust observability to ensure your Heroku environment runs smoothly. Let’s delve into the world of Heroku monitoring and explore how Sumo Logic, a leading observability platform, can provide invaluable insights into your Heroku infrastructure and application logs.

What is Network Monitoring?

Today, more than ever, as IT environments become more diverse and complex, the need for an effective network monitoring solution has become paramount. However, the digital environment is constantly evolving, so these tools must keep pace with the changes to remain effective for users diagnosing issues and identifying bottlenecks within their networks.

Sponsored Post

SIEM Logging for Enterprise Security Operations and Threat Hunting

Today's enterprise networks are diverse and complex. Rather than the simple network perimeter of old, bad actors can attack through multiple entry points, including cloud-based applications. Not to mention, these networks generate massive amounts of transactional data. Because enterprise networks have become larger, they're more difficult to secure and manage. As a result, IT operations teams and security analysts seek better ways to deal with the massive influx of information to improve security and observability.

PromCon Recap: Prometheus Ecosystem Updates

In the first part of our 2023 PromCon recap, we spent OpenObservability Talks exploring the Perses open source project. We found heavy users of open source Grafana who found themselves grappling with issues arising from managing a vast number of dashboards, and the need to manage dashboards as code in a GitOps fashion.

IT Operations Management (ITOM): The Basics

What is ITOM? Information technology operations management (ITOM) is the administration and management of an organization’s hardware, network, applications and technology needs. Generally regarded as the true meaning of “tech support,” it is a service-centric approach to IT infrastructure, IT support operations, IT networking and end user support.

Top 10 Distributed Tracing Tools For Your Success

In the intricate web of modern software systems and full-stack observability, knowing how requests flow and interact across distributed components is paramount. Distributed tracing tools can help you. To help you better understand how distributed tracing works and what benefits it brings, here’s our selection of top distributed tracing tools to choose from.

Troubleshoot and Monitor LogStash using Cribl Edge and Cribl Search

I have worked as a helpdesk specialist, cyber security analyst, information systems security engineer, professional services consultant, etc. At this point in my career, I have seen enough to relate with anyone in the IT world. Let’s narrow our focus and chat about monitoring system health and troubleshooting. Tool sprawl is the standard.

A Guide to Docker Adoption

Whether you’re a developer or a security analyst, you probably already know the name Docker. Developers use Docker’s open-source platform to build, package, and distribute their applications. Since the application and all dependencies sit in the container, it runs consistently across different operating systems and environments. As with everything technology, Docker adoption is a good news/bad news story. Good news: DevOps teams can ship applications faster.

DevOps & DORA Metrics: The Complete Guide

In order to achieve DevOps success, you must measure how well your DevOps initiatives work. Tracking the right DevOps metrics will help you evaluate the effectiveness of your DevOps practices. In this article, I’ll explain many DevOps metrics, including their significance, the key metrics for various goals, and (best of all) tips for improving the score of each DevOps metric discussed here.

What Is ITSM? IT Service Management Explained

ITSM, which stands for IT service management, is a strategy for delivering IT services and support to an organization, its employees, customers and business partners. ITSM focuses on understanding end users’ expectations and improving the quality of both IT services and their delivery. In the early days of computers, employees relied on the company IT department for help whenever a computer issue arose.

Use Datadog Dynamic Instrumentation to add application logs without redeploying

Modern distributed applications are composed of potentially hundreds of disparate services, all containing code from different internal development teams as well as from third-party libraries and frameworks with limited external visibility. Instrumenting your code is essential for ensuring the operational excellence of all these different services. However, keeping your instrumentation up to date can be challenging when new issues arise outside the scope of your existing logs.

Continuous profiling: The key to more efficient and cost-effective applications

Recently, Elastic Universal Profiling™ became generally available. It is the part of our Observability solution that allows users to do whole system, continuous profiling in production environments. If you're not familiar with continuous profiling, you are probably wondering what Universal Profiling is and why you should care. That's what we will address in this post.

OpenSearch vs. Elasticsearch: Which is Better?

Following its release under the open-source Apache 2.0 license in 2010, Elasticsearch rose to prominence as the world’s most popular enterprise search engine. Elasticsearch is frequently deployed alongside Logstash and Kibana, a combination known as the ELK stack, to enable log analytics use cases that include application observability, security log analysis, and understanding user behavior.

HTTP Monitor Overview: What It Is, Why & How to Create One [Tutorial]

The World Wide Web’s transmission system is built on HTTP. To ensure an application that uses the HTTP transmission works, you must monitor it constantly. This is where an HTTP monitor comes in. In this tutorial, we’ll cover the fundamentals of HTTP monitors, including what they are, why they matter, and how to set one up.
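As a rough illustration of what an HTTP monitor does, the sketch below performs a single availability check against a URL and records the status code and response time. It is not the tutorial's own method: the names `CheckResult`, `default_fetch`, and `check_once`, the 5-second timeout, and the "2xx/3xx counts as healthy" rule are all assumptions chosen for the example.

```python
import time
import urllib.error
import urllib.request
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class CheckResult:
    """Outcome of one monitoring probe (hypothetical structure)."""
    url: str
    ok: bool
    status: Optional[int]
    elapsed_s: float


def default_fetch(url: str, timeout: float) -> int:
    """Issue a GET request and return the HTTP status code."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses raise HTTPError but still carry a status code.
        return err.code


def check_once(
    url: str,
    fetch: Callable[[str, float], int] = default_fetch,
    timeout: float = 5.0,
    expected: range = range(200, 400),  # assumption: 2xx/3xx means healthy
) -> CheckResult:
    """Probe the URL once and classify the result as up or down."""
    start = time.monotonic()
    try:
        status: Optional[int] = fetch(url, timeout)
    except Exception:
        # DNS failures, refused connections, and timeouts count as "down".
        status = None
    elapsed = time.monotonic() - start
    ok = status is not None and status in expected
    return CheckResult(url, ok, status, elapsed)
```

A real monitor would call `check_once` on a schedule and alert on repeated failures; the `fetch` parameter is injectable so the check can be exercised without network access.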

Solr Monitoring Tools

Solr is widely adopted by startups and enterprises alike. It’s powerful and open-source, so it’s very appealing to just about everyone looking for a search platform to build off of. Because Solr is so easily accessible, many people overlook the importance of monitoring it. Even when that importance is put into question, a lot of people continue with the trend and use an open-source tool for their monitoring needs.

What is Observability? An Introduction

Simply put: Observability is the ability to measure the internal states of a system by examining its outputs. A system is considered “observable” if the current state can be estimated by only using information from outputs, namely sensor data. More than just a buzzword, the term “observability” originated decades ago with control theory (which is about describing and understanding self-regulating systems).

Best practices for Elasticsearch on Kubernetes | Kubecon

In this talk, Radu will delve into the world of Elasticsearch and OpenSearch within Kubernetes. In this informative snippet, we uncover the best practices for deploying, managing, and optimizing these powerful search and analytics engines in your Kubernetes environment. Whether you're a seasoned developer, a DevOps enthusiast, or a data-driven professional, this presentation offers invaluable insights that will enhance your Elasticsearch and OpenSearch deployment strategies.

Elastic's contribution: Invokedynamic in the OpenTelemetry Java agent

As the second largest and active Cloud Native Computing Foundation (CNCF) project, OpenTelemetry is well on its way to becoming the ubiquitous, unified standard and framework for observability. OpenTelemetry owes this success to its comprehensive and feature-rich toolset that allows users to retrieve valuable observability data from their applications with low effort. The OpenTelemetry Java agent is one of the most mature and feature-rich components in OpenTelemetry’s ecosystem.

PromCon Recap: Unveiling Perses, the GitOps-Friendly Metrics Visualization Tool

In the vibrant atmosphere of PromCon during the last week of September, attendees were treated to a plethora of exciting updates from the Prometheus universe. A significant highlight of the event was the unveiling of the Perses project, which gained a lot of traction at the conference. With its innovative approach of dashboards as code, GitOps, and Kubernetes-native features, Perses promises a revolutionary experience for Prometheus users.

OKRs, KPIs, and Metrics: Understanding the Differences

In the world of business management and performance tracking, OKRs, KPIs, and Metrics are common terms thrown around. Each plays a distinct role in helping organizations define their vision, measure their progress, and improve their performance. Let's dive deep into understanding the nuanced differences between these three concepts.

Kubernetes + Cribl Edge: Because Logging and Metrics Shouldn't Be a Mystery Novel!

To fully utilize the capabilities of Kubernetes, it’s crucial to have a reliable system for gathering and organizing logs, metrics, and events. With the complex nature of container orchestration, it’s crucial to understand the significance and process behind the data generated in a Kubernetes environment at scale. Cribl Edge works seamlessly with Kubernetes and can cater to various needs.

Building the Future of Data for IT and Security

Today, Cribl surpassed $100 million in annual recurring revenue (ARR), becoming one of the fastest companies to ever reach this milestone in under four years––an incredible achievement on our journey to building a generational company. Reaching $100 million in ARR so quickly shows that our unique approach and steadfast focus on IT and Security continues to be validated by the market.

Availability: A Beginner's Guide

Availability is the amount of time a device, service or other piece of IT infrastructure is usable — or if it’s available at all. Because availability, or system availability, identifies whether a system is operating normally and how effectively it can recover from a crash, attack or some other type of failure, availability is considered one of the most essential metrics in information technology management. It is a constant concern.
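To make the definition concrete, availability is commonly expressed as the share of time a system was usable. The sketch below uses the usual uptime ratio, uptime / (uptime + downtime); the function names and the 99.9% target in the usage note are illustrative assumptions, not taken from the guide.

```python
def availability_pct(uptime_s: float, downtime_s: float) -> float:
    """Return availability as a percentage of total observed time."""
    total = uptime_s + downtime_s
    if total <= 0:
        raise ValueError("total observed time must be positive")
    return 100.0 * uptime_s / total


def allowed_downtime_s(target_pct: float, period_s: float) -> float:
    """Return how much downtime a target availability permits in a period."""
    return period_s * (1.0 - target_pct / 100.0)
```

For example, under these assumptions a 99.9% target over a 30-day period (2,592,000 seconds) leaves roughly 2,592 seconds, or about 43 minutes, of permitted downtime.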

Stream your Google Cloud logs to Datadog with Dataflow

IT environments can produce billions of log events each day from a variety of hosts and applications. Collecting this data can be costly, often resulting in increased network overhead from processing inefficiencies and inconsistent ingestion during major system events. Google Cloud Dataflow is a serverless, fully managed framework that enables you to automate and autoscale data processing.

API Monitoring: A Complete Introduction

At the most basic level, application programming interface (API) monitoring checks to see if API-connected resources are available, working properly and responding to calls. API monitoring has become even more important (and complicated) as more elements are added to the network and the environment evolves, including multiple types of devices, microservices as a key part of application delivery, and, of course, the widespread move to the cloud.

Challenge Met: Adopting Intelligent Observability Pipelines

Over the last year or so, the unavoidable topic of overwhelming cost has emerged as the number one issue among today’s observability practitioners. Whether it is in conversations among end users, feedback from customers and prospects, industry chatter or the coverage of experts including Gartner, the issue of massive telemetry data volumes driving unsustainable observability budgets prevails.

Using the Cribl API - Part 1

Cribl’s interface is Super Neato: Reactive, beautiful, and easy to use. But sometimes you need to access settings and configurations programmatically. The good news is that interactive API docs are baked into your Cribl instance. The better news is that everything that happens in the GUI is making API calls. With your browser’s developer mode, you can easily take a peek behind the curtain to see exactly how the API was called and what the payload looked like.

How to Parse JSON With BindPlane

About ObservIQ: observIQ brings clarity and control to our customers' existing observability chaos. How? Through an observability pipeline: a fast, powerful and intuitive orchestration engine built for the modern observability team. Our product is designed to help teams significantly reduce cost, simplify collection, and standardize their observability data.

Transforming Observability with Elastic AI Assistant: A Proactive, AI-Driven Approach

Discover how Elastic AI Assistant is transforming the world of observability by offering proactive, AI-powered insights that help SRE teams manage complex systems more efficiently. Say goodbye to manual, reactive processes and hello to a proactive, AI-driven approach with the Elastic AI Assistant for Observability.

How to deploy a Hello World web app with Elastic Observability on Azure Container Apps

Elastic Observability is the optimal tool to provide visibility into your running web apps. Microsoft Azure Container Apps is a fully managed environment that enables you to run containerized applications on a serverless platform so that your applications scale up and down. This allows you to accomplish the dual objective of serving every customer’s need for availability while meeting your needs to do so as efficiently as possible.

Leveraging Tines and Cribl Search for Security Automation

At Cribl, we have the privilege of helping our customers achieve their strategic data goals by giving them visibility and control over all of their observability data. The reality today is that data is commonly stored across many places. Whether intentional (such as using Cribl Stream to create a security data lake) or unintentional (because of silos and tool sprawl), organizations desire the ability to access and analyze all of this information at any time.

What is Infrastructure as Code? An Introduction to IaC

Infrastructure as Code, or IaC, is the practice of automatically provisioning and configuring infrastructure using code and scripts. IaC allows developers to automate the creation of environments to generate infrastructure components rather than setting up the necessary systems and devices manually.

Getting Started with Kubernetes | Start learning Kubernetes in 2023

Ready to dive into the world of Kubernetes? Join us in this beginner-friendly tutorial where we break down Kubernetes infrastructure, explore its fundamental components, and understand how it all fits together at a high level. Whether you're a developer, sysadmin, or just curious about Kubernetes, this video has you covered.

Cribl Stream Demo with Max Weber

Join Cribl's Ed Bailey and Max Weber, Senior Detection Engineer, for a fun discussion about the challenges of detection engineering and how Max is solving these problems every day. We will discuss the current state of detection engineering, why data engineering is a prerequisite for better detection engineering, and what Max would like to see to help drive better outcomes. Max will demo Cribl Stream and show how his data engineering skills drive better detections.

How to Monitor MySQL Using OpenTelemetry

MySQL is the trusted open-source database management system for many desktop, mobile, web, and cloud applications. Monitoring the performance of MySQL is critical, but as applications expand across multi-cloud, cloud-native, and hybrid cloud environments, monitoring also grows in complexity. Continuous monitoring and scaling help applications take advantage of MySQL’s capabilities such as reliability, security, flexibility, availability, and performance scalability.

How To Profile and Optimize Telemetry Data: A Deep Dive

We recently had the privilege of presenting our telemetry data pipelining platform at Cloud Field Day. Today, we'd like to share a recap of our demo with you. In this demo, we explore the transformative potential of data profiling, telemetry pipeline optimization, and incident response. Foundationally, we follow an Understand, Optimize, and Respond workflow.

Coffee Talk with SURGe: The Interview Series featuring Michael Rodriguez

Join Mick Baccio and special guest Michael Rodriguez, Principal Strategic Consultant for Google Public Sector, for a conversation about Michael’s career path into cybersecurity, the origin of his nickname “Duckie,” and his work as a cybersecurity subject matter expert for Google Space.

OpenTelemetry Logs - A Complete Introduction & Implementation

OpenTelemetry is a Cloud Native Computing Foundation (CNCF) incubating project aimed at standardizing the way we instrument applications for generating telemetry data (logs, metrics, and traces). OpenTelemetry aims to provide a vendor-agnostic observability framework that provides a set of tools, APIs, and SDKs to instrument applications.

Connect and Federate Searches Across Your Cloud Data Lakes with Cribl Search

The way we handle massive volumes of data from multiple sources is about to change fundamentally. The traditional data processing systems don’t always fit into our budget (unless you have some pretty deep pockets). Our wallets constantly need to expand to keep up with the changing data veracity and volume, which isn’t always feasible. Yet we keep doing it because data is a commodity.

Turbo-charging AI Ops with the Elastic Observability AI Assistant: ElasticON AI

Elastic Observability experts Bahubali Shetti and Gagan Singh take a deep dive into how the Elastic Observability AI Assistant can help you get deeper contextual insights into telemetry, troubleshoot issues more effectively, reduce time to resolution, and streamline operations.

Real-Time Analytics: Definition, Examples & Challenges

Businesses need to stay agile and make data-driven decisions in real time to outperform their competitors. Real-time analytics is emerging as a game-changer, with 80% of companies showing an increase in revenue due to real-time data analytics as companies can gain valuable insights on the fly. This blog post will explore the concept of real-time analytics, its examples, and some challenges faced when implementing it. Read on for a detailed explanation of this exciting area in data analytics.

Viewing Audit Logs in BPOP Enterprise

Your Guide to Prometheus Observability

Imagine you’re piloting a spaceship through the cosmos, embarking on a thrilling journey to explore the far reaches of the universe. As the captain of this ship, you need a dashboard that displays critical information about your vessel, such as fuel levels, navigation data, and life support systems. This dashboard is your lifeline, providing you with real-time insights about the health and performance of various systems within your ship, so you can quickly make critical decisions.

Latest breakthroughs in vector search for Elasticsearch and Lucene: ElasticON AI

Elastic experts Jim Ferenzi and Ben Trent discuss key Elasticsearch and Lucene improvements, including intuitive vector search support, multi-threading, RRF, and hybrid search with filtering and doc-level security. Plus, hear what they are working on next!

How to Parse with Regex in BindPlane

Breaking Through the Observability Wall: Scaling Your Telemetry Architecture

In today's digital landscape, Observability and telemetry data play a crucial role in ensuring the performance, reliability, and security of modern applications and services. However, as data volumes explode due to the proliferation of micro-services, cloud-based applications, and connected devices, existing architectures are hitting a scalability wall.

Introducing Mezmo Edge

Mezmo Edge enables users to deploy telemetry pipelines and process data in their own environment. A significant advancement in Mezmo’s capabilities, Edge is especially useful when working with sensitive medical or financial records. Organizations that need to comply with PCI, GDPR, or CCPA or that generally work with PII will benefit from Edge’s secure approach to data protection. Edge also provides the telemetry data optimization benefits of a pipeline without cloud data egress charges.

Industry Cloud Platforms, Explained

Cloud computing changed the way enterprise IT works. Investments in public technologies are forecasted to grow by 21.7% to reach the $600 billion mark by the end of this year. The trend is driven by two major factors: Business organizations view these capabilities as an imperative for digital transformation — especially the domain-specific IT services that solve problems unique to their industry verticals.

Maturity Models for IT & Technology

Setting meaningful goals for your technology investment decisions requires an understanding of your requirements. Primarily, that’s… Measuring your IT maturity is one way to advance your IT performance — in a way that aligns with your organizational goals and minimizes the risk of failure. You can compare your current situation to a group of peers or competitors and also to industry benchmarks. Let’s take a look.

Unpacking the Hype: Navigating the Complexities of Advanced Data Analytics in Cybersecurity

The cybersecurity industry is experiencing an explosion of innovative tools designed to tackle complex security challenges. However, the hype surrounding these tools has outpaced their actual capabilities, leading many teams to struggle with complexity and extracting value from their investment. In this conversation with Optiv‘s Randy Lariar, we explore the potential and dangers of bringing advanced data analytics and artificial intelligence tools to the cybersecurity space.

The Evolution of Search: How Multi-Modal LLMs Transcend Vector Databases

As we venture deeper into the data-driven era, the traditional systems we have employed to store, search, and analyze data are being challenged by revolutionary advancements in Artificial Intelligence. One such groundbreaking development is the notable advent of Large Language Models (LLMs), specifically those with Multi-Modal abilities (e.g., Image & Audio).

Coralogix vs Grafana Cloud: Pricing, Features and More

While Grafana is one of the better-known names in the industry, Coralogix offers a full-stack observability platform. Despite the popularity of the Grafana brand, the cloud-based solution is lacking in some key areas. This article will go over the differences between Coralogix and Grafana Cloud, from features and customer support to pricing and more.

Field Data vs Doc Values | Understanding Elasticsearch Performance Issues

🚀 Dive into the world of Elasticsearch performance with our expert at Sematext! In this insightful conference talk, we explore the crucial differences between Doc Values and Field Data, shedding light on the best practices for optimizing your Elasticsearch clusters. Discover how the choice between Doc Values and Field Data can significantly impact your Elasticsearch queries, indexing, and overall system efficiency. Gain the knowledge and insights to supercharge your Elasticsearch deployments.

Introduction to Grafana Plugins

Grafana is a powerful open-source platform for monitoring and observability, but what truly makes it shine are its plugins. For technology engineers looking to expand Grafana's capabilities, plugins are the way to go. In this post, we'll dive into the world of Grafana plugins and offer some unique tips to get the most out of them.

Can You Use the ELK Stack as a SIEM? A Fresh Take

A SIEM system (Security Information and Event Management) is often used by security operations centers (SOCs) for real-time detection of suspicious activity and security events. While some teams choose to adopt a purpose-built SIEM, others rely on the same DevOps tools they are already using for tasks like troubleshooting and operational log data analysis.

Elasticsearch to OpenSearch Migration Facilitated by Sematext Cloud

OK, so you’ve decided to move from Elasticsearch to OpenSearch. Maybe our comparison helped you decide and maybe you’ve checked our guide on how to perform the migration. But how do you know if your new OpenSearch performs as well and functions as correctly as the existing Elasticsearch? Even when comparing old with new versions, upgrades don’t always translate into better performance.

Is a $1 million Datadog bill worth it?

In a recent reddit thread, I got into a conversation about justifying the cost of observability. It got to a really basic question about running a tech company: how do you know that any cost is justified? While a small number of expenses have clear and direct business values, a bunch of other costs, I would even say most costs, just aren’t that clear cut.

Migrating 1 billion log lines from OpenSearch to Elasticsearch

What are the current options to migrate from OpenSearch to Elasticsearch®? OpenSearch is a fork of Elasticsearch 7.10 that has diverged quite a bit from itself lately, resulting in a different set of features and also different performance, as this benchmark shows (hint: it’s currently much slower than Elasticsearch).

An Overview of the Essential Observability Metrics

Metrics are closely associated with cloud infrastructure monitoring or application performance monitoring – we monitor metrics like infrastructure CPU and request latency to understand how our services are responding to changes in the system, which is a good way to surface new production issues. As many teams transition to observability, collecting metric data isn’t enough.

Predictive Maintenance: A Brief Introduction

Predictive maintenance is a maintenance strategy that uses machine learning algorithms trained with Industrial Internet of Things (IIoT) data to make predictions about future outcomes, such as determining the likelihood of equipment and machinery breaking down. Using a combination of data, statistics, machine learning and modeling, predictive maintenance is able to optimize when and how to execute maintenance on industrial machine assets.

The Quest to Process Microsoft Windows Event Logs in Snare Format with Cribl

One of the things I really love about working for Cribl is the ability to help our customers optimize their data. Microsoft Windows Event Logs are something I have always looked to as a proverbial Rosetta Stone to help translate semi-structured, classic-style events into something more efficient and less resource-intensive to search. Extracting field values requires a large number of regular expressions to parse the events, which isn’t ideal.

Exploring systemd journal logs with Netdata

Today, we released our systemd journal plugin for Netdata, allowing you to explore, view, search, filter and analyze systemd journal logs. Like most things about Netdata, this is a zero-configuration plugin. You don’t have to do anything apart from installing Netdata on your systems. This is a key design direction for Netdata, since we want Netdata to be able to help even if you install it mid-crisis, while you have an incident at hand.

CapEx vs OpEx for Cloud, IT Spending, & More

Capital expenditures (CapEx) and operational expenditures (OpEx) are two ways organizations categorize their business expenses. Every organization has a variety of expenses, from office rent to IT infrastructure costs to wages for their employees. To simplify accounting, they organize these costs into different categories, two of the most common being CapEx and OpEx.

Container Orchestration: A Beginner's Guide

Container orchestration is the process of managing containers using automation. It allows organizations to automatically deploy, manage, scale and network containers and hosts, freeing engineers from having to complete these processes manually. As software development has evolved from monolithic applications, containers have become the choice for developing new applications and migrating old ones.

Elasticsearch and Arduino: Better together!

An easy way to communicate with Elasticsearch and Elastic Cloud using Arduino IoT devices. At Elastic®, we are constantly looking for new ways to simplify the search experience, and we started to look at the IoT world. The collection of data coming from IoT can be quite challenging, especially when we have thousands of devices. Elasticsearch® can be very useful to collect, explore, visualize, and discover all the data coming from multiple devices.

Ingesting and analyzing Prometheus metrics with Elastic Observability

In the world of monitoring and observability, Prometheus has grown into the de-facto standard for monitoring in cloud-native environments because of its robust data collection mechanism, flexible querying capabilities, and integration with other tools for rich dashboarding and visualization.

Centralized Logging & Centralized Log Management (CLM)

Centralized logging provides visibility into the system by consolidating all the log data in a single all-in-one source. It supports two particular enterprise needs: Once all the data is ingested in a central location, you can seamlessly identify the problems in systems and troubleshoot them. But with ease comes challenges, too. For example, your team members may struggle with locating their desired details from this sea of data.

systemd journal logs: A Game-Changer for DevOps and Developers

“Why bother with it? I let it run in the background and focus on more important DevOps work.” (a random DevOps Engineer at Reddit r/devops)

In an era where technology is evolving at breakneck speeds, it's easy to overlook the tools that are right under our noses. One such underutilized powerhouse is the systemd journal. For many, it's a mere tool to check the status of systemd service units or to tail the most recent events (journalctl -f).

How to Monitor SQL Server with OpenTelemetry

At observIQ, we've seen growing interest in observing the health of Windows systems and applications using OpenTelemetry. Requests on the SQL Server receiver continue to garner the most interest, so let's start there. Below are steps to get up and running quickly with the contrib distribution of the OpenTelemetry collector. We'll be collecting and shipping SQL Server metrics to a popular backend, Google Cloud.

Predictive Network Technology in 2024

IT networks generate large volumes of information in the form of security, network, system and application logs. The volume and variety of log data make traditional network monitoring capabilities ineffective, especially for monitoring use cases that require proactive decision making. These decisions are based on things like: All of this makes large-scale and complex enterprise IT networks a suitable use case for advanced AI and machine learning capabilities.

Monitor Azure Resource Events with LogicMonitor Logs

The integration of Azure’s event-driven model with LogicMonitor’s monitoring capabilities offers businesses a robust solution for real-time IT infrastructure monitoring. LogicMonitor’s cloud-based platform provides a comprehensive overview of an organization’s IT infrastructure, both on cloud and on-prem.

Cloud Imperium Games moves ELK stack with ChaosSearch.

Cloud Imperium Games (CIG) is a prominent video game development company known for its ambitious project, Star Citizen, which aims to be an open-world, massively multiplayer online space simulation game. As a result of the game's popularity, the metrics, events, and logs generated to track every single action during gameplay have experienced explosive growth in both volume and diversity (a consequence of the dynamic and fast-paced development environment).

Telemetry 101: An Introduction To Telemetry

Understanding system performance is critical for gaining a competitive advantage. Telemetry provides deeper insights into the system, helping business owners make better decisions. This article takes a comprehensive look at the topic of telemetry. We’ll look at its functionality and telemetry types. We’ll also look at all the things telemetry data can help you with, plus the challenges companies with telemetry systems might face.

Listen, Learn and Adapt: The Keys to a Nimble Customer Experience Strategy

In celebration of Customer Experience Day 2023, this post is part of a series on customer experience and the ways that Splunk strives to deliver superior customer experience at every level. Any resilient customer experience (CX) team knows that in order to create superior customer experiences, listening is the first step. This is made apparent when you consider that 73% of customers expect companies to have a firm grasp on their unique needs and expectations.

Full Stack Observability Guide - Examples and Technologies

As modern software systems become increasingly distributed, interconnected, and complex, ensuring production reliability and performance is becoming harder and more stressful. Seemingly nondescript changes to our infrastructure or application can have massive impacts on system uptime, health, and performance, all while the cost of production incidents continues to grow.

Harmonizing Digital Channels and Business Operations to Deliver a Good Customer Experience

In celebration of Customer Experience Day 2023, this post is part of a series on customer experience and the ways that Splunk strives to deliver superior customer experience at every level. Today, customers interact with brands through a variety of channels and platforms. In fact, 57% of customers prefer to engage with brands through digital channels first.

Tracing Your Steps Toward Full Kubernetes Observability

Kubernetes is one of the most important and influential technologies for building and operating software today because it’s so incredibly capable. It’s flexible, available, resilient, scalable, feature-rich and backed by a global community of innovators — that’s a pretty impressive list of intangibles to apply to any particular capability.

How to Build a ROI Plan for Cribl Stream

Getting your organization to invest in a new tool requires telling a story that helps decision-makers understand its benefits. In a recent webinar, our experts discussed how to define an ROI for Cribl Stream. They also shared a sample proposal you can use to craft the story you’ll tell to leadership, and gave some tips and tricks for justifying the purchase of these key tools for your business. Engineers and architects understand core technical problems better than anyone.

Cloud Migration Basics: A Beginner's Guide

What is a cloud migration? A cloud migration is the practice of moving IT workloads (data, applications, security, infrastructure, and other objects) to a cloud environment. Cloud migration can take many forms, including: There is also another type of cloud migration called a reverse cloud migration (also known as cloud repatriation or cloud exit) where existing applications are moved from a public cloud back to an on-premises data center.

Anomaly Detection in 2024: Opportunities & Challenges

Anomaly detection is the practice of identifying data points and patterns that may deviate significantly from an established hypothesis. As a concept, anomaly detection has been around forever. Today, detecting anomalies is a critical practice. That’s because anomalies can indicate important information, such as: Let’s take a look at the wide world of anomaly detection.

Enterprises Realize Benefits from Migrating to Cloud with Splunk

Today, for a lot of organizations, moving to the cloud provides the best strategy to drive higher business efficiency and scale. But moving to the cloud can be challenging. IT leaders are continuously looking for ways to focus more on driving business value while moving to the cloud.

How continuous profiling improved code performance for a new Grafana Loki feature

Throughout the software development process, engineers can use a number of methods and tools to ensure their code is efficient. When using Go, for example, there are built-in tools, including those for benchmarking and CPU/memory profiling, to check how efficiently code will run. Engineers can also run unit tests to validate code quality.

ING's bold leap into the future: Building a global, cloud-based financial messaging system with Elastic

ING Group is a Dutch-based multinational banking and financial services corporation serving more than 38 million customers globally. It’s one of the biggest banks in the world, consistently ranking among the top 30 largest banks globally. At ING, our 20-year-old COBOL-based financial messaging system — which provides electronic instructions to enable financial transactions between banks and customers — is slowly becoming obsolete and difficult to integrate.

The Top 10 OpenSearch Plugins

OpenSearch is a powerful, open-source analytics and search engine that can be used to build custom search solutions for a broad variety of applications, from websites to enterprise-level systems. It offers flexible search and indexing capabilities, making it suitable for a range of uses. A great example of this is scalability: OpenSearch is designed for horizontal scalability, enabling organizations to add nodes to their cluster as data volumes and query loads increase.

Triangulate: Add Logs to Your Monitoring Mix

For many IT organizations, triaging or troubleshooting starts with assessing symptoms. As practitioners investigate the causal factors by answering each of the “5 whys,” logs are often where the actual root cause answers lie. This is even more true for issues related to configuration changes, change management, and security. However, diving into log data can be overwhelming as a first step due to the high volume and velocity of logs and missing context.

How to deploy a Hello World web app with Elastic Observability on AWS App Runner

Elastic Observability is the premier tool to provide visibility into web apps running in your environment. AWS App Runner is the serverless platform of choice to run your web apps that need to scale up and down massively to meet demand or minimize costs. Elastic Observability combined with AWS App Runner is the perfect solution for developers to deploy web apps that are auto-scaled with fully observable operations, in a way that’s straightforward to implement and manage.

When and How to Use Aggregators

There are lots of great reasons to run OTel agents as aggregator / gateway collectors. In this video we discuss 4 of the most common! About observIQ: observIQ brings clarity and control to our customers' existing observability chaos. How? Through an observability pipeline: a fast, powerful and intuitive orchestration engine built for the modern observability team. Our product is designed to help teams significantly reduce cost, simplify collection, and standardize their observability data.

Why Does Observability Need OTel?

To successfully observe modern digital platforms, a new data collection approach was needed. And OpenTelemetry (OTel) was the answer - an industry-agreed open standard - not a single vendor's approach - on how observability (O11y) data should be collected from a platform. This separates out data collection from the vendors’ platform of data processing and visualisation, making the data collecting approach vendor agnostic.

Predictive vs. Prescriptive Analytics: What's The Difference?

Imagine being able to foresee future trends, anticipate customer behaviour, optimize your operations, and take actions that are not just reactive — they shape the future of the market. In the world of data-driven decision-making, we're able to do all that by paying attention to the information we analyze from predictive and prescriptive analytics. A large and growing field, data analytics is often broken into four categories — of which predictive and prescriptive are two!

Announcing Splunk Federated Search for Amazon S3 Now Generally Available in Splunk Cloud Platform

Splunk is pleased to announce the general availability of Federated Search for Amazon S3, a new capability that allows customers to search data from their Amazon S3 buckets directly from Splunk Cloud Platform without the need to ingest it. Enterprises rely heavily on cloud object storage services as the de facto destination for their new data to leverage the cost, compliance, security, scalability and manageability benefits that cloud platforms can offer.

Forwarding Windows Events to CLM

Looking at your IT environment, you probably have various machines and applications connected to your networks. From network devices to servers to laptops, you need to know what’s happening at all times. While your log data provides the monitoring information you need, your environment’s diversity makes aggregating and correlating this information challenging. If your company invested in Windows devices, then your struggle is even more real because Microsoft uses a proprietary format.