Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Log Management, Log Analytics and related technologies.

Kubernetes Monitoring and Alerting Made Easy with Splunk Observability Cloud and OpenTelemetry

In this video, I'll show you how to quickly setup monitoring and alerting for your Kubernetes clusters using Splunk Observability Cloud. We’ll start by deploying the Splunk OpenTelemetry Collector using Helm, and then use the Kubernetes Navigator inside Splunk Observability Cloud to view the health of our cluster and the applications it’s hosting. I’ll demonstrate AutoDetect detectors and alerts by intentionally triggering an issue in the cluster and walk through the alerting process. We’ll review the alerts in Splunk Observability Cloud and then resolve the issue in the cluster.

Petabyte Scale, Gigabyte Costs: Mezmo's Evolution from ElasticSearch to Quickwit

At Mezmo, we handle an enormous volume of telemetry data for our customers and ourselves, requiring a robust and efficient search and analytics backend. For years, ElasticSearch served us well, but as our infrastructure grew to a multi-cluster, multi-petabyte scale, we started to see the cracks—rising costs, performance bottlenecks, and scalability concerns. We needed a change, one that would make our system more cost-effective while maintaining speed and reliability.

SSHD Logs 101: Configuration, Security, and Troubleshooting Scenarios

Secure Shell (SSH) is a fundamental tool for remote system administration, and its logs play a critical role in security monitoring, debugging, and compliance. SSHD logs provide insights into authentication attempts, connection successes, failures, and potential intrusions. This guide explores everything you need to know about SSHD logs, including their location, format, analysis, and lesser-known security practices to maximize their effectiveness.

Resolving Heroku deployment issues using comprehensive log data

Deploying applications on Heroku offers a streamlined process for developers, but even the most well-optimized setups can encounter deployment issues. To effectively resolve these issues, it's crucial to gain real-time insights into your app’s behavior, traffic, and performance metrics. The solution to resolving Heroku deployment challenges lies in leveraging the power of log management.

Kubernetes 101

When you get behind the wheel of your car, one of the first things you see is the dashboard. Your dashboard provides various information about all the different technologies that make the car run smoothly, like helping you control your speed, providing insight into your fuel levels, and offering suggestions for regular maintenance, like oil changes. For developers, Kubernetes acts as that one-glance dashboard to provide insights about container performance, maintenance needs, and storage requirements.

The Basics of Log Parsing (Without the Jargon)

Logs are crucial for understanding what's happening in your system, but they can often be hard to make sense of. Log parsing is the key to turning raw, unstructured data into something useful. In this blog, we'll explore the basics of log parsing, its importance, and how it helps you extract valuable insights from your logs without all the clutter.

Syslog Protocol: A Reference Guide

Syslog was developed in the 1980s by Eric Allman as part of the Sendmail project and adopted by many systems over the years. When looking at Syslog, there are a few protocol options, each with slight differences. In this reference guide, I’ll break down the differences so that you have a guide to see these formats when utilizing this protocol.

9 essential metrics to track for effective IT operations with log management tools

Monitoring the correct metrics is crucial for efficient IT operations, as it ensures the smooth functioning of an organization's infrastructure. One crucial aspect of this process is log management, which empowers IT teams to address critical aspects of IT infrastructure, including performance, availability, security, resource usage, and integration.
Sponsored Post

How to Discover Advanced Persistent Threats in AWS

When it comes to managing AWS cloud security, a growing concern for security operations (SecOps teams) is the increasing sophistication of digital threats. While conventional cyber threats deploy widely known tools and techniques in crude, all-or-nothing attempts to breach enterprise security controls, sophisticated attacks known as Advanced Persistent Threats (APTs) employ more advanced technologies and methods to gain and maintain access to secure systems for long periods of time.