Sponsored Post

The Top 5 Security Logging Best Practices to Follow Now

Security logging is a critical part of modern cybersecurity, providing the foundation for detecting, analyzing, and responding to potential threats. As highlighted by OWASP, security logging and monitoring failures can lead to undetected security breaches. With the average cost of a data breach at $4.45 million, most organizations can't afford to miss a security incident.

Simplifying public sector observability with OpenTelemetry and Elastic

Public sector organizations today face unique challenges in maintaining and optimizing their IT infrastructure and prioritizing efficiency and interoperability. With a mix of modern cloud and legacy systems, ensuring consistent performance, reliability, and security is paramount. To effectively observe across these environments, government agencies need observability tools that are open, flexible, and scalable. OpenTelemetry (OTel) is fast becoming a pivotal part of that flexible toolset.

Introduction to Private Locations in Splunk Synthetic Monitoring

In this tutorial, we’ll demonstrate how to create and use private locations in Splunk Synthetic Monitoring to test internal or pre-production applications within a Kubernetes environment. You'll learn exactly what private locations and private runners are, common use cases, and step-by-step instructions on how to deploy a private runner using Helm. Finally, you'll see how to set up a simple browser test to run synthetics against a service available only within a Kubernetes cluster.

Troubleshoot microservice-based apps faster with Splunk Observability Cloud

When something goes wrong with your microservice-based apps, Splunk Observability Cloud offers a unified observability platform that makes debugging easier and faster. By using features like the Service Map to identify the cause of the error and Related Logs in Log Observer to pinpoint its location, you can get back up and running quickly, limiting the impact to your bottom line and keeping your customers happy.

The Rise of Shadow AI & the Tech Debt Tsunami

Recently, Logz.io co-founder and CTO Asaf Yigal teamed up with DevOps legend John Willis for an engaging webinar exploring the exciting—and occasionally intimidating—world of Shadow AI and the “tech debt tsunami” on the horizon. This lively session dove into how generative AI (GenAI) is reshaping software development, DevOps practices, and infrastructure management, along with some friendly advice on how organizations can navigate these changes without getting swept away.

MySQL Logs: Your Guide for Database Performance

MySQL logs are basically your database's diary – they record everything happening behind the scenes. Think of them as the black box of your database operations. You've got error logs showing you when things go sideways, query logs documenting every question asked of your database, and binary logs tracking changes like they're gossip in a small town.

Python Loguru: The Logging Cheat Code You Need in Your Life

Debugging is rarely anyone's idea of a good time. You're cruising along, building something cool, when suddenly your code breaks and you're stuck digging through console outputs that look like they were written by a robot having an existential crisis. Enter Loguru – the Python logging library that feels like it was built for humans, not machines.

From Chaos to Clarity With VictoriaLogs - Tech Talks #3

In the third episode, we will guide you through efficiently ingesting and optimizing log pipelines with VictoriaLogs. We'll provide actionable insights on streamlining your processes, enhancing performance, and, most importantly, extracting valuable insights from your data to improve your operations, troubleshoot issues, and gain a competitive edge.

Everything you need to know about HAProxy log format

HAProxy is one of today’s fastest and most widely used load balancing solutions. If you’re already using HAProxy or considering using it in your environment, understanding HAProxy logging is essential. Let’s discuss why HAProxy logging is vital to the load balancer implementation, the logging HAProxy offers, and how to manage and configure HAProxy logs to suit your unique needs.

The Future of Dynamic Observability with Sumo Logic -- Customer Brown Bag -- March 27th, 2025

Join us as Sr. Dir. Technical Marketer, Adam White, and Sr. Product Marketing Manager, Hadijah Creary, go beyond the usual technical deep dive—focusing on the mindset, industry trends, and thought leadership shaping modern observability and the future of dynamic observability with Sumo Logic.

Ubuntu Crash Logs: Find, Fix, and Prevent System Failures

If your system keeps crashing and you have no clue why, Ubuntu’s crash logs might have the answers. Whether you’re running a production server or just trying to keep your personal setup stable, these logs tell you exactly what went wrong. Instead of sifting through endless system logs, Ubuntu gives you focused crash reports—kind of like a security camera that only records when something breaks. Let’s break down where to find these logs and how to make sense of them.

Understanding observability metrics: Types, golden signals, and best practices

Observability metrics provide insights into the performance, behavior, and health of applications, systems, and infrastructure. They enable observability, the practice of understanding a system’s internal state by examining its data. As organizations continue to collect more and more data, observability metrics remain a key telemetry signal.

A Guide to Logging in React Native

Basic console logging is a good starting point for debugging and understanding an app. For larger, more complex apps, it’s helpful to include additional information and persist logs. In this guide, you’ll learn how to create and view logs in React Native and how to create and save custom logs to a file. We’ll focus on JavaScript logs.

7 Open-Source Log Management Tools that You Can Consider in 2025

Open-source log management tools provide cost-effective, customizable approaches for collecting and analyzing log data. They help teams quickly identify patterns, spot anomalies, and resolve issues. With numerous options available, it's important to understand their strengths and limitations. This article examines the top open-source log management tools in 2025, focusing on their capabilities, performance, and best use cases.

Revolutionize Product Development with Feedback-Driven Customer Advisory Boards

In a rapidly evolving business landscape, understanding and responding to customer needs is not just an advantage — it's a necessity. At Splunk, we've taken a bold step by applying a product manager mindset to our Customer Advisory Board (CAB) program, transforming it into a dynamic platform for both customers and our product teams.

Tiered Observability: How To Prioritize and Mature Observability Investments

It may surprise you that delivering observability is a journey. It isn’t about observing everything at once; it’s about driving outcomes like proactive detection, faster troubleshooting, and alignment with business priorities. If you’ve followed this series, you’ve already taken steps to...

State of Observability in Communications and Media

We surveyed ITOps and engineering professionals worldwide to learn how communications and media organizations build leading observability practices. In our webinar, “The State of Observability in Communications and Media,” we explore three priorities for today’s organizations, and what it takes to claim your spot on the observability leaderboard. Join us to discuss the implications of these insights.

How to redact secrets from logs with Grafana Alloy and Loki

In any observability stack, logs are essential for uncovering insights, troubleshooting issues, and ensuring system health. However, managing the security of logged data presents its own challenges, especially when it comes to preventing sensitive information, like API keys and credentials, from slipping into logs. Secrets can originate from a variety of sources, and it’s often challenging to predict which applications or services might inadvertently expose sensitive information.

Introducing Coralogix's AI Center: Real-time AI Observability

Traditional observability wasn't built for AI. The reason? AI operates in shades of grey, where outcomes are non-deterministic. That's why we built the AI Center, bringing real-time AI observability to thousands of enterprises worldwide. As part of our AI Center, we built an evaluation engine designed to oversee and detect the issues that are most common when building AI agents. Teams can choose the evaluators they want to oversee each agent and receive live alerts and reports on specific quality, security, and compliance issues.

Best Logging Practices: 14 Do's and Don'ts for Better Logging

Ever found yourself drowning in a sea of log data, struggling to make sense of the overwhelming noise? Or perhaps faced a major system breakdown, only to find that your logs didn’t provide the answers you needed, leaving you in the dark? Effective logging is a critical yet often overlooked aspect of software development and operations: it’s the foundation upon which observability, troubleshooting, and system maintenance are built.

Elasticsearch in the aviation industry: A game-changer for data management

Digital customer experience is no longer a luxury but a necessity for European airlines. It drives customer satisfaction, enhances operational efficiency, and creates a sustainable competitive advantage. As the industry continues to evolve, airlines that prioritise investment in cutting-edge digital technologies and platforms will be better positioned to thrive in a dynamic and demanding market.

Log File Analysis: A Guide for DevOps Engineers

Ever found yourself buried in endless log files, trying to piece together what went wrong? For DevOps engineers, log analysis isn’t just about debugging—it’s a crucial skill for maintaining reliable systems and catching issues before they escalate. In this guide, we’ll cover everything you need to know about log file analysis, from the fundamentals to the best tools available today.

Website Logging: Everything You Need to Get Started

If you're new to DevOps, you’ve likely noticed that website logging plays a bigger role than it seems at first. It’s not just a routine task—it’s how you keep systems stable, troubleshoot issues, and understand what’s happening under the hood. A good logging setup captures what went wrong, when, and why—helping you fix problems faster instead of guessing.

AWS ALB vs ELB: Which load balancer is right for you?

Load balancers play a key role in Amazon Web Services (AWS) systems by distributing traffic, detecting server issues, and redirecting client requests to available servers without any downtime. But choosing the right AWS load balancer can be daunting, and it’s essential for optimizing your application performance and scalability. Depending on your use case, you may find that an Elastic Load Balancer (ELB) or Application Load Balancer (ALB) better suits your needs.

3CX VoIP Call Detail Records In Graylog

Even with the rise of high-speed networks and sophisticated monitoring tools, VoIP Call Detail Records (CDRs) remain an essential resource for troubleshooting and optimizing bandwidth usage. These records provide a granular view of call quality, latency, jitter, and packet loss, all critical factors that directly impact voice performance.

How to Set Up Logging in Node.js (Without Overthinking It)

Logging in Node.js might not be the most exciting part of development, but it’s one of the most important. Whether you're troubleshooting bugs or keeping track of how your app is running, good logs make life easier. Let’s break down how to set up logging the right way.

Logging and metrics services and Aiven: an overview

Logging and metrics data, regardless of what your business does digitally, are some of the most prolific data types that any service or application can produce. Logging data is generally about specific events that you care about in your application and metrics data is generally about overall system performance and uptime.

Syslog Monitoring: A Guide to Log Management and Analysis

Relying on syslogs to debug issues at odd hours? It happens to the best of us. A solid syslog setup isn’t just about collecting logs—it’s about making them useful. This guide walks through setting up syslog, configuring it for better visibility, and using monitoring techniques that actually help when things go wrong. No fluff, just practical steps you can use right away.

PHP Error Logs: The Complete Troubleshooting Guide You Need

That moment when your PHP application runs flawlessly on your local machine but crashes in production—we've all been there. The key difference between struggling with issues and resolving them efficiently often comes down to understanding PHP error logs. This guide will help you move from trial-and-error debugging to a structured approach for identifying and fixing problems faster.

What To Know About Parsing JSON

If you grew up in the 80s and 90s, you probably remember your most beloved Trapper Keeper. The colorful binder contained all the folders, dividers, and lined paper to keep your middle school and high school self as organized as possible. Parsing JSON, a lightweight data format, is the modern, IT environment version of that colorful – perhaps even Lisa Frank themed – childhood favorite.

Monitor and troubleshoot logs in real-time with Sumo Logic's Live Tail

Troubleshooting production logs shouldn’t be a hassle. Developers and IT operations need real-time insights without jumping between tools or manually sifting through endless log files. Sumo Logic Live Tail simplifies this process. You can instantly search, filter, and troubleshoot log tails in real-time within a single interface to get the data you need without logging into business-critical applications.

How to Monitor Server Uptime Without Missing Critical Failures

Server uptime monitoring is critical for ensuring the reliability and availability of your infrastructure and services. By keeping track of server uptime, you may be able to identify and address potential issues before they impact your end users. Why just “may be able to”? Because “it depends”. It depends on whether your infrastructure, applications, and deployments are built with redundancy in mind. Even if you have a redundant setup, it depends on whether it actually works.

Istio Zero-Code Instrumentation

Tracing in Istio environments should be seamless, but too often, teams run into a frustrating problem—traces are broken. Requests jump between services, but instead of a complete flow, Coralogix displays fragmented spans. Tracing should work out of the box in those environments. Istio’s sidecars capture spans automatically, so why are traces incomplete? The issue is almost always context propagation, and fixing it doesn’t have to mean modifying application code.

How to Analyze Logs Using AI

Your tech stack is growing, and with it, the endless stream of log data from every device, application, and system you manage. It’s a flood—one growing 50 times faster than traditional business data—and hidden within it are the patterns and anomalies that hold the key to the performance of your applications and infrastructure. But here’s the challenge you know well: with every log, the noise grows louder, and manually sifting through it is no longer sustainable.

Logging Best Practices to Reduce Noise and Improve Insights

Are your logs helping you, or are they just creating more work? If you’re sifting through endless data but still missing the important details, you’re not alone. It’s a common challenge—but one that can be solved. For anyone managing infrastructure, logs are essential. They show what’s happening, what’s broken, and sometimes even why. But without the right approach, they can easily turn into clutter instead of clarity.

AI Agents: Your data sidekick (minus the coffee breaks)

Do you ever wish you had a personal data guru who could magically sift through all your data, spot patterns before they become problems, summarize everything in a way that actually makes sense and propose recommendations? Well, meet AI Agents—the “digital teammates” who do all that without demanding coffee breaks.

Nginx Logging: A Complete Guide for Beginners

So, you're wrestling with Nginx logs, huh? Been there. In fact, I used to spend way too much time hunting down log files until I finally got smart about it. Let me save you the trouble. Nginx logs are like the black box flight recorder for your web server. When everything crashes and burns (and it will), those logs are often the only evidence left to figure out what happened. But first, you need to know where to find them.

Getting MTTR to zero: the failed promise of observability

There’s an old cliche about sales and jobs to be done - no one wants to buy a drill, they need a hole… actually, they want a home with pictures on the wall. To get to that beautifully designed home, they will buy a drill, make holes for brackets that can support their various artwork and family photos, and progress toward their dream home experience. Similarly, no one wants to buy observability software. They want their mean time to resolve (MTTR) issues to be zero.

Grafana Alloy: OpenTelemetry, With Some Abstraction Issues

OpenTelemetry (OTel) is supposed to be the great equalizer in observability, giving teams full control over how they collect, process, and store telemetry data. It was built to be open, flexible, and vendor-neutral. Grafana Alloy claims to be OpenTelemetry-compatible, but scratch beneath the surface, and you’ll see that, based on our investigations, it is not a neutral OpenTelemetry Collector.

Revolutionizing Incident Management with AI: Meet Mo Copilot

Join us for this webinar as we explore how our newly launched Sumo Logic Mo Copilot redefines incident management with the power of AI. We'll examine the limitations of traditional troubleshooting methods and why they fall short in today’s fast-paced environments. Discover how Mo Copilot leverages advanced machine learning and automation to streamline root cause analysis and reduce mean time to resolution (MTTR). We'll also showcase a live demonstration and highlight how Mo Copilot integrates into your workflow, transforming how you manage operational reliability.

How I used Graylog to Fix my Internet Connection

In today’s digital age, the internet has become an integral part of our daily lives. From working remotely to streaming movies, we rely on the internet for almost everything. However, slow internet speeds can be frustrating and can significantly affect our productivity and entertainment. Despite advancements in technology, many people continue to face challenges with their internet speeds, hindering their ability to fully utilize the benefits of the internet.

Everything You Need to Know About SIEM Logs

That moment when your production system goes down, and you're stuck piecing together logs from twenty different services? It’s frustrating and slow—especially when you need answers fast. SIEM logs help bring order to this chaos, giving you a structured way to track security events and system activity. But understanding how to use them effectively isn’t always straightforward, and most documentation can feel more complicated than the problem itself.

Python Logging Exceptions: The Setup Guide You Actually Need

Debugging a Python app can be frustrating, especially when an unexpected crash leaves behind nothing but a vague error message. A well-configured exception log can make all the difference, turning guesswork into clear insights. Here’s how to set up logging that actually helps.

Work faster with Sumo Logic: Mo Copilot, Otel Remote Management and more

Are you tired of always digging through data and not finding what you're looking for? We get it. Troubleshooting and data analysis should be easier, not harder, especially when time is of the essence. To simplify your work life, we’ve introduced several powerful new features designed to eliminate wasted time and help you focus on what matters: less time troubleshooting and more time building.

Nginx Error Logs: Troubleshooting and Security Guide

Nginx error logs can be tough to decipher, even for experienced sysadmins and DevOps engineers. They hold valuable clues about what’s going wrong, but sorting through them can feel overwhelming. Understanding these logs doesn’t have to be a challenge. This guide breaks them down in a clear, practical way—so you can find the issues that matter and fix them with confidence.

How to Use journalctl --last to Check Recent System Logs

When your Linux server starts acting up at 3 AM, you don't need a philosophy lesson—you need answers. Fast. That's where journalctl last comes in, the command-line equivalent of having a time machine for your system's events. If you've been piecing together log information like some digital detective with a cork board and string, it's time to upgrade your toolkit. Let's cut through the noise and get you the intel you need, when you need it.

Cut Costs, Not Insights: A Practical Guide to Telemetry Data Optimization - A Mezmo Webinar

Managing telemetry data efficiently is a constant balancing act—how do you maximize visibility while controlling costs? In this webinar, we’ll show you how Mezmo’s telemetry pipeline helps you make smarter decisions about your data.

Top Audit Logging Best Practices

Audit logs, otherwise referred to as audit trails, are detailed records that document activities or a sequence of activities or events. Typically, they deal with the usage of systems, applications, and/or networks. They are crucial in ensuring security, compliance, and operational oversight and enable users to keep track of the history of all actions executed and who has done what and when.