Monthly Archive

Complete Guide to Redis Monitoring: Essential Metrics, Tools & Best Practices 2025

Jul 31, 2025 By Ankit Anand In SigNoz

Redis is a powerful tool, but its position in the critical path of applications means that performance issues can have a widespread impact. Whether you use Redis as a cache, session store, or primary database, effective monitoring is essential to prevent slowdowns and ensure a responsive user experience. This guide provides a comprehensive walkthrough of Redis monitoring, covering the essential metrics you need to track, the tools available to you, and the best practices to adopt in 2025.

Read Post

SigNoz

Read more about Complete Guide to Redis Monitoring: Essential Metrics, Tools & Best Practices 2025

Why Observability Isn't Just for SREs (and How Devs Can Get Started)

Jul 30, 2025 By Elizabeth Mathew In SigNoz

Almost every other day, when I scroll past r/devops or r/sre, I see a post like this asking how a dev can get started with devops, observability, etc. Sample Reddit thread on how to get started with OTel This blog is an attempt for anyone lost to find their way into observability and a wake-up call for devs to they should think about observability more actively today than ever before. A dev’s observability playbook.

Read Post

SigNoz

Read more about Why Observability Isn't Just for SREs (and How Devs Can Get Started)

This Month in Datadog: Bits AI SRE, Datadog Data Observability, and more

Jul 30, 2025 By Datadog In Datadog

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. To learn more about Datadog and start a free 14-day trial, visit Cloud Monitoring as a Service | Datadog. This month, we chat with two guests about Bits AI SRE and Datadog Data Observability.

View Video

Datadog

Read more about This Month in Datadog: Bits AI SRE, Datadog Data Observability, and more

Multi Factor Authentication for Synthetic Monitoring for AVD

Jul 29, 2025 By SatheeshKumar S In eG Innovations

Today, I’ll cover some of the basics of monitoring Multi-Factor Authentication and why ensuring MFA is implemented is essential, particularly in environments where remote access is possible. I’ll cover some recent, specific case studies where a lack of MFA has led to security breaches and the mechanisms the bad actors used.

Read Post

eG Innovations

Read more about Multi Factor Authentication for Synthetic Monitoring for AVD

AI Agents Console: Monitor the behavior and interactions of any AI agent in your stack

Jul 29, 2025 By Datadog In Datadog

With Datadog's AI Agents Console, you can monitor the behavior and interactions of any AI agent that’s a part of your enterprise stack, whether that’s a computer use agent like OpenAI’s Operator, IDE agent like Cursor, DevOps agent like Github Copilot, enterprise business agent like Agentforce, or your internally built agents. You'll have full visibility into every agent's actions, insights into the security and performance of your agents, analytics on user engagement, and measurable business value from every agent, all in a centralized location.

View Video

Datadog

Read more about AI Agents Console: Monitor the behavior and interactions of any AI agent in your stack

New in APM

Jul 29, 2025 By Datadog In Datadog

Datadog’s Latency Investigator for APM—now in Preview—automatically investigates hypotheses in the background, comparing historical traces and correlating change tracking, DBM, and profiling signals. This helps teams quickly isolate root causes and understand impact without combing through raw telemetry data. You can go from detection to resolution in a single workflow, and generate a pull request to apply a recommended fix, all without leaving Datadog..

View Video

Datadog

Read more about New in APM

Data Observability: Build confidence in the data life cycle

Jul 29, 2025 By Datadog In Datadog

Datadog Data Observability provides a complete solution with quality checks (e.g., volume, row changes, freshness), custom SQL-based monitors, anomaly detection, column-level lineage across systems like Snowflake and Tableau, full pipeline visibility, and targeted alerts when data issues arise.

View Video

Datadog

Read more about Data Observability: Build confidence in the data life cycle

Observing Vercel AI SDK with OpenTelemetry + SigNoz

Jul 28, 2025 By Goutham Karthi In SigNoz

LLM-powered apps are growing fast, and frameworks like the Vercel AI SDK make it easy to build them. But with AI comes complexity. Latency issues, unpredictable outputs, and opaque failures can impact user experience. That’s why monitoring is essential. By using OpenTelemetry for standard instrumentation and SigNoz for observability, you can track performance, detect errors, and gain insights into your AI app’s behavior with minimal setup.

Read Post

SigNoz

Read more about Observing Vercel AI SDK with OpenTelemetry + SigNoz

JSON Flattening: Fix Query Failures and Cut Storage Costs in SigNoz

Jul 25, 2025 By Anushka Karmakar In SigNoz

You have nested JSON in your logs. When you query body.event.type = "user_action", it fails. Here's why and how to fix it.

Read Post

SigNoz

Read more about JSON Flattening: Fix Query Failures and Cut Storage Costs in SigNoz

OpenTelemetry NestJS Implementation Guide: Complete Setup for Production [2025]

Jul 24, 2025 By Ankit Anand In SigNoz

NestJS applications require comprehensive monitoring to ensure optimal performance and rapid issue resolution. As your application grows—spanning multiple services, databases, and external APIs—understanding what's happening under the hood becomes critical. That's where OpenTelemetry comes in. OpenTelemetry provides vendor-agnostic observability for your NestJS applications through distributed tracing, metrics, and logs.

Read Post

SigNoz

Read more about OpenTelemetry NestJS Implementation Guide: Complete Setup for Production [2025]

Advanced Proactive SSL Certificate Monitoring

Jul 24, 2025 By Ramesh Subramaniam In eG Innovations

eG Enterprise version 7.5 introduces advanced capabilities for detailed SSL Certificate Monitoring including monitoring for web servers and apps using SSL. Monitoring SSL certificates is essential to ensure secure connections, prevent service outages, and maintain user trust. Here are a few things you need to monitor and questions you should ask to keep your services and apps running reliably and securely.

Read Post

eG Innovations

Read more about Advanced Proactive SSL Certificate Monitoring

Taming Your Dynatrace Bill: How to Cut Observability Costs, Not Visibility

Jul 24, 2025 By Mezmo In Mezmo

Dynatrace is a powerhouse for application performance monitoring and business analytics. But for many organizations, its power comes with a significant challenge: as applications scale across complex hybrid environments and diverse tech stacks, the sheer volume and variety of logs, metrics, and traces sent to the platform can explode, leading to staggering and unpredictable costs.

Read Post

Mezmo

Read more about Taming Your Dynatrace Bill: How to Cut Observability Costs, Not Visibility

Monitor apps using Vercel AI SDK with SigNoz and OpenTelemetry

Jul 24, 2025 By SigNoz - Open Source Observability Platform In SigNoz

Monitor apps using Vercel AI SDK with SigNoz and OpenTelemetry. This video talks about how to configure your AI apps to send data to SigNoz using OpenTelemetry.

View Video

SigNoz

Read more about Monitor apps using Vercel AI SDK with SigNoz and OpenTelemetry

Datadog Log Management: Analyze complex data sets

Jul 24, 2025 By Datadog In Datadog

Datadog Sheets provides a spreadsheet-style interface for analyzing your telemetry data — you can perform lookups, build pivot tables, and create calculated columns using familiar spreadsheet functionality. This enables teams to join datasets, aggregate results, and explore trends without writing code.

View Video

Datadog

Read more about Datadog Log Management: Analyze complex data sets

Debug live production issues with the Datadog Cursor extension

Jul 24, 2025 By Datadog In Datadog

The Datadog Cursor Extension uses the Datadog remote MCP Server to give developers access to Datadog tools and observability data directly from within the Cursor IDE. The Cursor Extension enables you to view live variable values that your logpoints capture during execution, and you can use the Cursor Agent to identify the lines of code responsible for the issue at hand. The Datadog Cursor Extension is now available in Preview.

View Video

Datadog

Read more about Debug live production issues with the Datadog Cursor extension

Datadog IDP: Ship software quickly and confidently

Jul 24, 2025 By Datadog In Datadog

Datadog Internal Developer Portal (IDP) helps developers quickly track down shared engineering knowledge, execute common production tasks in self-service manner, and evaluate the production-readiness of new service code.

View Video

Datadog

Read more about Datadog IDP: Ship software quickly and confidently

Why Your Business Needs APM: 10 Key Benefits You Shouldn't Ignore

Jul 23, 2025 By Pavithra Parthiban In Atatus

In today’s digital world, how well your applications perform has a big impact on how people see your business, and how well it runs. Whether you are in finance, e-commerce, SaaS, healthcare, or media, your users expect everything to work smoothly, all the time. Even a few seconds of slow performance can lead to lost sales, lower productivity, and unhappy customers. That’s why Application Performance Monitoring (APM) is so important.

Read Post

Atatus

Read more about Why Your Business Needs APM: 10 Key Benefits You Shouldn't Ignore

Bits AI Dev Agent: Automatically identify issues and generate code fixes

Jul 23, 2025 By Datadog In Datadog

The Bits Dev Agent is an AI-powered coding assistant in Datadog designed to reclaim developer productivity by autonomously monitoring telemetry data, identifying key issues, and generating production-ready pull requests. Developers receive asynchronous, context-rich PRs with clear explanations, allowing them to shift their focus from troubleshooting to reviewing solutions and building better code.

View Video

Datadog

Read more about Bits AI Dev Agent: Automatically identify issues and generate code fixes

Introducing Bits AI SRE, your AI on-call teammate

Jul 23, 2025 By Datadog In Datadog

Bits AI SRE is your AI on-call teammate, built to autonomously investigate alerts and coordinate incident response. Integrated with Datadog, Slack, GitHub, Confluence, and more, Bits analyzes telemetry, reads documentation, and reviews recent deployments to determine the root cause of alerts—often before you’ve even opened your laptop. In fact, if you're using Datadog On-Call, you can view Bits’s findings right from your phone—so you’re always one step ahead, no matter where you are.

View Video

Datadog

Read more about Introducing Bits AI SRE, your AI on-call teammate

Datadog Incident Response: Unify remediation and communication

Jul 23, 2025 By Datadog In Datadog

With Datadog's new AI voice agent in Incident Response, you can quickly get up to speed on the issue and start taking action directly from your phone. Handoff notifications make it easy to jump straight to the relevant context and quickly communicate with other responders. Finally, our status pages enable you to automatically update users on your remediation progress.

View Video

Datadog

Read more about Datadog Incident Response: Unify remediation and communication

What is Python Application Performance Monitoring? - [A Complete Guide]

Jul 22, 2025 By Mohana Ayeswariya J In Atatus

A recent study looked at real-world Python programs and found something important: Python isn’t the main reason apps slow down. The real problems come from inside the code like poor logic, memory issues, and slow database queries. The problem is, these issues often go unnoticed. Your app may seem fine until users start complaining about slowness or things start breaking under pressure.

Read Post

Atatus

Read more about What is Python Application Performance Monitoring? - [A Complete Guide]

From Sequential Bottlenecks to Concurrent Performance: Optimizing Log Processing at Scale

Jul 21, 2025 By Anushka Karmakar In SigNoz

We optimized log processing pipeline by moving from sequential to concurrent processing at the entry level, achieving 30% higher throughput and better resource utilization without increasing infrastructure costs. When customers start sending millions of logs per minute, you quickly discover whether your processing pipeline can actually scale with vertical scaling.

Read Post

SigNoz

Read more about From Sequential Bottlenecks to Concurrent Performance: Optimizing Log Processing at Scale

The Hidden Cost of Not Using APM in Production

Jul 21, 2025 By Pavithra Parthiban In Atatus

Many organizations don’t realize how important it is to monitor how their applications run in production. Without Application Performance Monitoring (APM), it becomes difficult to detect and resolve issues quickly, leading to increased downtime, wasted developer effort, and poor user experience. These hidden costs, though not always visible at first, can impact customer satisfaction, reduce team efficiency, and result in lost revenue.

Read Post

Atatus

Read more about The Hidden Cost of Not Using APM in Production

Golang Application Performance Monitoring: A Comprehensive Guide

Jul 18, 2025 By Pavithra Parthiban In Atatus

Application Performance Monitoring (APM) refers to the practice of tracking, analyzing, and optimizing the performance and availability of software applications. When it comes to Go (Golang), a language known for its concurrency, speed, and efficiency, APM becomes crucial to ensure that your applications stay fast, reliable, and scalable under real-world loads. APM in Go involves monitoring the runtime behavior, request response times, system resource usage, and error patterns across your application.

Read Post

Atatus

Read more about Golang Application Performance Monitoring: A Comprehensive Guide

I built an MCP Server for Observability. This is my Unhyped Take

Jul 18, 2025 By Elizabeth Mathew In SigNoz

Recently, I read a blog titled “It’s The End Of Observability As We Know It (And I Feel Fine)”, which discussed MCP servers in observability and how these systems would potentially be the “end of observability”. As someone who has spun up an MCP server for an observability backend and as someone who has been in the space for a while, I certainly do not think so.

Read Post

SigNoz

Read more about I built an MCP Server for Observability. This is my Unhyped Take

Cloud or Self-Hosted - Which Deployment Model is Right For You?

Jul 18, 2025 By Anushka Karmakar In SigNoz

Choosing the right observability platform is a critical decision. But how you deploy it is just as important. The right deployment strategy can accelerate your team, simplify operations, and ensure you meet compliance and security requirements. The wrong one can lead to operational headaches and slow you down. At SigNoz, we believe in flexibility. There is no single "best" way to deploy an observability platform; there's only the way that's best for you.

Read Post

SigNoz

Read more about Cloud or Self-Hosted - Which Deployment Model is Right For You?

How APM Can Improve Your Digital Customer Experience?

Jul 17, 2025 By Mohana Ayeswariya J In Atatus

When a customer taps a button, submits a form or waits for a page to load, they’re not thinking about your backend architecture, microservices, or CDN; they want it to work instantly. But when it doesn’t, the frustration is immediate. Maybe the app freezes. Maybe a checkout fails. Maybe the entire experience just feels laggy. And the worst part? They don't complain, they just leave the application.

Read Post

Atatus

Read more about How APM Can Improve Your Digital Customer Experience?

Getting started with Dynatrace dashboards

Jul 17, 2025 By Sameer Mhaisekar In Squared Up

Dynatrace gives you incredibly deep observability data. But all that depth can bury the insights needed. In this blog, we show how to turn Dynatrace's complex telemetry into visual dashboards that actually make sense. Dynatrace is a leading observability and application performance monitoring (APM) platform, known for its deep insight into complex, modern cloud environments. With capabilities spanning infrastructure monitoring, real user monitoring, and security, Dynatrace offers powerful telemetry.

Read Post

Squared Up

Read more about Getting started with Dynatrace dashboards

Kubernetes Observability with OpenTelemetry | A Complete Setup Guide

Jul 17, 2025 By Elizabeth Mathew In SigNoz

Kubernetes provides a wealth of telemetry data from container metrics and application traces to cluster events and logs. OpenTelemetry offers a vendor-neutral, end-to-end solution for collecting and exporting this telemetry in a standardised format.

Read Post

SigNoz

Read more about Kubernetes Observability with OpenTelemetry | A Complete Setup Guide

Challenges in AIOps and how to sail through them

Jul 16, 2025 By Swaminathan J In eG Innovations

AIOps (Artificial Intelligence for IT Operations) is not only a game changer, but the need of the hour as modern IT grows and becomes increasingly complex. The promises of AIOps are both overwhelming and tantalizing. AI-powered monitoring and observability can help predict issues, automatically resolve incidents, and optimize performance across the IT infrastructure. However, onboarding an AIOps monitoring tool can be more complicated than it sounds on paper.

Read Post

eG Innovations

Read more about Challenges in AIOps and how to sail through them

Atatus APM: Full-Stack Visibility for Modern Engineering Teams 2025

Jul 15, 2025 By Pavithra Parthiban In Atatus

APM stands for Application Performance Monitoring or Application Performance Management. It helps engineering teams track key metrics, detect slowdowns, and improve the overall performance of their applications. With Atatus APM, you get complete visibility into your application, from backend code and databases to external services and frontend performance.

Read Post

Atatus

Read more about Atatus APM: Full-Stack Visibility for Modern Engineering Teams 2025

Datadog vs Jaeger - Features, Pricing & Use Cases [Updated for 2025]

Jul 10, 2025 By Ankit Anand In SigNoz

Datadog and Jaeger are both leading tools in the observability space, but they represent two fundamentally different philosophies. Datadog is a commercial, all-in-one SaaS platform that unifies metrics, traces, and logs. Jaeger is a popular, open-source project focused specifically on distributed tracing. Choosing between them isn't just a technical decision; it's about balancing the convenience of a fully managed, integrated platform against the power and control of a self-hosted, specialized tool.

Read Post

SigNoz

Read more about Datadog vs Jaeger - Features, Pricing & Use Cases [Updated for 2025]

Why APM Is Essential for Microservices Architecture?

Jul 10, 2025 By Mohana Ayeswariya J In Atatus

According to Statista, over 85% of large enterprises and nearly 50% of small to midsize businesses will have adopted microservices as part of their software architecture. The shift is clear: organizations of all sizes are moving away from monolithic applications toward microservices to accelerate development cycles, improve scalability, and support continuous delivery. But this architectural freedom comes with a hidden cost, which increases operational complexity.

Read Post

Atatus

Read more about Why APM Is Essential for Microservices Architecture?

Beyond Metrics: How We Reimagined Incident Response with RUM

Jul 10, 2025 By Datadog In Datadog

When your monitoring tools and logs tell you everything's fine, but users can't access critical healthcare services, where do you look? Our team discovered that Real User Monitoring (RUM) isn't just for tracking page load times and user journeys – it's a powerful incident response tool that can uncover issues traditional monitoring misses entirely.

View Video

Datadog

Read more about Beyond Metrics: How We Reimagined Incident Response with RUM

How We Made Our Queries 99.5% Faster

Jul 10, 2025 By Anushka Karmakar In SigNoz

We cut log-query scanning from ~100% of data blocks to < 1% by reorganizing how logs are stored in ClickHouse. Instead of relying on bloom-filter skip indexes, they generate a deterministic “resource fingerprint” (hash of cluster + namespace + pod, etc.) for every log source and sort the table by this fingerprint in the primary-key ORDER BY clause. This packs logs from the same pod/service contiguously, letting ClickHouse’s sparse primary-key index skip irrelevant blocks.

Read Post

SigNoz

Read more about How We Made Our Queries 99.5% Faster

Here's how to add business data to logs from retail endpoints | Datadog Tips & Tricks

Jul 10, 2025 By Datadog In Datadog

Some sources simply do not generate data-rich logs. Retail endpoints that are older or run on proprietary services, for example, very often produce logs without the kinds of data that are needed to perform useful business analytics. So, what can you do?

View Video

Datadog

Read more about Here's how to add business data to logs from retail endpoints | Datadog Tips & Tricks

OpenTelemetry Collector: A Complete Guide [2025]

Jul 9, 2025 By Ankit Anand In SigNoz

The OpenTelemetry Collector is a stand-alone service that acts as a powerful, vendor-neutral pipeline for your telemetry data. It can receive, process, and export logs, metrics, and traces, giving you full control over your observability data before it reaches a backend. This guide will provide a comprehensive overview of the OpenTelemetry Collector, its architecture, deployment patterns, and how to configure it for production use.

Read Post

SigNoz

Read more about OpenTelemetry Collector: A Complete Guide [2025]

Monitoring AWS billing costs with AWS tags

Jul 9, 2025 By Babu Sundaram In eG Innovations

Today, I’ll be covering how AWS tags can help you keep track of and monitor your AWS billing costs with the granularity and depth needed to reduce and optimize your AWS costs.

Read Post

eG Innovations

Read more about Monitoring AWS billing costs with AWS tags

Comparing The Top 9 Datadog Alternatives and Competitors in 2025

Jul 7, 2025 By Ankit Anand In SigNoz

The rising costs and complexities of monitoring cloud infrastructure are pushing many organizations to explore alternatives to Datadog. With monthly bills sometimes reaching thousands of dollars and feature sets that can be overwhelming, teams are looking for practical, cost-effective solutions that better fit their needs.

Read Post

SigNoz

Read more about Comparing The Top 9 Datadog Alternatives and Competitors in 2025

Application Performance Monitoring (APM) Use Cases Every DevOps Team Should Know

Jul 4, 2025 By Pavithra Parthiban In Atatus

Modern applications are built using distributed architectures, microservices, and cloud-native technologies. As these systems grow in complexity, it becomes harder for DevOps teams to maintain performance, track issues, and ensure a consistent user experience across all environments. Application Performance Monitoring (APM) helps solve these challenges by providing real-time visibility into how applications behave, from user interactions to backend services and infrastructure.

Read Post

Atatus

Read more about Application Performance Monitoring (APM) Use Cases Every DevOps Team Should Know

APM best practices: Dos and don'ts guide for practitioners

Jul 3, 2025 By Elastic Observability Team In Elastic

Application performance management (APM) is the practice of regularly tracking, measuring, and analyzing the performance and availability of software applications. APM helps you get visibility into complex microservices environments, which can overwhelm site reliability engineering (SRE) teams. The generated insights create an optimal user experience and achieve desired business outcomes.

Read Post

Elastic

Read more about APM best practices: Dos and don'ts guide for practitioners

Choosing the Right APM Software: 5 Key Factors to Consider

Jul 3, 2025 By Mohana Ayeswariya J In Atatus

When applications slow down, users leave, and engineering teams scramble. Whether you're troubleshooting a spike in response times or chasing down intermittent backend failures, Application Performance Monitoring (APM) provides the visibility you need to detect, diagnose, and resolve performance issues before they impact your users or business goals. For engineers, APM isn’t just a convenience - it’s essential. But not all APM tools are created equal.

Read Post

Atatus

Read more about Choosing the Right APM Software: 5 Key Factors to Consider

Introducing Raygun CLI: Level-up your error tracking workflow

Jul 2, 2025 By Kai Koenig In Raygun

Raygun CLI is a powerful command-line interface tool designed to enhance the developer experience when working with Raygun's error tracking and performance monitoring platform. With this tool, we bring Raygun's features directly to your terminal, making it easier to integrate some important elements of Raygun Crash Reporting and error tracking into your development and CI/CD workflow. We are excited to announce the release of version 1.0.0 of Raygun CLI.

Read Post

Raygun

Read more about Introducing Raygun CLI: Level-up your error tracking workflow

The Complete Guide to APM Best Practices for Developers, DevOps & SREs

Jul 2, 2025 By Pavithra Parthiban In Atatus

Application Performance Monitoring (APM) is no longer optional, it is essential for delivering fast, reliable, and seamless digital experiences. But simply installing an APM tool isn’t enough. To truly know its potential, IT teams need to follow APM best practices. Best practices for APM refer to the most effective ways to monitor, analyze, and optimize your application’s performance using APM tools.

Read Post

Atatus

Read more about The Complete Guide to APM Best Practices for Developers, DevOps & SREs

MCP Observability with OpenTelemetry

Jul 2, 2025 By Elizabeth Mathew In SigNoz

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashy and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes, opaque components that handle critical tasks but offer little visibility into what's happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between?

Read Post

SigNoz

Read more about MCP Observability with OpenTelemetry

Instrument NextJS with OpenTelemetry in 100 seconds

Jul 1, 2025 By SigNoz - Open Source Observability Platform In SigNoz

What if setting up observability in your Next.js app was as easy as running a few commands? In this quick guide, we show you how to instrument your Next.js application using OpenTelemetry and visualize with SigNoz — without all the headaches.

View Video

SigNoz

Read more about Instrument NextJS with OpenTelemetry in 100 seconds

Perform Distributed Tracing for your MCP system with OpenTelemetry

Jul 1, 2025 By SigNoz - Open Source Observability Platform In SigNoz

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashy and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes, opaque components that handle critical tasks but offer little visibility into what’s happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between? And when something breaks, how do we trace the failure and debug it effectively?

View Video

SigNoz

Read more about Perform Distributed Tracing for your MCP system with OpenTelemetry

Operations | Monitoring | ITSM | DevOps | Cloud