The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

How to Stream AWS Logs to Grafana Cloud via Amazon Data Firehose | Grafana

In this video, we show you the steps to configure your Grafana account so you can start streaming AWS logs to Grafana Cloud using Amazon Data Firehose. It takes just a few minutes to set up so you can see your logs in Grafana Explore. Save money and time by using this new approach!

Empowering Excellence: Celebrating Five Years of Trust and Innovation

At ScienceLogic, we’re thrilled to mark a significant milestone: five consecutive years of earning TrustRadius’s Top Rated award. Since 2016, the TrustRadius Top Rated Awards have been the B2B industry’s standard for unbiased recognition of excellent technology products. Based entirely on customer feedback, results have never been influenced by analyst opinion or status as a TrustRadius customer.

Manage incidents seamlessly with the Datadog Slack integration

Modern, distributed application architectures pose particular challenges when it comes to coordinating incident management. DevOps, SREs, and security teams—often spread out across separate locations and time zones, and equipped with limited knowledge of each other’s services—must work quickly to collaboratively triage, troubleshoot, and mitigate customer impact.

What's New With Mezmo: In-stream Alerting

Here at Mezmo, we see the purpose of a telemetry pipeline as helping you ingest, profile, transform, and route data to control costs and drive actionability. There are many ways to do that, as we’ve previously discussed in our blogs, but today I’m going to talk about real-time alerting on data in motion: yes, on streaming data, before it reaches its destination.

Introducing Elastic's OpenTelemetry Distribution for Node.js

We are delighted to announce the alpha release of the Elastic OpenTelemetry Distribution for Node.js. This distribution is a light wrapper around the OpenTelemetry Node.js SDK that makes it easier to get started using OpenTelemetry to observe your Node.js applications.

Does Your Observability Practice Lack Maturity? Here's What to Do.

Observability isn’t new. But organizations are struggling to adopt mature observability practices, and the impact on business is palpable. Organizations are seeing the value of observability for their applications and infrastructure; the results of our 2024 Observability Pulse survey of 500 global IT professionals reflect that across the board.

"Secret" elmah.io features #4 - Get help from AI and ChatGPT

In this fourth post in the series of "secret" elmah.io features, I want to introduce you to one of several AI features available on elmah.io. We have had machine learning features like automatic bot detection and spike identification for years. But a recent addition to the portfolio of AI features is the integration with ChatGPT to get help solving issues. In this post, I'll show you how to set it up and how it works.

Scaling in the Cloud with Cribl's Universal Receiver

Scaling cloud services is a critical task for Site Reliability Engineers, and it’s a challenging one. As organizations grow, the amount of data and the number of people using it grow like crazy, pushing traditional data management methods to their limits. SREs not only have to keep everything running, they’ve got to make sure it runs smoothly, efficiently, and swiftly.