Monitoring

The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Grafana OnCall mobile app notifications: The new and improved experience for Android users

The Grafana OnCall mobile app is an essential tool for on-call engineers to monitor and respond to critical system events. Available for both iOS and Android, the app offers a range of features and notification settings that make the on-call experience easier and more intuitive — all in the palm of your hand.

Why Organizations are Using Grafana + Loki to Replace Datadog for Log Analytics

Datadog is a Software-as-a-Service (SaaS) cloud monitoring solution that enables multiple observability use cases by making it easy for customers to collect, monitor, and analyze telemetry data (logs, metrics and traces), user behavior data, and metadata from hundreds of sources in a single unified platform.

Optimizing IT Operations: Health System Drives Efficiency and Cost Savings

In the critical world of healthcare services, efficiency and accuracy are indispensable. For a leading American health services company catering to the healthcare needs of over 100 million individuals and managing a workforce of more than 100,000 associates, a strategic approach to optimizing IT operations is necessary.

Troubleshoot WiFi and Wireless Networking Issues Everywhere

In today’s varied workspace dynamics, wireless networking issues can greatly impact user experience and productivity. Whether it’s slow download speeds, poor wireless coverage, connectivity, or collaboration problems during virtual meetings, wireless troubleshooting is crucial to ensuring remote and office productivity.

Top 10 Change Management Tools

Changes to software are an inevitable and fundamental part of growth for any organization. However, change is often not straightforward: it can affect numerous aspects of a company and requires collaboration among all stakeholders. This is where change management tools come in. There’s currently a wide range of change management tools available, each with strengths in some scenarios and weaknesses in others.

A guide to scaling OpenTelemetry Collectors across multiple hosts via Ansible

OpenTelemetry has emerged as a key open source tool in the observability space. And as organizations use it to manage more of their telemetry data, they also need to understand how to make it work across their various environments. This guide is focused on scaling the OpenTelemetry Collector deployment across various Linux hosts to function as both gateways and agents within your observability architecture.

Control your log volumes with Datadog Observability Pipelines

Modern organizations face a challenge in handling the massive volumes of log data—often scaling to terabytes—that they generate across their environments every day. Teams rely on this data to help them identify, diagnose, and resolve issues more quickly, but how and where should they store logs to best suit this purpose? For many organizations, the immediate answer is to consolidate all logs remotely in higher-cost indexed storage to ready them for searching and analysis.

Aggregate, process, and route logs easily with Datadog Observability Pipelines

The volume of logs generated from modern environments can overwhelm teams, making it difficult to manage, process, and derive measurable value from them. As organizations seek to manage this influx of data with log management systems, SIEM providers, or storage solutions, they can inadvertently become locked into vendor ecosystems, face substantial network costs and processing fees, and run the risk of sensitive data leakage.