The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Error 502 Bad Gateway in Nginx: What It Is and How to Fix It

A 502 Bad Gateway error means that the server (Nginx) can’t properly communicate with the upstream web application server. It is often a sign of more severe problems, such as server overload, improper configuration, or network failure, and it can cause service interruptions that translate to revenue loss. Fortunately, you can easily resolve the error in Nginx once you identify its cause.

Troubleshooting Kafka Clusters: Common Problems and Solutions

Apache Kafka’s thing is real-time data streaming. But keeping it running at full throttle? That takes more than just spinning up a cluster and hoping for the best. As your environment grows, you’ll need to do some tweaking to make sure Kafka keeps up with the pace. The good news? You don’t need to be a Kafka wizard to make a real difference. Even some basic tuning can have a big impact on performance.

Unlock the Real Value of Logs With Honeycomb Telemetry Pipeline and Honeycomb for Log Analytics

At Honeycomb, we know how important it is for organizations to have a unified observability platform. This is why we’re launching Honeycomb Telemetry Pipeline and Honeycomb for Log Analytics: to enable engineering teams to send data, including logs, to a single, unified platform and analyze it there. For too long, teams have had to wrangle large volumes of logs whose context is scattered across multiple teams and tools, leading to knowledge silos.

Introducing UptimeRobot's Core Monitoring Infrastructure Upgrade: What's Changing And What it Means For You

At UptimeRobot, we’re always evolving to serve you better—while understanding that change can sometimes be inconvenient. We’re excited to announce a major infrastructure upgrade designed to boost performance, scalability, and reliability. This upgrade will help us deliver faster, more reliable service as we grow, and we hope you’ll see the benefits soon.

Cisco uses Elastic to save 5,000 support engineer hours a month

With the precision of search and the intelligence of AI, Cisco uses Elastic on Google Cloud to create richer search experiences, so support engineers can quickly find the answers they need. Scaling from this success, Cisco's Search team added AI models, semantic search, and vector search to more than 50 internal- and external-facing apps, helping them innovate more quickly and increase overall operational efficiency.

How can you simplify web performance monitoring with auto RUM injection?

Real user monitoring (RUM) is a powerful tool for optimizing the end-user experiences of web applications. With insights into performance, load times, user behavior, and more, RUM enables businesses to identify and address issues that negatively impact user satisfaction. Consider a scenario where a growing e-commerce company experiences periodic slowdowns during peak hours, adversely affecting user experiences and sales.

Comprehensive Observability: Key Performance Metrics to Monitor in Cloud Environments

Enterprises need strong observability to ensure system reliability, proactively detect and resolve issues, optimize performance, enhance security, and maintain seamless business operations across complex distributed environments.