The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

A detailed guide on Azure architecture diagram

Azure architecture diagrams are visual representations that illustrate the structure, components, and relationships of a solution or application deployed on Microsoft Azure. These diagrams provide a clear and concise overview of the various Azure resources and services used in a specific architecture. They are helpful for design discussions, documentation, and communication among team members and stakeholders.

OpenTelemetry Gotchas: Phantom Spans

This guest post is written by Ian Duncan, Staff Engineer - Stability Team at Mercury. To view the original post, go to Ian's website. At work, we use OpenTelemetry extensively to trace execution of our Haskell codebase. We struggled for several months with a mysterious tracing issue in our production environment wherein unrelated web requests were being linked together in the same trace, but we could never see the root trace span.

Bringing speedups to top-k queries with many and/or high-frequency terms

Disjunctive queries (term_1 OR term_2 OR ... OR term_n) are very common, so they receive a lot of attention when it comes to improving query evaluation efficiency. Apache Lucene has two main optimizations for evaluating disjunctive queries: BS1 for exhaustive evaluation, and MAXSCORE and WAND for computing top hits.
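To make the idea of a top-k disjunctive query concrete, here is a minimal Python sketch of the naive baseline: score every document that matches at least one term, then keep the k best. This is not Lucene's BS1, MAXSCORE, or WAND, and the `postings` structure, the per-term scores, and the function name `top_k_disjunction` are all hypothetical. Optimizations such as MAXSCORE and WAND exist to avoid doing this exhaustive work when only the top hits are needed.

```python
import heapq
from collections import defaultdict


def top_k_disjunction(postings, query_terms, k):
    """Exhaustively score term_1 OR ... OR term_n and return the k best hits.

    postings: hypothetical index mapping term -> list of (doc_id, score).
    A document's total score is the sum of the scores of the terms it matches.
    """
    scores = defaultdict(float)
    for term in query_terms:
        for doc_id, score in postings.get(term, []):
            scores[doc_id] += score
    # Highest score first; ties are broken in favor of the lower doc_id.
    return heapq.nlargest(k, scores.items(), key=lambda item: (item[1], -item[0]))


# Toy data, invented for illustration only.
postings = {
    "lucene": [(1, 1.5), (2, 0.5)],
    "search": [(2, 2.0), (3, 1.0)],
}
top_hits = top_k_disjunction(postings, ["lucene", "search"], k=2)
```

In this toy index, document 2 matches both terms and therefore ranks first, even though neither of its individual term scores is the highest for "lucene".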

Create browser tests directly from Datadog RUM Session Replay

Testing is a key part of application development and helps you maintain a reliable experience for your users. But the process can be difficult to scale and is often siloed within a single team or individual who lacks broad knowledge of your application’s UI. This can lead to organizations investing in sizable test suites that do not accurately represent real user behavior.

How to Improve Core Web Vitals

Gaps in website performance optimization can be costly. Websites that fail the Google Core Web Vitals assessment can expect their traffic, conversions, and business revenue to decline, and they can only recover the lost ground with fast intervention and careful strategic planning.

How to provision a notification policy in Grafana Alerting - and keep it editable in the UI

Provisioning Grafana Alerting resources, such as notification policies, can help you deploy resources faster and streamline the alerting and notification process. Before getting started, it’s important to understand the different options for provisioning notification policies, how they work, and the challenges they can present. In Grafana Alerting, notification policies use alert labels to determine how alerts are routed to different contact points or receivers.
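The label-based routing described above can be illustrated with a simplified Python model. This is only a sketch under stated assumptions, not Grafana's implementation or provisioning format: the `Policy` class, the `route` function, and the contact point names are hypothetical, matchers are limited to exact label equality, and options such as continuing to sibling policies are left out.

```python
from dataclasses import dataclass, field


@dataclass
class Policy:
    """Hypothetical, simplified notification policy node."""
    contact_point: str
    matchers: dict[str, str] = field(default_factory=dict)  # label -> required value
    children: list["Policy"] = field(default_factory=list)

    def matches(self, labels: dict[str, str]) -> bool:
        # Every matcher must be satisfied by the alert's labels.
        return all(labels.get(name) == value for name, value in self.matchers.items())


def route(policy: Policy, labels: dict[str, str]) -> str:
    """Descend into the first matching child policy; otherwise stay at this level."""
    for child in policy.children:
        if child.matches(labels):
            return route(child, labels)
    return policy.contact_point


# Example tree, invented for illustration only.
root = Policy(
    contact_point="default-email",
    children=[
        Policy("ops-pagerduty", {"team": "ops"},
               [Policy("ops-slack", {"severity": "warning"})]),
        Policy("db-oncall", {"team": "db"}),
    ],
)

critical_ops = route(root, {"team": "ops", "severity": "critical"})
warning_ops = route(root, {"team": "ops", "severity": "warning"})
unmatched = route(root, {"team": "web"})
```

An alert that matches no child policy falls back to the contact point of the nearest matching parent, which is why the alert labelled `team=web` ends up at the root policy's contact point in this sketch.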

Mezmo Logging vs Coralogix Logging: Features, Pricing and Support

Mezmo, formerly known as LogDNA, offers log analytics without any native capabilities around metrics and tracing data. While Coralogix’s full-stack observability supports logs, metrics, tracing and security data, for the purpose of this comparison with Mezmo, we will focus primarily on logs.

Testing, Observing, and Debugging RabbitMQ

RabbitMQ is a popular open-source message broker that facilitates communication between different components of a distributed system. Monitoring a RabbitMQ instance is crucial to ensure its health, performance, and reliability. Monitoring allows you to identify and address potential issues before they escalate, ensuring smooth communication between various parts of your application.