The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

OpenTelemetry Metrics: Concepts, Types & Instruments

OpenTelemetry (OTel) Metrics are part of the OpenTelemetry project, which provides tools, APIs, and SDKs for telemetry data collection. These metrics capture system performance data like request latency, error rates, resource usage, and throughput. OTel aims to standardize observability across languages and platforms, making it easier to use and integrate telemetry data. Metrics are one of three core signals of OpenTelemetry along with logs and traces.

How to achieve Observability for Microservices-based apps using Distributed Tracing?

Modern digital organizations have rapidly adopted microservices-based architecture for their applications. Microservices-based apps are built from components designed around business capabilities, each serving a specific purpose. This lets smaller engineering teams own specific services, which increases productivity. But componentization also leads to complexity. Today’s internet-scale businesses run hundreds or thousands of microservices.

CloudWatch Pricing: A Straightforward 2024 Guide

To ensure your company’s cloud-based resources remain continuously available, you need a way to monitor all your applications and quickly detect when something goes wrong, especially if you are running multiple instances and using various products. Amazon’s built-in tool, CloudWatch, lets you do just this. In this article, we’ll cover what AWS CloudWatch is, how it works, and how much it costs.

Migrating AIX to Linux

Today, everyone lives in a hybrid, multi-cloud world. The combination of continuously changing business drivers and complex, heterogeneous tech stacks means that virtually every organization has production workloads on-prem, in co-lo facilities, in private clouds, and in multiple public clouds. Moreover, your stack is probably in motion much of the time, which requires you to manage workload migrations from one environment to another (and sometimes back again) as your needs change.
Sponsored Post

Troubleshoot WiFi and Wireless Networking Issues Everywhere

In today's varied workspace dynamics, wireless networking issues can greatly impact user experience and productivity. Whether it's slow download speeds, poor wireless coverage, connectivity, or collaboration problems during virtual meetings, wireless troubleshooting is crucial to ensuring remote and office productivity.
Sponsored Post

AI engineering for AI Error Resolution

Smart engineering teams are working out how to use Large Language Models (LLMs) to solve real business problems. At Raygun, we're no exception, and we're committing our time and effort to developing AI software applications that bring value to our customers. Our first AI-powered release is AI Error Resolution (AIER), a novel Crash Reporting feature that takes debugging with ChatGPT to the next level. We know that LLMs have already dramatically increased software engineers' productivity.

The Benefits of a Single Incident Management System

How many monitoring tools do you have? Chances are you have at least two or three. One tool usually does not cover all cases, so most teams end up with a combination of self-managed and managed tools. Self-managed tools give you more control over custom configurations and cost. Managed ones take away the headache of running them yourself. Prometheus is the de facto standard for monitoring these days if you have a modern application stack and you want to manage your own monitoring.

6 Common Spanning Tree Mistakes and How to Avoid Them

Let me start by saying that spanning tree is a Good Thing. It saves you from loops, which will completely shut down a network. But it has to be configured properly to work properly. I can’t count the number of times I’ve had a client call me, desperate with a terribly broken network, and I’ve responded, “Sounds like a spanning tree problem.” There are many ways things can go wrong with spanning tree. In this article, I’ve collected six of the recurring themes.

Network Basics: Spanning Tree Protocol

For new networkers, Spanning Tree Protocol (STP) can be an intimidating topic. Many old-timers speak of spanning tree in ominous tones, recounting the time when a “spanning-tree loop” brought down the network. Some managers strictly forbid anyone from changing anything related to spanning tree, fearing a resulting service interruption. Some of the fear surrounding spanning tree is likely based on bad experiences, but some of it is based, at least partly, on ignorance.