
Simplifying Observability: Streamlining Telemetry with a Centralized Pipeline

Modern applications generate a deluge of telemetry data—logs, metrics, and traces—that hold the key to understanding system performance and reliability. However, managing this data effectively is a growing challenge for DevOps teams. Raw telemetry can overwhelm teams with complexity and noise even when collected via robust standards like OpenTelemetry.

Why didn't my Playwright test capture video?

If you use Checkly, eventually you'll be looking at an alert about something failing and wondering how to debug the failed check. For most of us, the first thing we want to see is the video of the failed check run. Sometimes, though, the check doesn’t capture video. This guide covers three common reasons a video doesn’t show up on a check run. The advice applies to Playwright in general as well as to Playwright tests running on Checkly.
Sponsored Post

Six Lessons from Production gRPC

In the half-decade since gRPC became part of our production ecosystem, we've encountered a range of challenges and discovered a few hidden pitfalls that can trip up even the most experienced teams. Below, we'll walk through some of the core lessons learned, with tips, best practices, and examples drawn straight from the trenches.

Create a status page for your production service in 5 minutes

“When are we going to tell users about this?” By the time your incident response team asks this question, it’s already too late. Communicating about downtime with your user base in the middle of an outage has three main drawbacks. Instead, it’s better to create a status page that automatically shares the status of all your services in a format that users can easily understand. You’ll build trust with your users as you proactively share service status, lessening the perceived impact of incidents.

Third party API Monitoring powered by OpenTelemetry semantics

In today’s cloud-native world, third-party APIs are everywhere. Payments, notifications, search, AI, analytics: modern applications are built on a web of external services. But what happens when one of those APIs slows down, starts throwing errors, or gets rate-limited? Suddenly, your users are facing issues, and you’re stuck asking.

Managing monthly reports with the API

On the first of every month we generate an extensive PDF report for every site. This report contains a summary of all check results for the month and is a snapshot available to you and your team via email and the Oh Dear dashboard. We keep the report history so each month can be viewed in a browser or downloaded as a PDF. This report can also be emailed to any email address - not just team members - perfect for keeping your customers informed.

Real-Time Monitoring Solutions for Modern Web Applications

Web applications have evolved from simple static sites into complex distributed systems spanning multiple servers, services, and geographical locations. This evolution has created new challenges for monitoring these applications effectively. Today's web stacks require comprehensive visibility across all layers to ensure optimal performance and reliability.

What Is an API Outage? Why It Happens and How to Avoid It

APIs are a big part of how modern applications or services work. They act as bridges, allowing systems to talk to each other and share data. Whether it's logging into an app or making an online payment, an application programming interface helps make that process smooth. But what happens when an API suddenly stops working? Even a short outage can cause a disruption. It can break features, delay operations, and impact users and businesses alike.

SQL analytics - unified querying across any API

SQL is just for querying relational data, right? Well, not necessarily! With our SQL Analytics feature, you can run SQL queries over all types of data from all kinds of backend stores. This gives you incredible flexibility and power – you can even combine different types of entity (e.g. a pull request and a pipeline run) in a single query. Equally, I could have datasets with job tickets from Jira, ServiceNow and Zendesk and combine them in a single query.

Third party API Monitoring Powered by OpenTelemetry Semantics

Is it the third-party API or my code? Your service suddenly slows down, or errors spike, and you’re stuck guessing if it’s your own logic or an external API you don’t control. We’ve seen this pain across teams: dashboards don’t tell you which vendor or endpoint is the culprit, and debugging turns into a maze of guesswork. Rate limiting, vendor errors, or integration issues often slip through until users complain.

Eliminating Flaky Tests with Traffic Replay

There are few things that can derail developer productivity and undermine your pipeline like a flaky test. Testing is the backbone of a good development process, ensuring that your code is as accurate and usable as possible. When these tests point towards faulty development, the impacts can be significant. This information is predicated on an assumption, however – the assumption that what the test says is accurate.

REST v. GraphQL v. gRPC

When it comes to building APIs and enabling communication between different software components, three prominent architectural styles and frameworks often come up: REST, GraphQL, and gRPC. Each has its own approach, strengths, and weaknesses, making them suitable for different use cases.