Operations | Monitoring | ITSM | DevOps | Cloud

Working as a remote engineer at Cribl | Building the AI Platform for Telemetry

Learn what it’s like to work as an engineer at Cribl, a remote-first company building the AI platform for IT and security data. In this recruiting video, Cribl’s engineering and support leaders share how fully distributed teams collaborate, solve hard data problems, and grow their careers while working from around the world. You’ll hear from managers and leaders in site reliability engineering, security incubation, and technical support about.

What Is Database Software? Types, Examples, and dbForge Edge Explained

Database software helps organize, manage, retrieve, and analyze data in databases. But what does that actually mean in practice? In this video, we explain what database software is using a simple library analogy, show how it helps add, edit, delete, and report on data, and break down the main types of database tools used by developers, DBAs, analysts, and technical teams. You will also see examples of well-known database software, including SSMS, MySQL Workbench, pgAdmin, Oracle SQL Developer, JetBrains DataGrip, DBeaver, and dbForge Edge.

KWhy? MSP Webinar

Most MSPs are sitting on a goldmine of data across their tools. The problem isn’t access, it’s knowing what *actually* matters… and how to use it to drive better outcomes. Join Amanda Doucette-Lachapelle and Kyle Christensen (Empath) as they walk through how to use KPIs to make smarter, more confident decisions, with real examples you can apply right away.

What's New in the Updated OnPage Enterprise Management Console

Take a quick walkthrough of what’s new in the updated OnPage Enterprise Management Console. In this video, we highlight the latest updates designed to give admins more visibility, flexibility and self-service control across critical communication workflows. You’ll see what’s new across the console, including: The updated Enterprise Management Console helps teams manage on-call schedules, critical alerts, escalation workflows and Dedicated Lines more efficiently from one centralized place.

Creating Schedule Overrides in OnPage

Learn how override schedules work in OnPage and how admins can quickly manage temporary on-call coverage changes without rebuilding the entire schedule. With OnPage overrides, teams can adjust coverage for vacations, sick days, shift swaps, after-hours changes or last-minute availability issues. During the override window, alerts are automatically routed to the covering responder. Once the override ends, the schedule returns to the regular on-call rotation.

The Data Plane Reality: OTel Scales, While Topology UX Lags

OpenTelemetry won the architectural standards battle. At scale, though, telemetry breaks more like plumbing than code. It breaks quietly, across a graph, with a blast radius you don’t understand until it’s expensive. With over 65% of organizations now running more than 10 collectors in production, hybrid deployments across Kubernetes and VMs are accelerating fast. Telemetry standardization is no longer a project milestone. It is a baseline expectation.

Service Level Agreement (SLA) Templates: Examples, Metrics, and Best Practices

How quickly should your team resolve a critical ticket, and what are the consequences when it misses the target? That is exactly where Service Level Agreements (SLAs) come into play. An SLA turns service expectations into measurable commitments by defining clear response and resolution targets. Rather than starting from scratch, an SLA template provides a structured foundation for establishing those commitments and tracking performance against agreed standards. Why does that matter?

Agent Timeline Is Now Generally Available

A few weeks ago I wrote about a customer’s refund request that stopped halfway through at 11:47 p.m. on a Tuesday night. That post walked through the 40 minutes it took to work out what happened when an agentic application had a problem: a tool retried against a rate-limited payments API, the error responses filled up the context window, and the agent gave up. The whole reason we built Agent Timeline was to turn that 40 minutes into five. To reduce MTTR. To solve the problem and get back to sleep.