Operations | Monitoring | ITSM | DevOps | Cloud

Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Why prompt engineering can be so frustrating

In this clip of The Debrief, Charlie explains why prompt engineering can be so challenging. Imagine an AI assistant that could automatically surface a whole host of useful incident response data points with just a prompt. Well, you won't need to imagine for much longer. That's exactly what we built in Assistant, one of our newest features powered by AI. In this episode, you'll hear from Charlie, the project lead for Assistant, to get a peek behind this game-changing product.

How to set up on-call compensation

Once you set up an on-call team, the next step is to decide their compensation. There might be several questions in your mind right now: "How do we fairly value on-call time?" "Is it a flat rate or hourly?" and a few others. So we are here to help you set up an on-call compensation system because we know compensating people fairly lays the foundation of a healthy business. Are you still stuck on setting up an on-call team? Read this guide first: 7 steps to set up an on-call team.

The Debrief: How we built a "game changing" AI assistant feature

Imagine an AI assistant that could automatically surface a whole host of useful incident response data points with just a prompt. Well, you won't need to imagine for much longer. That's exactly what we built in Assistant, one of our newest features powered by AI. In this episode, you'll hear from Charlie, the project lead for Assistant, to get a peek behind this game-changing product.

Conquer The Storm: Hit with Downtime? Find Solutions with StatusCast!

Ready to tackle downtime head-on? Join us in this informative video, "Conquer The Storm with StatusCast," where we explore strategies to navigate and overcome unexpected IT downtime challenges. In the fast-paced world of technology, downtime is inevitable. Whether you're a seasoned IT professional, business owner, or just curious about safeguarding your digital operations, this video is a must-watch!

Centralize, triage, and track tickets with Datadog Case Management

Complex systems require many different monitors to assess the health of their infrastructure and applications, creating a wealth of alerts that can be hard to track. Due to a lack of effective triage processes, many organizations page engineers for every alert that comes in, making it difficult to separate false positives from issues that actually require immediate attention.

Why Love A Status Page: IT Transparency & Trust

In our interconnected world of technology, where we work tirelessly even on this Valentine’s Day, the reliance of our businesses on digital platforms and services has never been greater. Amidst this, the efficiency and efficacy of large organizations depend on openness and transparency from their IT systems and the professionals managing them. One of the unsung heroes in this realm is the often-overlooked status page.