%term

DASH 2024 Keynote

Jun 26, 2024 By Datadog In Datadog

DASH is an annual conference about building and scaling the next generation of applications, infrastructure, security, and technical teams.

View Video

Datadog

Read more about DASH 2024 Keynote

Track the status of all your SLOs in Datadog

Jun 24, 2024 By Meghan Jordan In Datadog

Service level objectives, or SLOs, are a key part of the site reliability engineering toolkit. SLOs provide a framework for defining clear targets around application performance, which ultimately help teams provide a consistent customer experience, balance feature development with platform stability, and improve communication with internal and external users.

Read Post

Datadog

Read more about Track the status of all your SLOs in Datadog

Best practices for managing your SLOs with Datadog

Jun 24, 2024 By Mark Azer In Datadog

Collaboration and communication are critical to the successful implementation of service level objectives. Development and operational teams need to evaluate the impact of their work against established service reliability targets in order to improve their end user experience. Datadog simplifies cross-team collaboration by enabling everyone in your organization to track, manage, and monitor the status of all of their SLOs and error budgets in one place.

Read Post

Datadog

Read more about Best practices for managing your SLOs with Datadog

SLOs 101: How to establish and define service level objectives

Jun 24, 2024 By Mark Azer In Datadog

In recent years, organizations have increasingly adopted service level objectives, or SLOs, as a fundamental part of their site reliability engineering (SRE) practice. Best practices around SLOs have been pioneered by Google—the Google SRE book and a webinar that we jointly hosted with Google both provide great introductions to this concept. In essence, SLOs are rooted in the idea that service reliability and user happiness go hand in hand.

Read Post

Datadog

Read more about SLOs 101: How to establish and define service level objectives

Troubleshoot infrastructure faster with Recent Changes

Jun 21, 2024 By Sriram Raman In Datadog

Infrastructure changes often trigger incidents, but troubleshooting these incidents is challenging when responders have to navigate through multiple tools to correlate telemetry with configuration changes. This lack of unified observability leads to longer mean time to resolution (MTTR), greater operational stress, and ultimately, negative business outcomes.

Read Post

Datadog

Read more about Troubleshoot infrastructure faster with Recent Changes

Troubleshoot infrastructure issues faster with Resource Changes

Jun 21, 2024 By Sriram Raman In Datadog

Infrastructure changes often trigger incidents, but troubleshooting these incidents is challenging when responders have to navigate through multiple tools to correlate telemetry with configuration changes. This lack of unified observability leads to longer mean time to resolution (MTTR), greater operational stress, and ultimately, negative business outcomes.

Read Post

Datadog

Read more about Troubleshoot infrastructure issues faster with Resource Changes

Diagnose runtime and code inefficiencies in production by using Continuous Profiler's timeline view

Jun 20, 2024 By Guillaume Turbat In Datadog

When you face issues like reduced throughput or latency spikes in your production applications, determining the cause isn’t always straightforward. These kinds of performance problems might not arise for simple reasons such as under-provisioned resources; often, the root of the problem lies deep within an application’s runtime execution.

Read Post

Datadog

Read more about Diagnose runtime and code inefficiencies in production by using Continuous Profiler's timeline view

Troubleshoot and optimize data processing workloads with Data Jobs Monitoring

Jun 20, 2024 By Fionce Siow In Datadog

Data is central to any business: it powers mission-critical applications, informs business decisions, and supports the growing adoption of AI/ML models. As a result, data volumes are only increasing, and teams rely on engines like Apache Spark and managed platforms like Databricks or Amazon EMR to process this data at scale.

Read Post

Datadog

Read more about Troubleshoot and optimize data processing workloads with Data Jobs Monitoring

Monitor your AWS generative AI Stack with Datadog

Jun 18, 2024 By Datadog In Datadog

As organizations increasingly leverage generative AI in their applications, ensuring end-to-end observability throughout the development and deployment lifecycle becomes crucial. This webinar showcases how to achieve comprehensive observability when deploying generative AI applications on AWS using Amazon Bedrock and Datadog.

View Video

Datadog

Read more about Monitor your AWS generative AI Stack with Datadog

Remediate Google Cloud issues with new actions in Workflow Automation and App Builder

Jun 18, 2024 By Syed Sarjeel Yusuf In Datadog

Datadog Actions help you respond to alerts and manage your infrastructure directly from within Datadog. This can be done by creating workflows that automate end-to-end processes or by using App Builder to build resource management tools and self-serve developer platforms. With more than 550 available actions, Datadog Actions offers capabilities such as creating Jira tickets, resizing autoscaling groups, and triggering GitHub pipelines.

Read Post

Datadog

Read more about Remediate Google Cloud issues with new actions in Workflow Automation and App Builder

Operations | Monitoring | ITSM | DevOps | Cloud

DASH 2024 Keynote

Track the status of all your SLOs in Datadog

Best practices for managing your SLOs with Datadog

SLOs 101: How to establish and define service level objectives

Troubleshoot infrastructure faster with Recent Changes

Troubleshoot infrastructure issues faster with Resource Changes

Diagnose runtime and code inefficiencies in production by using Continuous Profiler's timeline view

Troubleshoot and optimize data processing workloads with Data Jobs Monitoring

Monitor your AWS generative AI Stack with Datadog

Remediate Google Cloud issues with new actions in Workflow Automation and App Builder

Monthly Archive

Follow Us