Monthly Archive

How Complyt used Datadog's Cloud Cost Management to reduce their cloud spend

Feb 28, 2024 By Datadog In Datadog

Learn how the team at Complyt was able to integrate Cloud Cost Managament in a matter of hours and quickly pinpoint underutilized services to cut their cloud spend in half. CCM delivers cost data where engineers work and with resource-level context like CPU, memory, and requests — easily scoped to their services and applications — so that they can take action and spend effectively.

View Video

Datadog

Read more about How Complyt used Datadog's Cloud Cost Management to reduce their cloud spend

Troubleshoot anomalies in workload performance with Watchdog Insights and Alerts for Live Processes

Feb 26, 2024 By Ivan Ilichev In Datadog

Processes—the service workloads that run on your infrastructure—are the building blocks of your application, and it’s critical to know how well they operate at every level of the stack. Degraded process performance can lead to downtime for your mission-critical services, resulting in loss of customer trust and potentially impacting revenue for the business.

Read Post

Datadog

Read more about Troubleshoot anomalies in workload performance with Watchdog Insights and Alerts for Live Processes

How to monitor etcd with Datadog

Feb 23, 2024 By David Lentz In Datadog

So far in this series, we’ve walked through key etcd metrics and tools you can use to monitor etcd metrics and logs. In this post, we’ll show you how you can monitor etcd with Datadog, including how to: But first, we’ll show you how to set up and configure the Datadog Agent and Cluster Agent to send etcd monitoring data to your Datadog account.

Read Post

Datadog

Read more about How to monitor etcd with Datadog

Tools for collecting etcd metrics and logs

Feb 23, 2024 By David Lentz In Datadog

In Part 1 of this series, we looked at how etcd works and the role it plays in managing the state of a Kubernetes cluster. We also explored key etcd metrics you should monitor to ensure the health and performance of your etcd cluster. In this post, we’ll show you how you can use tools like Prometheus, Grafana, and etcdctl to collect and visualize etcd metrics. We’ll also show you how to collect etcd logs that provide context for those metrics.

Read Post

Datadog

Read more about Tools for collecting etcd metrics and logs

Key metrics for monitoring etcd

Feb 23, 2024 By David Lentz In Datadog

Etcd is a distributed key-value data store that provides highly available, durable storage for distributed applications. In Kubernetes, etcd functions as part of the control plane, storing data about the actual and desired state of the resources in a cluster. Kubernetes controllers use etcd’s data to reconcile the cluster’s actual state to its desired state. This series focuses on monitoring etcd in Kubernetes.

Read Post

Datadog

Read more about Key metrics for monitoring etcd

Monitor the Windows Registry with Datadog

Feb 23, 2024 By Nicholas Thomson In Datadog

The Windows Registry is a centralized key-value database that stores permissions, user data, and configuration settings for the Windows operating system and many Windows native applications. The keys stored in the registry provide a granular view into the processes occurring on a Windows host, such as certificate expirations, security checks, and pending reboots.

Read Post

Datadog

Read more about Monitor the Windows Registry with Datadog

Measure long-term user engagement with Datadog Retention Analysis

Feb 15, 2024 By Addie Beach In Datadog

It’s relatively easy to study the immediate impact of new releases by analyzing short-term changes in user behavior or system activity. However, this information doesn’t tell you much about the long-term viability of your application, which depends less on the novelty of major application updates and more on sustained usability.

Read Post

Datadog

Read more about Measure long-term user engagement with Datadog Retention Analysis

Why ngrok Prioritized a Datadog Integration for Streamlined Monitoring of HTTP Events

Feb 14, 2024 By Datadog In Datadog

ngrok delivers instant ingress to your applications in any cloud, private network, or devices with authentication, load balancing, and other critical controls using their global points of presence. Hear from Chad Tindel, Field CTO & VP WW Solution Architecture, on why Datadog was their most requested integration and how it provides an easy pathway to ship application and traffic logs into one unified observability platform.

View Video

Datadog

Read more about Why ngrok Prioritized a Datadog Integration for Streamlined Monitoring of HTTP Events

Datadog Conversations: Toyota's Shift to Software-First Mobility

Feb 13, 2024 By Datadog In Datadog

As the world’s largest automotive manufacturer and the leading software-first mobility company, Toyota leans on Datadog to achieve its goals of delivering value to customers and uplifting employees with new technologies and processes. Jason Ballard, IT Executive and General Manager, shares his top priorities for the enterprise in North America and offers his advice for how other leaders in the industry can transform their business.

View Video

Datadog

Read more about Datadog Conversations: Toyota's Shift to Software-First Mobility

Datadog Incident Management Demo

Feb 12, 2024 By Datadog In Datadog

With Incident Management, Datadog provides a unified platform to seamlessly detect, investigate and manage incidents from end-to-end, helping you to streamline processes and quickly mobilize the right teams for faster incident resolution.

View Video

Datadog

Read more about Datadog Incident Management Demo

Centralize, triage, and track tickets with Datadog Case Management

Feb 12, 2024 By Kai Xin Tai In Datadog

Complex systems require many different monitors to assess the health of their infrastructure and applications, creating a wealth of alerts that can be hard to track. Due to a lack of effective triage processes, many organizations page engineers for every alert that comes in, making it difficult to separate false positives from issues that actually require immediate attention.

Read Post

Datadog

Read more about Centralize, triage, and track tickets with Datadog Case Management

Analyze the root causes and business impact of production issues with Trace Queries

Feb 12, 2024 By Antoine Dussault In Datadog

Tracing provides indispensable insights into the state and performance of distributed applications, but it can often be difficult to determine the root cause or ultimate business impact of issues indicated by traces. Translating visibility of individual microservices into broader performance insights often requires drawing complex correlations between spans. This can be a laborious process, which can complicate everything from troubleshooting and triage to tracking KPIs and managing costs.

Read Post

Datadog

Read more about Analyze the root causes and business impact of production issues with Trace Queries

Quickly spot and revert faulty deployments with Change Overlays

Feb 9, 2024 By Meghan Lo In Datadog

Faulty deployments and other types of erroneous changes may account for around 70% of all application outages. With the prevalence of CI/CD workflows, engineering teams make changes to their applications, services, and infrastructure all the time, which can make it difficult to trace issues to specific changes.

Read Post

Datadog

Read more about Quickly spot and revert faulty deployments with Change Overlays

Datadog on Kubernetes Autoscaling

Feb 9, 2024 By Datadog In Datadog

Datadog, the observability platform used by thousands of companies, runs on dozens of self-managed Kubernetes clusters in a multi-cloud environment, adding up to tens of thousands of nodes, or hundreds of thousands of pods. Also, this infrastructure is used by a wide variety of engineering teams at Datadog, with different features and capacity needs that may also change overtime.

View Video

Datadog

Read more about Datadog on Kubernetes Autoscaling

This Month in Datadog: Dynamic Instrumentation, Log Pipeline Scanner, Network Device map, and more

Feb 9, 2024 By Datadog In Datadog

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. This month, we put the Spotlight on Dynamic Instrumentation..

View Video

Datadog

Read more about This Month in Datadog: Dynamic Instrumentation, Log Pipeline Scanner, Network Device map, and more

Monitor Windows Performance Counters with Datadog

Feb 8, 2024 By Nicholas Thomson In Datadog

The Windows operating system exposes metrics such as CPU, memory, and disk usage as built-in performance counters, which provide a unified way to observe performance, state, and other high-level facets of Windows subsystems, components, and native or third-party applications. As such, Windows Performance Counters can be invaluable for monitoring resource usage and the health of your infrastructure, as well as systems your services are using.

Read Post

Datadog

Read more about Monitor Windows Performance Counters with Datadog

Track and alert on Amazon CloudWatch Network Monitor metrics with Datadog

Feb 7, 2024 By Nicholas Thomson In Datadog

Amazon CloudWatch Network Monitor, available as part of Amazon CloudWatch, is a network monitoring service that enables you to create customizable monitors for your network connectivity from AWS to on-premises infrastructure via AWS Direct Connect (DX).

Read Post

Datadog

Read more about Track and alert on Amazon CloudWatch Network Monitor metrics with Datadog

Monitor your OpenStack components with Datadog

Feb 6, 2024 By Candace Shamieh In Datadog

OpenStack is an open source cloud platform that enables customers to provision and manage compute, storage, and networking resources via web-based dashboards or APIs. OpenStack offers a range of services beyond standard infrastructure-as-a-service functionality, including orchestration, fault management, and service management components. These components help customers build, maintain, and scale high-availability applications.

Read Post

Datadog

Read more about Monitor your OpenStack components with Datadog

How Wawa Improves Customer Engagement using Datadog Real User Monitoring (RUM)

Feb 6, 2024 By Datadog In Datadog

Learn how Wawa's engineering team incentivized customers to increase spending in stores using Datadog Real User Monitoring. RUM gave them a holistic view to understand customer patterns and provide the right in-store incentives.

View Video

Datadog

Read more about How Wawa Improves Customer Engagement using Datadog Real User Monitoring (RUM)

How Autodesk engineers better service and own their infrastructure.

Feb 5, 2024 By Datadog In Datadog

Morgan Goose, Autodesk, shares how he and his team have democratized observability and made it a default offering for all their engineers. Autodesk is a global leader in software for people who design and make the world. That includes software for architects, builders, engineers, 3D artists, and production teams. To ensure the best customer experience, Autodesk has partnered with Datadog and is taking advantage of products like DBM to quickly identify and maintain the systems they instrument.

View Video

Datadog

Read more about How Autodesk engineers better service and own their infrastructure.

Visually replay user-facing issues with Zendesk and Datadog Session Replay

Feb 2, 2024 By Jamie Milstein In Datadog

Zendesk provides support teams with an integrated solution for processing all types of customer inquiries and feedback. But as organizations scale, support tickets can multiply, making it difficult to parse customer feedback and investigate issues promptly and thoroughly. Customers often report problems without providing the detailed context needed for effective troubleshooting.

Read Post

Datadog

Read more about Visually replay user-facing issues with Zendesk and Datadog Session Replay

Operations | Monitoring | ITSM | DevOps | Cloud

How Complyt used Datadog's Cloud Cost Management to reduce their cloud spend

Troubleshoot anomalies in workload performance with Watchdog Insights and Alerts for Live Processes

How to monitor etcd with Datadog

Tools for collecting etcd metrics and logs

Key metrics for monitoring etcd

Monitor the Windows Registry with Datadog

Measure long-term user engagement with Datadog Retention Analysis

Why ngrok Prioritized a Datadog Integration for Streamlined Monitoring of HTTP Events

Datadog Conversations: Toyota's Shift to Software-First Mobility

Datadog Incident Management Demo

Centralize, triage, and track tickets with Datadog Case Management

Analyze the root causes and business impact of production issues with Trace Queries

Quickly spot and revert faulty deployments with Change Overlays

Datadog on Kubernetes Autoscaling

This Month in Datadog: Dynamic Instrumentation, Log Pipeline Scanner, Network Device map, and more

Monitor Windows Performance Counters with Datadog

Track and alert on Amazon CloudWatch Network Monitor metrics with Datadog

Monitor your OpenStack components with Datadog

How Wawa Improves Customer Engagement using Datadog Real User Monitoring (RUM)

How Autodesk engineers better service and own their infrastructure.

Visually replay user-facing issues with Zendesk and Datadog Session Replay

Monthly Archive

Follow Us