Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Application Performance Monitoring and related technologies.

Datadog on Kafka

As a company, Datadog ingests trillions of data points per day. Kafka is the messaging persistence layer underlying many of our high-traffic services. Consequently, our Kafka usage is quite high: double-digit gigabytes per second bandwidth and the need for petabytes of high performance storage, even for relatively short retention windows. In this episode, we’ll speak with two engineers responsible for scaling the Kafka infrastructure within Datadog, Balthazar Rouberol and Jamie Alquiza. They'll share their strategy in scaling Kafka, how it’s been deployed on Kubernetes, and introduce kafka-kit; our open source toolkit for scaling Kafka clusters. You'll leave with lessons learned while scaling persistent storage on modern orchestrated infrastructure, and actionable insights you can apply at your organization

Citrix Admins can now track Citrix Connection Quality without Needing any Client Software

Citrix technologies are often used by remote employees or collaborators to access corporate applications and desktops. Citrix access is session oriented – a session is established at logon time and a connection between the client and the server/desktop being accessed is maintained for the duration of the session. User access to Citrix apps and desktops is highly interactive – mouse clicks, keystrokes, etc. all have to go from the client to the server/desktop to be processed.

Why Stack Trace APM Isn't Enough for Complete Web Application Monitoring

It’s probably true to say that if you asked an average user what makes a great web application, they’d probably say “speed.” But speed is the probably the least important aspect of an extensive rundown of elements. Factors like application development and rendering in the program are probably higher on that list. And what makes up a great performing application? And when something goes wrong, how do you know?

Introduction to Site Reliability Engineering

In this session, we start with the basics of SRE, including some common terminology and theory, then dive into practical examples—including lessons learned from our own journey here at Datadog. We discuss the relationship between SRE and DevOps, what success looks like (and how to measure it), and how to identify and nurture both internal and external talent in order to build a cross-functional team. SRE is a large, complex topic, so the session ends with a live Q&A and deep-dive into some great topics.

How to Create a Graph Using Tags and Time Aggregation | Datadog Tips & Tricks

In this video, you’ll learn how to use tag-based grouping and time aggregation (with the rollup function) to create actionable time-series graphs. Datadog offers various ways to manipulate your metric graphs so that you can create graphs that are specific and actionable for all of your use cases. Two methods of doing this—as explored in this video—are tag-based grouping and time aggregation.

How To Monitor Containers in Real-Time with Datadog Live Containers | Datadog Tips & Tricks

In this video, you’ll learn how to utilize Datadog’s Live Container View to monitor and troubleshoot container performance underlying your applications. Datadog makes it easy to monitor ephemeral, containerized infrastructure. In this video you’ll learn how to leverage Datadog’s Live Container View to effectively dive into your container health. Using this view, you can sort and group your containers by tags or labels imported from Kubernetes, such as container name.

Citrix Analytics for Performance - Where does it fit in relation to Citrix Performance Monitoring and Management?

I recently published an article on Citrix Performance Analytics (also called Citrix Analytics for Performance), part of the Citrix Analytics service in Citrix Cloud. In this article, i will analyze where Citrix Performance Analytics fits in relative to other Citrix performance monitoring and management tools in the market.

Synthetic Monitoring vs. APM Stack Trace Tools

The complexity of an application’s digital architecture is increasing dramatically every day. In an era of cloud infrastructure, the goal is to integrate all your web services into one place: CDN, DNS, third-party API services, QA tools, analytics tools, and any other component you can think of, are working with each other to make your services function. With so many components in place for an application to run, each one of these behaves like its own black box within your IT infrastructure.