Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring and Improving Employee Experience In Virtual Desktop (DaaS/VDI) Environments (Part 1)

A common pain point we repeatedly hear from our customers that use Desktop as a Service (DaaS)/Virtual Desktop Infrastructure (VDI) environments is, “We have monitoring in place for physical hosts and infrastructure, but our employees still complain a lot.” If DaaS or VDI is part of your IT environment and you lack visibility into such environments to ensure effective employee experience, read on.

How Slack Transformed Their CI With Tracing

Slack experienced meteoric growth between 2017 and 2020—but that level of growth came with growing pains. In his talk at the 2021 o11ycon+hnycon, Frank Chen (LinkedIn), a Slack Senior Staff Engineer, detailed one of Slack’s biggest pain points in that period: flaky tests. A flaky test returns both a passing and failing result despite no changes in the code. At one point, between 2017 and 2020, Slack’s flaky test rate reached as high as 50%.

Practical CPU time performance tuning for security software: Part 2

In a previous blog, we discussed how to monitor, troubleshoot, and fix high %CPU issues. We also revealed a system API that could have an unexpected impact on CPU consumption. In this episode, we’ll discuss another time-related performance aspect that is unique to security software: application startup time. You don’t need to be a developer to benefit from this article.

Debugging with Dashbird: AWS Lambda Process Exited Before Completing Request

Another generic error message from our favorite FaaS provider AWS Lambda. And again, there are multiple reasons why this issue could arise. Let’s first look at the basics of AWS Lambda to get a better intuition for when things go wrong later. Lambda is an asynchronous event-based service at heart.

Why Serverless Apps Fail and How to Design Resilient Architectures?

We’ve been monitoring 100,000’s of serverless backend components for 3+ years at Dashbird. In our experience, Serverless infrastructure failures boil down to: These isolated faults become causes of failure due to dependencies in our cloud architectures (ref. Difference of Fault vs. Failure). If a serverless Lambda function relies on a database that is under stress, the entire API may start returning 5XX errors.

Implementing an Internal Developer Platform

In a previous post, we discussed what an Internal Developer Platform (IDP) is and some drivers behind IDP initiatives. If we go through our interactions with different organizations, we see teams embarking on the journey to build their IDPs mainly driven by the following requirements: While building an IDP may seem like an obvious choice and initiative, it is definitely not an easy task to accomplish. Building an IDP involves dealing with many moving components.

Four Industries That Will Be Disrupted by AI in 2021

With the never-ending potential of technology to disrupt everyday processes, more and more industries are deciding to adapt to one exciting area of innovation today: artificial intelligence (AI). In fact, Global Industry Analysts Inc. predicts that AI will be worth 164.03 billion GBP by 2026, and here, we look at four industries set to be disrupted by AI. Since the healthcare sector collects and greatly depends on personal data from their patients, AI will play a crucial role in data management.

How to Move Kubernetes Logs to S3 with Logstash

Sometimes, the data you want to analyze lives in AWS S3 buckets by default. If that’s the case for the data you need to work with, good on you: You can easily ingest it into an analytics tool that integrates with S3. But what if you have a data source — such as logs generated by applications running in a Kubernetes cluster — that isn’t stored natively in S3? Can you manage and analyze that data in a cost-efficient, scalable way? The answer is yes, you can.

DHCP scope

It’s possible to have more than one DHCP server on the same network and still have everything work right, with no conflicts and no dropped packets or IP requests. It’s really not that hard to pull together, either, but there are some things to know, and some things to consider before we investigate that situation. For this blog, we’ll put some of the overlooked facets of DHCP in bold text. Let’s take a look.

TL;DR InfluxDB Tech Tips - Optimizing Flux Performance in InfluxDB Cloud

So you’re using InfluxDB Cloud and you’re taking full advantage of Flux to create custom data processing tasks, checks, and notifications. However, you notice that some of your Flux scripts aren’t executing as quickly as you expect. In this post, we’ll learn about best practices and tools for optimizing Flux performance.