Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

Splunk: My Start Will Go On: Splunk's TA for Windows Part 2

Join us for part two of our Windows TA Tech Talk, where we're focusing on our technical add-on (TA) for Windows OS. This TA for Windows makes management of many data sources-eventlog, performance sources, registry and of course standard logfiles-easier. It offers CIM compliant knowledge objects, normalizing your data and providing a unified view across the entire data domain.

Splunk: Cloud Data Modeling for Security

Are you trying to achieve end-to-end visibility across your multi-cloud or hybrid environment but running into roadblocks? This tech talk addresses the challenge of normalizing data from the 3 major cloud service providers' implementations, and establishing a set of security checks across them. Join us to learn how to implement a unified framework within data analytics tools that can be used for cloud monitoring, investigation, detection and response.

Launching Desktop Central Cloud: Embrace UEM the SaaS way!

Desktop Central is a holistic unified endpoint management (UEM) solution that offers a dynamic approach to securing and managing user devices, including desktops, laptops, smartphones, and tablets. Already established as a leader in the UEM field, ManageEngine adds another feather to its cap by now offering a cloud-based UEM solution. Desktop Central Cloud gives you 360-degree control over all your network endpoints.

Diagnosing out-of-memory errors on Linux

Out-of-memory (OOM) errors take place when the Linux kernel can’t provide enough memory to run all of its user-space processes, causing at least one process to exit without warning. Without a comprehensive monitoring solution, OOM errors can be tricky to diagnose. In this post, you will learn how to use Datadog to diagnose OOM errors on Linux systems.

How to monitor Golden signals in Kubernetes

What are Golden signals metrics? How do you monitor golden signals in Kubernetes applications? Golden signals can help to detect issues of a microservices application. These signals are a reduced set of metrics that offer a wide view of a service from a user or consumer perspective, so you can detect potential problems that might be directly affecting the behaviour of the application.

Embed Your Status Page Everywhere

A well-crafted status page is designed to save you time, energy, and resources when communicating service irregularities. Instead of fielding thousands of support requests when you experience an outage, a status page provides a self-service way for your customers to get up-to-the minute information about any current downtime. It also allows you to proactively communicate maintenance and other work in advance.

Where did all my spans go? A guide to diagnosing dropped spans in Jaeger distributed tracing

Nothing is more frustrating than feeling like you’ve finally found the perfect trace only to see that you’re missing critical spans. In fact, a common question for new users and operators of Jaeger, the popular distributed tracing system, is: “Where did all my spans go?” In this post we’ll discuss how to diagnose and correct lost spans in each element of the Jaeger span ingestion pipeline.