Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

How To Get The Most Out Of The Linux Screen Command

If you’re logging onto a service or running remote command line operations over a network link via the Secure Shell (SSH) protocol, the last thing you need is for your session to be cut off by a faulty connection. This scenario is all too common – but for Linux users, the Screen utility can prevent it from occurring.

DevOps Metrics: 7 KPIs to Evaluate Your Team's Maturity

Measuring the maturity of your DevOps team might sound difficult, but it isn’t at all. Simple key performance indicators (KPIs), such as the deployment success rate or mean time between failure, give a good indication of the maturity of your DevOps team. By “mature,” I mean that your team consistently and smoothly operates at a high level and can deploy several times a day with very little risk.

Creating a Custom Container for the Deep Learning Toolkit: Splunk + Rapids.ai

The Deep Learning Toolkit (DLTK) was launched at .conf19 with the intention of helping customers leverage additional Deep Learning frameworks as part of their machine learning workflows. The app ships with four separate containers: Tensorflow 2.0 - CPU, Tensorflow 2.0 GPU, Pytorch and SpaCy. All of the containers provide a base install of Jupyter Lab & Tensorboard to help customers develop and create neural nets or custom algorithms.

Best Practices for Using Splunk Workload Management

Workload management is a powerful Splunk Enterprise feature that allows you to assign system resources to Splunk workloads based on business priorities. In this blog, I will describe four best practices for using workload management. If you want to refresh your knowledge about this feature or use cases that it solves, please read through our recent series of workload management blogs — part 1, part 2, and part 3.

The Daily Telegraf: Getting Started with Telegraf and Splunk

In this blog post, we discuss using Telegraf as your core metrics collection platform with the Splunk App for Infrastructure (SAI) version 2.0, the latest version of Splunk’s infrastructure monitoring app that was recently announced at Splunk .conf19. This blog post assumes you already have some familiarity with Telegraf and Splunk. We provided steps and examples to make sense of everything along the way, and there are also links to resources for more advanced workflows and considerations.

Rancher: Kubernetes Master Class: Security Best Practices -- How to harden your Kubernetes Clusters

As enterprises accelerate their adoption of containers and Kubernetes, they need to take necessary steps to protect such a critical part of their compute infrastructure. But not everyone has the same security needs, with developers and engineers often asking for different levels of granular control on specific configurations.

Rancher: Kubernetes Master Class: Troubleshooting Kubernetes

Everything breaks at some point, wether it is infrastructure (DNS, network etc) or Kubernetes itself, something will break eventually. In this session we will walk through the master components of Kubernetes, how they interact and how to troubleshoot the most common issues with Kubernetes. What parameters to use, what commands to run, how to interpret output from logging or commands are things that we will show you.