Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

The Impact of MTTR on Customer Satisfaction and Business Success

Today, businesses are increasingly reliant on their ability to provide uninterrupted service and respond swiftly to any disruptions. Whether it's a website outage, a malfunctioning application, or hardware failure, downtime can significantly affect a company's operations. Customers expect quick resolutions, and delays can result in dissatisfaction, loss of trust, and ultimately, business failure.

Join Ken on SMC Journal - Scaling Kubernetes, Microservices, and Ephemeral Environments

Check out Ken Ahrens and Scott Moore as they discuss some blockers of developer productivity when building in Kubernetes, and how removing environment and data challenges can reduce toil and frustration! You can catch the full podcast on Scott’s page here: Scott Moore: Hey everybody out there in internet meme land. It’s time to hide your kids and hide your wife because it’s time for the SMC Journal podcast. Some of you will get that joke. Others will not.

Orchestration as a Data Management Challenge-Part 3

In blog one of this series, we discussed how orchestration is about data. In blog two, we talked about how effective orchestration with advanced data management techniques can be used to develop a digital twin. In this blog, we will discuss what we can do with all that data. More specifically, we’ll take a look at using data-empowered RAG AI to deliver stateful orchestration.

Installing Karpenter: Lessons Learned From Our Experience

This article shares our experience migrating from the AWS Cluster Auto Scaler to Karpenter. We provide an overview of the steps we took to install Karpenter. This article is the first in a series dedicated to Karpenter. In future posts, we will cover other aspects of using Karpenter.

How to Make Better Data Center Energy Management Decisions

Data centers are among the largest consumers of energy worldwide, accounting for up to 3% of global electricity consumption, a figure expected to rise with increasing demand for computing power and services. Energy consumption is a main concern in the data center industry as managers struggle to find ways to improve overall efficiency and environmental sustainability amidst the green wave of new reporting and operating regulations.

How Does PUE Relate to Data Center Sustainability?

In a time where sustainability is a critical concern, data centers play a pivotal role in shaping the future of environmental responsibility. Power Usage Effectiveness (PUE) is a core metric in assessing the energy efficiency of data centers. But how exactly does PUE relate to sustainability, and what role does it play in the broader context of ecological responsibility?

Top 11 Cloud Observability Tools To Use In 2024

Cloud observability tools offer visibility into your cloud infrastructure. They collect data from various sources to help you understand your applications’ performance. The tools enable you to monitor, optimize, and troubleshoot your cloud environment. In this guide, we’ll share why cloud observability is important and the best tools to consider.

From Basic Monitoring to Modern Observability: Shifting Right and Observability as Code

I've been in the observability market long before it even had that name. Over the years, observability has undergone a significant transformation. As someone who has witnessed these changes firsthand, I can attest to the dynamic nature of this field. In the early days, it was largely about basic monitoring: tracking system metrics, lots of logs, and simple alerts.