
The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Observability in the Cloud-Native Age: Announcing the DevOps Pulse 2020

It’s that time of year again: the DevOps Pulse 2020 is here! Last year, nearly 1,000 engineers around the world shared their insights in the DevOps Pulse 2019, giving us the community’s perspective on the growth and challenges associated with observability, cloud monitoring and more. As last year’s DevOps Pulse showed, observability remains a major challenge for many organizations.

Multi-Cluster Vulnerability Scanning with Alcide and Rancher

Kubernetes provides the freedom to rapidly build and ship applications while dramatically shortening deployment and service update cycles. However, that velocity requires a new approach: integrating tools as early as possible in the deployment pipeline and inspecting code and configuration against Kubernetes security best practices. Kubernetes has many security knobs for hardening both the cluster and the applications running inside it.
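To make "inspecting configuration against security best practices" concrete, here is a minimal sketch in Python. It is not how Alcide or Rancher perform their scans; the function name audit_pod_spec and the handful of checks are assumptions chosen for illustration, and the input is assumed to be a pod spec already parsed into a dictionary.

```python
from typing import Any, Dict, List


def audit_pod_spec(pod_spec: Dict[str, Any]) -> List[str]:
    """Return human-readable findings for a parsed Kubernetes pod spec.

    Hypothetical helper for illustration only: it checks a few widely
    cited hardening settings and is far from a complete scanner.
    """
    findings: List[str] = []
    pod_sc = pod_spec.get("securityContext") or {}

    # Sharing the host network namespace weakens pod isolation.
    if pod_spec.get("hostNetwork"):
        findings.append("pod: hostNetwork is enabled")

    for container in pod_spec.get("containers", []):
        name = container.get("name", "<unnamed>")
        sc = container.get("securityContext") or {}

        # Privileged containers get broad access to the host.
        if sc.get("privileged"):
            findings.append(f"{name}: privileged container")

        # Flag the setting unless it is explicitly turned off.
        if sc.get("allowPrivilegeEscalation", True):
            findings.append(f"{name}: allowPrivilegeEscalation not disabled")

        # The container-level value wins; otherwise use the pod-level one.
        if not sc.get("runAsNonRoot", pod_sc.get("runAsNonRoot", False)):
            findings.append(f"{name}: runAsNonRoot not set")

        # Missing limits let one container starve its neighbours.
        if "limits" not in (container.get("resources") or {}):
            findings.append(f"{name}: no resource limits")

    return findings
```

Run early in a pipeline, a check like this could fail the build whenever the returned list is non-empty, which is the "as early as possible" idea described above.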

What is AWS Amplify?

Amazon Web Services is the world’s biggest cloud platform, and businesses of all shapes and sizes rely on it every day to run their operations. You may find this surprising, but AWS accounts for more than half of Amazon’s operating income. Amazon therefore has a vested interest in getting as many people to use AWS as possible, so it offers a wide range of tools that make the platform easier to use. AWS Amplify is one of these.

Datadog and Relay for Incident Response

Datadog is an awesome tool for aggregating and visualizing the metrics that matter to you. Recently, Datadog launched a new Incident Management feature, which allows you to coordinate the activities around a problem that affected your service. In this example, I’ll walk through using Relay to roll back a Kubernetes deployment that caused a service impact, and show how the Datadog Incident timeline can keep everyone working on the incident in sync.

How to Avoid SLA-Killing, Budget-Busting Cloud Performance Problems

There are lots of excellent reasons to move applications into the public cloud. But those benefits cannot come at the expense (pun intended) of performance. Your SLAs, whether explicitly stated and written into contracts or implicitly promised through your commitment to quality, are part of your brand. Falling short is costly. Even if you don’t have to pay penalty fees, your reputation and customer loyalty can take a hit.

Monitor containers on Amazon Bottlerocket with Datadog

Amazon’s Bottlerocket is a new Linux-based open-source operating system that’s designed with containers in mind. Bottlerocket is optimized and stripped down to only the essential software needed to run containers. You can apply updates to Bottlerocket in a single step, and roll them back instantly if necessary. And, because it’s open-source, you can customize the operating system to fit your specific needs.