Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

What Is BGP and Why Is It Important?

When you send an email or load a website, you probably never think about how the data gets from your computer to the server that needs to process it. But something does have to decide how the data will move across the vast expanse of the Internet – and, in particular, which of the virtually infinite number of potential routes your data will take as it moves from your device to a server and back again.

Upsun is now using annual carbon intensities from Electricity Maps

One of the value propositions of Upsun is to offer greener hosting. By choosing cloud hosting instead of on-premise hosting, you are selecting a greener way to host your applications and websites. In addition, when creating a project, you have the option to choose the greenest region to host it. How?

Mastering Kubernetes Networking with Cilium

In a recent meetup I hosted alongside Kunal Kushwaha, we discussed Cilium, an eBPF-powered open-source cloud-native networking solution that offers security, observability, scalability, and superior performance. Throughout this blog I will explore how the increased usage of Kubernetes has led to the need for advanced networking, security, and observability solutions. This will allow us to take a closer look at how Cilium can benefit Kubernetes users.

Infrastructure is Fundamental: Learn Your Hybrid Cloud ABCs

In 21st-century business, computing is what makes daily operations, competitive advantage, and strategic growth possible. The foundation that enables this is a hybrid cloud infrastructure that supports business requirements, delivers a suitable user experience, and stays on budget. Mastering the ABCs of infrastructure performance management (IPM) will put you on the road to long-term success.

The Azure Hybrid tour

In this episode of our podcast, join us as we explore the world of hybrid and multi-cloud with Thomas, Senior Program Manager & Chief Evangelist for Azure Hybrid at Microsoft. Discover the significance of hybrid and multi-cloud in the enterprise market, learn about Microsoft's solutions for on-premise to cloud transition, and gain insights into the challenges and benefits of multi-cloud strategies. Thomas also shares valuable advice on continuous learning and embracing change in the tech industry. Tune in now for a deep dive into the dynamic world of Azure and cloud technology.

Getting started with Squadcast's On-Call Scheduling

We understand that everyone values a simple and straightforward approach when it comes to setting up schedules. We at Squadcast are fully aware of the difficulties involved in creating an on-call schedule from scratch or migrating it to a new platform. Hence we have come up with a blog to assist you in seamlessly setting up your on-call schedule using Squadcast. Our goal is to provide guidance and support to make the process as effortless as possible for you.

Prometheus Blackbox Exporter: Guide & Tutorial

Prometheus is a favored open-source monitoring system that collects, stores, and queries metrics from various sources. In Prometheus, an exporter is a component that collects and exposes metrics in a format Prometheus can scrape. The Prometheus Blackbox Exporter is designed to monitor “black box” systems with internal workings that are not accessible by Prometheus. It sends HTTP, TCP, and ICMP requests to the external systems and measures their response times and statuses.
Sponsored Post

Prometheus Sample Alert Rules

Prometheus is a robust monitoring and alerting system widely used in cloud-native and Kubernetes environments. One of the critical features of Prometheus is its ability to create and trigger alerts based on metrics it collects from various sources. Additionally, you can analyze and filter the metrics to develop: In this article, we look at Prometheus alert rules in detail. We cover alert template fields, the proper syntax for writing a rule, and several Prometheus sample alert rules you can use as is. Additionally, we also cover some challenges and best practices in Prometheus alert rule management and response.

The Swedbank Outage shows that Change Controls don't work

This week I’ve been reading through the recent judgment from the Swedish FSA on the Swedbank outage. If you’re unfamiliar with this story, Swedbank had a major outage in April 2022 that was caused by an unapproved change to their IT systems. It temporarily left nearly a million customers with incorrect balances, many of whom were unable to meet payments.