Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Cloud monitoring, security and related technologies.

Are Digital Transformation Initiatives Secretly to Blame for Rising Number of IT Outages?

The race is on to turn data into useful information. Company executives across a wide range of departments are increasingly tasked with modernization and digital transformation initiatives. But what is digital transformation in IT? IT pros are racing to keep up with rapid industry changes and must consider efforts such as digitizing assets, restructuring department goals to focus on providing a seamless customer experience, or shifting from on-premises to public cloud.

Azure Monitor (Part 1): What is it and how does it work?

Join me on my Azure Monitor journey as I learn all there is to know about the platform. Check out my intro note here for a brief series overview and a bit about me (tl;dr former SCOM admin, avid tech blogger, SquaredUp tech evangelist). We’ll start with the basics and dive deeper as we go along. Buckle up, your journey to becoming an Azure Monitor superhero starts here!

How to get the most out of your ELB logs

Amazon ELB (Elastic Load Balancing) allows you to make your applications highly available by using health checks and intelligently distributing traffic across a number of instances. It distributes incoming application traffic across multiple targets, such as Amazon EC2 instances, containers, IP addresses, and Lambda functions. You might have heard the terms, CLB, ALB, and NLB. All of them are types of load balancers under the ELB umbrella.

AI-Based remediation for cloud incidents? We are on our way there

“Today, maintaining a high performance and continuous availability of cloud production environments are some of the most significant pain points for technology organizations. Production availability is commonly measured and managed by evaluating some of the following dimension: Does the product or service work fast enough? Does its performance meet the expected metrics? Is it able to deliver the optimal customer experience?

Key metrics for monitoring AWS Lambda

AWS Lambda is a compute service that enables you to build serverless applications without the need to provision or maintain infrastructure resources (e.g., server capacity, network, security patches). AWS Lambda is event driven, meaning it triggers in response to events from other services, such as API calls from Amazon API Gateway or changes to a DynamoDB table.

Tools for collecting AWS Lambda data

In Part 1 of this series, we discussed AWS Lambda functions and some key metrics for monitoring them. In this post, we’ll look at using Amazon’s native tooling to query those metrics. We’ll also show you how to collect logs and traces that provide further visibility into your Lambda functions. Amazon provides built-in monitoring functionality through CloudWatch and X-Ray.

Monitoring AWS Lambda with Datadog

In Part 2 of this series, we looked at how Amazon’s built-in monitoring services can help you get insights into all of your AWS Lambda functions. In this post, we’ll show you how to use Datadog to monitor all of the metrics emitted by Lambda, as well as function logs and performance data, to get a complete picture of your serverless applications. In this post, we will: Datadog integrates with AWS Lambda and other services such as Amazon API Gateway, S3, and DynamoDB.

Introducing Lambda Enhanced Metrics

AWS Lambda decouples the need to provision and maintain a runtime environment from running code, allowing developers to focus on applications rather than infrastructure. But, by abstracting away the underlying infrastructure of an application, serverless architectures introduce new challenges into monitoring and observability.