Operations | Monitoring | ITSM | DevOps | Cloud

Stream Your AWS Services Metrics to Splunk

Amazon Web Services (AWS) recently announced the launch of CloudWatch Metric Streams. Cloudwatch Streams can stream metrics from a number of different AWS resources using Amazon Kinesis Data Firehose to target destinations. The new service is different from the current architecture. Instead of polling, metrics are delivered via an Amazon Kinesis Data Firehose stream. This is a highly scalable and far more efficient way to retrieve AWS service metrics.

Failover Conf 2021 Wrap-Up

That’s a wrap! Gremlin hosted Failover Conf 2: Fail Smarter on April 27, 2021. In attendance were over 500 SREs, developers, sales engineers, product managers, DevOps experts, C-level execs, and other reliability pros from around the globe! This year’s conference included discussions around the future of DevOps, strategies for building reliable teams, analyzing human error to create better systems, and more.

Cloud-Hosted of Cloud-Native? Discover Why Cloudsmith Was Born in the Cloud

Today, almost every service now is offered in a “Cloud” variant. But what does that really mean? Are all clouds services equal? It’s easy to see why so many vendors rush to add a Cloud edition/variant of established software they sell. Undoubtedly, there has been a move to Cloud services across the industry, as more and more organizations seek to take advantage of the higher reliability and lower total cost of ownership that Cloud platforms promise.

What is Java Memory Analysis

Java memory analysis is an important process in checking the performance of a Java application. It helps Java developers ensure the stability of the application by checking the memory consumption. There are several factors to look into when doing memory analysis. But to get to the bottom of this process, it is vital to learn first how memory works.

SLA Compliance: The Service Desk & ITSM Metric Explained

IT solutions are either utilized as a service or procured from third-party vendors by organizations of all types and sizes. This enables organizations to gain access to reliable IT technologies without having to internally build, operate, or manage the underlying systems. As a result of this, both the organization and the solutions provider sign a service-level agreement (SLA), which commits the vendor to deliver services that meet the established performance requirements.

New Features: Heroku Errors and a Magic Dashboard

We have been collecting Logplex data for our Heroku customers for a while now. With that data we create Magic Dashboards for Postgres and Redis integrations, and track Heroku Host Metrics. Starting today, we also extract error incidents from Heroku Logplex data and provide you with a magic dashboard for Heroku status codes.

Unravel the hidden mysteries of your cluster with the new Kubernetes Dashboards

One of the greatest challenges you may face when creating Kubernetes dashboards is getting the full picture of your cluster. Kubernetes is the de-facto standard for container orchestration, but it also has a very steep learning curve. We, at Sysdig, use Kubernetes ourselves, and also help hundreds of customers dealing with their clusters every day. We are happy to share all that expertise with you in the Kubernetes Dashboards.

Using Distributed Tracing in Microservices Architecture

With the rise of microservices based cloud applications & its corresponding complexities, the need for observability is greater than ever. This blog looks into the what-why of distributed tracing along with few best practices to adopt for the same in microservices architecture. Distributed tracing for Microservices architecture is an emerging concept that is gaining momentum across internet-based business organizations.