Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Ingest OpenTelemetry traces and metrics with the Datadog Agent

OpenTelemetry is a Cloud Native Computing Foundation (CNCF) initiative that provides open, vendor-neutral standards and tools for instrumenting services and applications. Many organizations use OpenTelemetry’s collection of APIs, SDKs, and tools to collect and export observability data from their environment to their preferred backend. As part of our ongoing commitment to OpenTelemetry, we are proud to have contributed our distributed tracing libraries to the CNCF community.

Cloud SQL: Concepts of Networking

Cloud SQL provides a managed service for MySQL, PostgreSQL, and SQL Server databases as well as backups, high availability, maintenance, and so much more! In this episode of Networking End to End, Lorin Price discusses networking concepts from implementation and security to connectivity on Cloud SQL. Watch along to learn about the options for deploying Cloud SQL and tips on how to determine who and what can access your Cloud SQL instance.

Event Reduction in Four Easy Ways with Cribl Stream

One of Cribl Stream’s selling points is the reduction of ingested log volume, which helps our customers control costs and improve system performance. This can be accomplished in two ways – either by eliminating duplicate or unnecessary fields and null values within the events, or controlling the number of specific events that actually get sent to the destinations through strategic filtering.

All things logs: best practices for logging and Grafana Loki

What’s the saddest log line in the multiverse? A log line without context. That’s according to Grafana Labs software engineer and Grafana Loki tech lead Ed Welch, who joined Grafana Labs VP of Culture Matt Toback and Engineering Director Mat Ryer for the latest episode of “Grafana’s Big Tent," our new podcast about people, community, tech, and tools around observability.

Introducing a high-usage tier for Managed Service for Prometheus

Prometheus is considered the de facto standard for Kubernetes application metrics, but running it yourself can strain engineering time and infrastructure resources when your usage grows. In March, we announced the general availability of Google Cloud Managed Service for Prometheus to help you offload that burden, and today, we’re excited to announce a new low-cost, high-usage pricing tier designed for customers who are moving large volumes of Kubernetes metrics over to the service.

Alerting on error log messages in Cloud SQL for SQL Server

With Cloud SQL for SQL Server, you can bring your existing SQL Server on-premises workloads to Google Cloud. Cloud SQL takes care of infrastructure, maintenance, and patching so you can focus on your application and users. A great way to take better care of your application is by monitoring the SQL Server error log for issues that may be affecting your users such as deadlocks, job failures, and changes in database health.

Tracing a Ruby application with OpenTelemetry for performance monitoring

Ruby on Rails is a popular MVC framework for creating web applications. It is necessary to monitor your Ruby applications for performance issues. In today’s cloud-native and microservices-based architecture, it is difficult for engineering teams to troubleshoot performance issues. Tracing your application can give the much needed context required to troubleshoot performance issues.

The future of automation - from slow to hyperflow

Automation has been the harbinger of change since the start of industrialisation. From Roman aqueducts to conveyor belts, and from water wheels to modern computing, automation has represented the pinnacle of human innovation of its time. Making it an essential catalyst for change and a critical step to the next industrial revolution. But with mission critical software application being at the heart of modern industries, automation is not only a pinnacle but a base layer for the fourth industrial revolution.

New release of Sysdig Open Source leverages Falco plugins

Sysdig maintainers are thrilled to announce the latest release of our beloved OSS tool for analyzing and/or recording the activity of processes and containers on a Linux system. You can find the full CHANGELOG in the GitHub repository, but here are some highlighted features: Note: The 0.29.1 has been released with a bug fix shortly after we started to write this post.

Sysdig Advisor: Making Kubernetes troubleshooting effortless

The cloud, Kubernetes, CI/CD, DevOps, GitOps… the last five years have seen a huge transformation in how organizations are architecting and shipping applications. It’s hard to keep up with the pace and learn all of this new tech! Nearly 55% of respondents to Canonical’s 2021 Kubernetes and cloud native operations report highlighted how the lack of sufficient in-house skills and people power is the biggest challenge that Kubernetes brings to businesses.