
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Introduction to Apache Iceberg

Apache Iceberg is an open source table format for large-scale analytics. It addresses the limitations of traditional table storage solutions by offering a more efficient, high-performance way of managing data at scale. Iceberg allows for fine-grained control over data, enabling features such as schema evolution, time travel, and transactional support, which are crucial for modern data architectures.

Turbo360 Welcomes Black Marble as a Partner in Excellence

We at Turbo360 are thrilled to announce our partnership with Black Marble, a renowned leader in high-quality software development and innovative solutions. With their extensive expertise across the Microsoft platform and commitment to delivering exceptional user experiences, Black Marble brings a wealth of knowledge and skill to our collaborative efforts.

Use Grafana Alloy to collect Azure metrics with less hassle

Are you using the Azure metrics exporter to ship telemetry data to Grafana Cloud? Are you overwhelmed with the amount of configuration and complexity necessary to avoid being rate limited? Well, did you know that with Grafana Alloy, our distribution of the OpenTelemetry Collector with built-in Prometheus pipelines and support for metrics, logs, traces, and profiles, you can now: Let’s look at how these two features can reduce the complexity of your Alloy configuration.

Why business continuity belongs in the cloud?

Resilience in today’s liquid business environment demands flexibility. The term “observability” replaces monitoring, reflecting the need to adapt and be agile in the face of challenges. The key is to dissolve operations into the cloud, integrating tools and operational expertise for effective resilience. I remember that when I started my professional career (in a bank), one of the first tasks I was handed was to secure an email server exposed to the internet.

Elastic's RAG-based AI Assistant: Analyze application issues with LLMs and private GitHub issues

As an SRE, analyzing applications is more complex than ever. Not only do you have to keep the application running optimally to ensure great customer experiences, but in some cases you must also understand its inner workings to help troubleshoot. Analyzing issues in a production-based service is a team sport. It takes SRE, DevOps, development, and support teams to get to the root cause and potentially remediate it. If the issue is impacting customers, it's even worse because there is a race against time.

Sentry on Sentry: How Metrics saved us $160K

If you know me, you know I care about fast code. Recently, I ran a simple query that revealed that we spend almost $160k a year on one task. Luckily, we launched the Metrics beta back in March. Over the last month or so, 10 of us Sentry engineers collaborated across many functions to leverage Metrics to track custom data points and pinpoint the issue leading to this ridiculous ingestion cost.

The Ultimate Guide To Incident Communication in 2024

In the digital realm, incidents such as service disruptions and security breaches are inevitable. Incidents affect your customers and stakeholders, and they pose significant challenges to IT, Ops, DevOps, and customer support teams. As we increasingly depend on digital tools and services, the demand for seamless performance escalates, highlighting the importance of effective incident communication.