%term

The latest News and Information on Service Reliability Engineering and related technologies.

Auto optimize production logs with Last9 MCP

Apr 21, 2025 By Last9 - High Cardinality Monitoring In Last9

Use Last9 MCP in Claude Desktop or Cursor to analyze logs for a service, get recommendations on improving logging, and optimize log volumes. In this demo, the AI agent uses the `add_drop_rules` MCP tool from Last9 to filter out unnecessary logs and reduve volumes by ~60%.

View Video

Last9

Read more about Auto optimize production logs with Last9 MCP

Apache Cassandra Monitoring: Tools, Challenges & Best Practices

Apr 18, 2025 By Anjali Udasi In Last9

When your distributed database architecture scales to handle massive workloads, keeping tabs on everything becomes critical and complex. With its masterless architecture and linear scalability, Apache Cassandra powers mission-critical applications across industries—but without proper monitoring, you might as well be flying blind through a storm.

Read Post

Last9

Read more about Apache Cassandra Monitoring: Tools, Challenges & Best Practices

GDPR Log Management: A Practical Guide for Engineers

Apr 17, 2025 By Prathamesh Sonpatki In Last9

GDPR compliance for logs can be tricky—especially when you're trying to maintain system visibility and protect user data at the same time. For SREs and IT teams, it’s a balancing act between staying on the right side of privacy laws and not losing the context you need to troubleshoot. This guide walks through practical ways to handle personal data in logs, set up retention rules that make sense, and stay compliant without creating unnecessary friction.

Read Post

Last9

Read more about GDPR Log Management: A Practical Guide for Engineers

Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

Apr 17, 2025 By Rootly In Rootly

In this episode, we explore how networking has shaped reliability as we know it. Marino Wijay cloud networking expert and Staff Solutions Architect at Kong shares how his journey began not as an SRE, but with cables, routers, and switches. Marino explains the evolution of the fabric holding systems together through virtualization, and how software-defined networking, which is now a key element to resilient applications.

View Video

Rootly

Read more about Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

Creating an LLM-powered Incident Diagram

Apr 17, 2025 By Rootly In Rootly

Jeba Emmanuel, Rootly AI Labs Fellow, explains how he created a tool that takes a GitHub repository and a postmortem repository to generate an incident diagram and a timeline. The solution uses a series of highly-specialized LLMs for better and more consistent results.

View Video

Rootly

Read more about Creating an LLM-powered Incident Diagram

The New Rootly Ringtones: How Research-based On-Call Sounds

Apr 17, 2025 By Rootly In Rootly

We set out to create a ringtone that wasn’t just loud—but the sound of a modern pager. Something that wakes you up, but without triggering a full-blown adrenaline spike. In this video, go behind the scenes with sound engineer Gorjão as he crafts a how research-based on-call sound sounds like.

View Video

Rootly

Read more about The New Rootly Ringtones: How Research-based On-Call Sounds

A Closer Look at Docker Build Logs for Troubleshooting

Apr 16, 2025 By Faiz Shaikh In Last9

In the world of containerization, understanding what's happening under the hood during image builds can mean the difference between smooth deployments and frustrating debugging sessions. Docker build logs are your window into this process, offering crucial insights that help you optimize builds, troubleshoot errors, and maintain robust container infrastructure.

Read Post

Last9

Read more about A Closer Look at Docker Build Logs for Troubleshooting

How to Connect ELK Stack with Grafana

Apr 16, 2025 By Anjali Udasi In Last9

In today’s distributed systems world, you need clear visibility into logs, metrics, and everything in between to keep systems healthy and reliable. That’s where the ELK Stack and Grafana work well together—each solving a different part of the observability puzzle. ELK handles the heavy lifting of log collection and processing. Grafana adds intuitive dashboards and powerful visualizations.

Read Post

Last9

Read more about How to Connect ELK Stack with Grafana

Everything You Need to Know to Start Monitoring Postgres

Apr 15, 2025 By Faiz Shaikh In Last9

Keeping your Postgres databases healthy is non-negotiable if you care about application performance and reliability. But monitoring Postgres the right way? That’s where things get tricky. Between the sheer volume of metrics and the noise that comes with them, it’s not always obvious what to pay attention to—or when. This guide breaks things down with a focus on what matters in real-world production setups.

Read Post

Last9

Read more about Everything You Need to Know to Start Monitoring Postgres

Log Consolidation Made Easy for DevOps Teams

Apr 15, 2025 By Faiz Shaikh In Last9

Managing multiple systems that each generate their alerts and logs can quickly become overwhelming. The challenge of scattered logs is a real headache, especially in the fast-paced world of DevOps. Log consolidation is not just a convenience—it's an essential practice that can save you from chaos and improve your operational efficiency. This guide covers everything you need to know about log consolidation, from understanding what it is and why it matters, to practical steps for making it work.

Read Post