%term

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Add BugSplat Crash Reporting to Your Monday.com Workflow

Aug 8, 2024 By BugSplat Team In BugSplat

We're excited to announce that BugSplat's integration with Monday.com is now in beta! This powerful combination brings crash reporting and project management together, helping development teams work more efficiently and resolve issues faster.

Read Post

BugSplat

Read more about Add BugSplat Crash Reporting to Your Monday.com Workflow

Elastic Observability 8.15: AI Assistant, OTel, and log quality enhancements

Aug 8, 2024 By Alex Fedotyev, In Elastic

Elastic Observability 8.15 announces several key capabilities: New and enhanced native OpenTelemetry capabilities: Elastic AI Assistant enhancements: Large language model (LLM) observability for Azure OpenAI: Elastic Observability now provides deep visibility on the usage of the Azure OpenAI Service. The integration includes an out-of-the-box dashboard that summarizes the most relevant aspects of the service usage, including request and error rates, token usage, and chat completion latency.

Read Post

Elastic

Read more about Elastic Observability 8.15: AI Assistant, OTel, and log quality enhancements

Monitor your Anthropic applications with Datadog LLM Observability

Aug 8, 2024 By Shri Subramanian In Datadog

Anthropic is an AI research and development company focused on building reliable and safe artificial intelligence systems. Their flagship product is Claude, an advanced language model and conversational AI assistant known for its strong capabilities in natural language processing, reasoning, and task completion. Anthropic places a particular emphasis on AI safety and ethics, and its models and APIs are used by organizations across various industries to build powerful, safe, and performant AI applications.

Read Post

Datadog

Read more about Monitor your Anthropic applications with Datadog LLM Observability

Heterogeneous IT Architectures - A Strategy for Resilience

Aug 8, 2024 By Rachel Berry In eG Innovations

As the dust settles from the CrowdStrike outages we reflect on how adopting a heterogeneous IT strategy when adopting technologies can increase your organization’s resilience against outages and mitigate risks.

Read Post

eG Innovations

Read more about Heterogeneous IT Architectures - A Strategy for Resilience

Event Logs Explained: Your Guide to System Health

Aug 8, 2024 By David Benson In Logit.io

Event logs contain critical information and the analysis of these logs will support organizations in the detection of many security incidents, from auditing user access to observing malicious traffic and even isolating monitor rule changes on a firewall. By collecting event logs systematically and analyzing them, organizations can obtain insights into their IT environment for maintaining operational efficiency and security.

Read Post

Logit.io

Read more about Event Logs Explained: Your Guide to System Health

82% less downtime: enhanced organizational efficiency with Sumo Logic

Aug 8, 2024 By Hadijah Creary In Sumo Logic

You have a business imperative to deliver exceptional customer experiences. Sadly, traditional monitoring and observation methods, which assume predictability, are no longer sufficient.

Read Post

Sumo Logic

Read more about 82% less downtime: enhanced organizational efficiency with Sumo Logic

Understanding the Deficiencies of AWS CloudWatch for Cloud Visibility

Aug 8, 2024 By Phil Gervasi In Kentik

While CloudWatch offers basic monitoring and log aggregation, it lacks the contextual depth, multi-cloud integration, and cost efficiency required by modern IT operations. In this post, learn how Kentik delivers more detailed insights, faster queries, and more cost-effective coverage across various cloud and on-premises resources.

Read Post

Kentik

Read more about Understanding the Deficiencies of AWS CloudWatch for Cloud Visibility

Is the Internet ready for L4S?

Aug 8, 2024 By Sergey Katsev In Catchpoint

Today, Catchpoint is pleased to be sharing the results of our Global Explicit Congestion Notification (ECN) Bleaching Rates measurement campaign, covering the state of ECN bleaching worldwide, according to Catchpoint’s perspective. ISPs, telecoms and streaming services, among others (this information should be of interest to anyone with ISP dependencies), will be able to draw on this information to determine if your network or an upstream network is experiencing ECN bleaching.

Read Post

Catchpoint

Read more about Is the Internet ready for L4S?

Observe deleted Kubernetes components in Grafana Cloud to boost troubleshooting and resource management

Aug 8, 2024 By Vasil Kaftandzhiev In Grafana

As a site reliability engineer, you need constant vigilance and a keen eye for detail if you want to manage your Kubernetes infrastructure effectively. As part of that effort, you need to see the historical data from your pods, nodes, and clusters — even after they’ve been deleted or recreated. Many SREs rely on kubectl for this, and while it’s indispensable for real-time Kubernetes management, it presents some significant challenges with historical data.

Read Post

Grafana

Read more about Observe deleted Kubernetes components in Grafana Cloud to boost troubleshooting and resource management

The CoPE and Other Teams, Part 2: Custom Instrumentation and Telemetry Pipelines

Aug 8, 2024 By Nick Travaglini In Honeycomb

The previous post laid out the basic idea of instrumentation and how OpenTelemetry’s auto-instrumentation can get teams started. However, you can’t rely only on auto-instrumentation. This post will discuss the limitations in more detail and how a CoPE can help teams overcome them.

Read Post