Operations | Monitoring | ITSM | DevOps | Cloud

November 2023

ML and APM: The Role of Machine Learning in Full Lifecycle Application Performance Monitoring

The advent of Machine Learning (ML) has unlocked new possibilities in various domains, including full lifecycle Application Performance Monitoring (APM). Maintaining peak performance and seamless user experiences poses significant challenges with the diversity of modern applications. So where and how does ML and APM fit together? Traditional monitoring methods are often reactive, resolving concerns after the process already affected the application’s performance.

AVD Monitoring for MSPs (Managed Service Providers)

AVD (Azure Virtual Desktop) has grown in popularity and for many reasons many organizations are choosing to consume AVD via a specialist Managed Service Provider (MSP). For the MSPs offering AVD, monitoring and troubleshooting AVD especially for multi-tenant environments is challenging and involves significant, costly and skilled effort using native tools such as Azure Insights or Azure Monitor.

How to Monitor MongoDB Metrics with OpenTelemetry

For high throughput systems that focus on gathering continuous data or have a heavy read-only traffic, NoSQL databases came as a blessing. NoSQL databases, due to their unstructured nature of data, allow relatively faster inserts as well as reads compared to relational databases. One such database that’s quite popular today is MongoDB. In this article, our focus would be to understand how to extract metrics out of MongoDB and ship them to Signoz using the Open Telemetry collector.

Memcached Metrics Monitoring with OpenTelemetry

Let's dive deep into the realm of Memcached, where we'll uncover the power of monitoring with OpenTelemetry and SigNoz. This isn't just about caching data; it's about watching over Memcached like a vigilant guardian, ensuring it performs at its best, and optimizing your application's speed. In this tutorial, you will install OpenTelemetry Collector to collect Memcached metrics that should be monitored for performance and then send the collected data to SigNoz for visualization and alerts.

Streamline your CD pipeline for Cisco Cloud Observability

How can you leverage a monitoring-as-code mechanism to initiate new workload monitoring, or to create new visualizations? In this demo, see how Cisco AppDynamics can integrate with Flux CD (Continuous Delivery)—a GitOps Kubernetes operator tool that offers a simple and efficient interface to synchronize manifests within CD workflows from GitHub repositories. See how easy it is to upgrade existing software with just a few lines of code such as when instrumenting new workloads with the OpenTelemetry Agent or customizing a Grafana dashboard.

Uni Updates Episode 4: AppDynamics University is moving to Cisco U

AppDynamics University's self-paced and instructor-led training is moving to Cisco U., Cisco’s world-class learning experience platform, and will be available after December 15, 2023. What does this mean for you? If you have a Standard AppDynamics University subscription, self-paced training will be available in Cisco U. Free. If you have a Premium or Multi-User University subscription, self-paced and instructor-led training will be available in Cisco U. Essentials. In all cases, you will need to create a Cisco U. account using your AppDynamics University account email address.

Transitioning from lz4net to K4os.Compression.LZ4

At Raygun, we’re processing billions of events per month for our customers, so it’s well worthwhile looking for the most efficient data storage solutions. Way back when we started out, we chose lz4net to store data, which served our purposes well for many years. As we grew, though, we realized this was getting expensive, and was starting to undermine our business model. This post is focused on how we made the switch to the K4os.Compression.LZ4 rewrite, attaining significant performance gains.

How to Monitor MySQL Metrics with OpenTelemetry

Database monitoring is an important aspect to look at for a high-volume or high-traffic system. The database performance drastically impacts the response times for the application. In this tutorial, you will install OpenTelemetry Collector to collect MySQL metrics and then send the collected data to SigNoz for monitoring and visualization. In this tutorial, we cover: If you want to jump straight into implementation, start with this pre-requisites section.

How to Collect .NET Application Logs with OpenTelemetry

In the realm of modern software development, achieving true observability is paramount for understanding application behavior and performance. This demonstration focuses on a.NET application that harnesses the capabilities of OpenTelemetry to seamlessly integrate logging and tracing functionalities. OpenTelemetry, a key player in the Cloud Native Computing Foundation, provides a unified framework for comprehensive observability.

How to Monitor Prometheus Metrics with OpenTelemetry Collector?

OpenTelemetry provides a component called OpenTelemetry Collector, which can be used to collect data from multiple sources. Prometheus is a popular metrics monitoring tool that has a wide adoption. If you’re using Prometheus SDKs to generate metrics, you can collect them via OpenTelemetry collector and send them to a backend of your choice.

How to Monitor Apache Server Metrics with OpenTelemetry

Monitoring Apache web server metrics ensures your web server performs efficiently, securely, and reliably. In this tutorial, you will configure OpenTelemetry Collector to collect Apache metrics and send them to SigNoz for monitoring and visualization. We cover: If you want to jump straight into implementation, start with this prerequisites section.

Best practices for writing clean, maintainable JavaScript

The world’s biggest language comes with a huge collection of conventions and guidelines from the community’s collective wisdom. Following JavaScript best practices can help achieve faster page loads and better performance, improve code readability, and make maintenance and debugging easier. Carefully crafted code can also prevent errors and security issues, especially if it’s complemented with real-time diagnostic tools such as JavaScript error monitoring.

Chasing the Rainbow: Towards Unified Service Metrics

As Zendesk migrated from a monolithic application to an ecosystem of hundreds of services, its need for fully unified and standardized observability became a chief concern. In this talk, Senior Principal Engineer Daniel Schierbeck shares how adopting a service mesh has helped Zendesk teams manage its growing number of services while standardizing its observability. He also explains how Zendesk’s approach to monitoring service interactions has evolved as it adopted Datadog metrics and Datadog APM.

OpenTelemetry Java Tutorial | Auto-Instrument Java App with OpenTelemetry

OpenTelemetry stands at the forefront of modern observability practices, revolutionizing how developers gain insights into their applications' performance and behavior. As a powerful distributed tracing framework, it empowers engineers to effortlessly instrument their applications, providing comprehensive visibility into the intricacies of microservices architectures. This tutorial discusses how OpenTelemetry can be used to get insights from a Java application.

OpenTelemetry Operator Complete Guide [OTel Collector + Auto-Instrumentation Demo]

Manually deploying and managing OpenTelemetry components in a Kubernetes environment can be a complex and time-consuming task. It involves creating various Kubernetes resources, setting up configurations, and ensuring the components are properly integrated with the applications.

Proactive monitoring with automated alerts

E-commerce DevOps teams can use AppDynamics to monitor the health and performance of their applications, receiving alerts on issues BEFORE significant business impact. Devs can use AppDynamics to provide automatic email and text notifications about such issues. In this demo, see how you can harness a custom email alert notification to view anomalous synthetic transaction events within the Browser App Dashboard, drill into the Heath Rules violation page, and link to a custom dashboard to troubleshoot an unexpected increase in synthetic end-user response time for shopping cart activity.

OpenTelemetry Webinars - The Open Agent Management Protocol

Open Agent Management Protocol (OpAMP) is the emerging open standard to manage a fleet of telemetry agents at scale. Join Nica and Srikanth as we discuss recent updates to the standard and how you can remotely manage the OpenTelemetry collector with OpAMP. SigNoz helps developers monitor applications and troubleshoot problems in their deployed applications. SigNoz uses distributed tracing to gain visibility into your software stack.

OpenTelemetry Webinars - The Trace API

Join Nica and Srikanth to talk in detail about the OpenTelemetry Trace API. We'll talk about adding spans, events, attributes and other extra info, whether it's really possible to replace logs with traces, and more More about SigNoz: SigNoz - Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc. Backed by Y Combinator.

Java Application Monitoring - How IT Ops can Diagnose Memory Leaks at Scale

Many server-side applications are written in Java and often process tens of millions of requests per day. Key applications in various domains like finance, healthcare, insurance and education are often Java-based. When these applications slow down or fail, they affect the user experience and in turn, reduce business revenue. Behind many web forms or form-like GUIs there will often be a Java application.

Germain UX's Platform Now Available on Salesforce AppExchange: A Game-Changer for Salesforce Users!

We are absolutely delighted to announce that the Salesforce Partner Team has given their enthusiastic approval for Germain UX to be publicly listed on the illustrious Salesforce AppExchange platform. Following an extensive and rigorous review process that spanned an impressive 14 months, we are honored to become part of the exclusive league of top-tier applications featured on the AppExchange. This marks a momentous achievement for Germain UX, and we are eager to share this significant milestone with you.

Proactive error management: Collaborate effectively and work smarter with tags

Talking to many of our customers with different needs and use cases, one particular issue comes up all the time. When I’m seeing so many error groups in my app and so many error notifications in my inbox every day, it’s easy to end up feeling overwhelmed. I want a more proactive system to alert me to which errors need attention and when, so that I can stop getting buried. Does this hit home?

Unleashing the Dynamic Duo: Supercharging Productivity with Liquit Workspace and eG Innovations Monitoring Solution!

A guest blog from Donny van der Linde, Pre-Sales Consultant at eG Innovations’ partner Liquit covering how to leverage eG Enterprise’s monitoring in combination with Liquit Workspace technologies to build powerful proactive contextual access workflows to support automated application delivery. An example using eG Enterprise’s user experience metrics to trigger remediation via Liquit Smart Icons is given.

How Mercado Libre scales its AWS microservices without losing visibility

Learn how Mercado Libre acts more quickly, strategically, and proactively thanks to Datadog’s centralized platform and context-rich alerting.Mercado Libre hosts the largest online commerce and payments ecosystem in Latin America, which means thousands of dollars can be lost if some of their critical applications stop working for even 1 minute. Senior Technical Manager Juliano Martins and software expert Marcelo Quadros share a few reasons why they chose Datadog as their observability platform of choice for their AWS environment: the power of our infrastructure monitoring solution, extensive range of integrations, strong reputation in the market, and more.

Retrace Keynote: Observability (metrics, traces, logs) - Take back control of your data

In the keynote recorded during our recent user group session NUGGETS 2023, Sanjeev Mittal, GM of Retrace talks about how organizations are rapidly moving their infrastructure to the cloud and how their costs and complexity are increasing. Our customers are looking for Observability solutions that give them control of data and costs.

Troubleshooting Azure Virtual Desktop (AVD) Issues through Logon and Beyond

Today, I’ll be covering troubleshooting Azure Virtual Desktop (AVD) issues. I’ll cover the common causes of problems beyond logon and how you can monitor and troubleshoot to identify the root-causes of issues and how to resolve them resolve them. For information on troubleshooting logon problems and slow logons, please see my previous article: Troubleshoot Slow Azure Virtual Desktop (AVD) Logons.

What is Application Performance Monitoring (APM)?

In today's digital landscape, where everything is moving at breakneck speed, the performance of your applications can either make or break your digital game. We've all been there – the frustration of a slow website or an unresponsive app. That's where Application Performance Monitoring, or APM, swoops in to save the day. APM is like your app's personal fitness trainer, helping you keep it in peak condition.

Crossed 15K+ GitHub Stars, Simplified Logs Parsing with Pipelines & Trending on Hacker News - SigNal 30

Welcome to the 30th edition of our monthly product newsletter - SigNal 30! Last month, our Github repo crossed 15k+ Github stars, which is a great milestone for our open-source project and for our team. We also shipped the much-awaited logs pipeline that will make logs parsing a much better experience for our users. We also shipped other improvements to the product, hosted OpenTelemetry meetups and webinars, and much more.

I've Made a Huge Mistake: Implementing Agile on Infrastructure Teams

Bad planning methods can damage team morale and prevent teams from improving the systems they maintain. In this talk, Sam Handler from Shopify explains how his attempts to fix poor infrastructure planning processes through Agile methods failed. Drawing from this experience, he offers several principles that can help infrastructure teams improve the way they work.

Quick Demo of Logs Pipelines in SigNoz

Log pipeline allows you to preprocess your logs for enrichment, transformation, and attribute extraction before they get indexed. Here's a quick demo of using the Logs pipeline feature in SigNoz to parse Nginx logs. More about SigNoz: SigNoz - Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc. Backed by Y Combinator.
Sponsored Post

Taking down (and restoring) the Raygun ingestion API

In a world where Software as a Service (SaaS) products are integral to daily life, maintaining uninterrupted service for end-users is paramount. However, stuff happens. When it does, our most valuable response (other than restoring service ASAP) is to review the series of events that led up to the incident and learn from them. On August 25th, 2023, at 7:02 AM NZT, Raygun experienced a significant incident that impacted our API ingestion cluster, leading to an outage lasting approximately 1 hour and 15 minutes. While this wasn't fun for anyone involved, this incident did prove to be a valuable learning experience, shedding light on the importance of infrastructure management and resilience.

What is VMware Tanzu? And Why does Tanzu Matter?

Here in the Benelux region, we have recently been seeing increased interest in our capabilities to monitor and automate root-cause diagnostics for VMware Tanzu and other containerized / K8s technologies. Tanzu monitoring is one of the VMware technologies we’ll be demoing at VMware Explore in Barcelona (6 – 9 November 2023) and expect to see a lot of interest in it.

How Uber Freight Powers Intelligent Logistics with Datadog

Thiyagarajan Anandan, Uber Freight, shares how he and his team have created a center of excellence for monitoring and DevOps culture. Uber Freight, a division of Uber, delivers an end-to-end enterprise suite of Relational Logistics to advance supply chains and move the world’s goods. With more than 1,000 shippers across $18B freight under management (FUM), it’s critical for Uber Freight to provide a 99.99% uptime for its shippers and customers. Since migrating to the Datadog platform, Uber Freight for the first time has unlocked the full breadth and depth of their systems, thereby significantly decreasing MTTR/MTTD and delivering an improved customer experience.

APM vs Tracing vs Observability

Application Performance Monitoring (APM), tracing, and observability are fundamental software development and system management approaches. Each of these three concepts uniquely ensures that your applications operate, efficiently, smoothly, and reliably. Your organisation will more than likely already adopt one of these approaches, or even two, potentially all three.