Operations | Monitoring | ITSM | DevOps | Cloud

May 2022

Where Does Middleware Stand in Web Development?

Before we get into the topic, readers should have some prior knowledge of web development and middleware. As its name suggests, web development is a field of computer science in which developers build websites, whether for the internet or an intranet, and whether a single-page website, a blog, or a complete social media platform. Middleware acts as the glue: it provides the path between the front end and back end of a computer or Android-based application.

Monitor Amazon RDS Proxy with Datadog

For over a decade, Amazon RDS (Relational Database Service) has been a popular managed database service companies have used to set up, operate, and scale databases for web and mobile applications. However, modern, high-scale applications can have upward of tens of thousands of clients. This means that maintaining direct connections between the database and application can quickly begin to consume more resources than the query executions themselves.

Monitor your Helm-managed Kubernetes applications with Datadog

Helm is a package manager that makes it easy to deploy and manage Kubernetes applications. Our new Helm integration allows you to monitor the availability and status of the Helm-managed applications deployed in your Kubernetes clusters. In this post, we’ll show you how you can visualize the status of your Helm releases and use monitors to notify you of important changes in your Helm environment.

What is OpenTelemetry

You may have previously heard about OpenTelemetry (also known as OTel) if you have looked into improved ways of standardising different data types. In this article, we’ll delve into the key things you need to know about OpenTelemetry and how this unified standard may become the future of how logs, metrics, events and traces are all handled.

Why API Monitoring is a Business Necessity

This article discusses what APIs are, what API monitoring involves, how an API failure would affect your business, and how to protect your APIs and services. Nearly everything on the web is powered by an API nowadays. APIs are an essential part of the web, connecting the different services that help customers complete their actions. This article will help you understand the importance of APIs and why API monitoring is a business necessity.

Become a Telegraf Pro with a New Course from InfluxDB University

Telegraf, InfluxData’s open source data collection agent, is a key tool for managing data pipelines. Users around the world rely on Telegraf’s flexibility to collect, clean, transform, and send data. But even with Telegraf’s immense popularity, we continue to look for ways to make it even easier to use. That’s why, when we launched InfluxDB University, we knew that Telegraf would be a key ingredient.

Survey: Are You Using an Observability Solution, Implementing One, Actively Planning for it or Thinking About it?

Whether you’ve just started your observability journey or haven’t started one at all, Techstrong Research is here to help the industry gain insights into observability with their “Observability at the Speed of Innovation 2022” survey, sponsored by StackState. Techstrong Research, an industry analyst and consulting group focused on the business outcomes of disruptive technologies, is researching where organizations are in their observability journey.

Grafana for business intelligence: How Grafana Labs uses dashboards for more than observability data

Having joined Grafana Labs as one of our first data & analytics hires, I spent much of my time in the first few months considering how we should structure our data stack to optimize for a quick path to value, while allowing our small data team to scale going forward.

What is RMM? The Importance of Network Management

Remote monitoring and management (RMM) is software with a pretty simple mission: save IT pros time and effort. How? Instead of waiting for a user to report an issue, an RMM lets you automate responses and gain insights that can reduce issues down the road. Now, there are plenty of other software tools you could use to make life easier, and to be clear, RMMs aren’t the right tool for every job.

What Is Apache Kafka and How Do You Monitor It?

Apache Kafka is known for its ability to handle real-time streaming data with speed and efficiency. It’s also known for being scalable and durable, which makes it ideal for complex, enterprise-grade applications. Of course, those new to the concept behind Kafka may find that it takes some time to understand how it works. Thanks to its unique combination of messaging, storage, and stream processing features, Kafka is well suited for both real-time and historical data analysis.

Monitoring AdTech KPIs Can Prevent Lost Business and Revenue

The high volume and high rate of transactions in the adtech market push vast amounts of data through the entire ecosystem, 24×7. Regardless of its place in the market – advertiser, ad exchange, ad network, or publisher – each company has thousands or even hundreds of thousands of metrics that measure every aspect of its business. Monitoring these metrics can prevent incidents from impacting the business.

Looking for Needles In a Stack of Needles? Develop an Observability Mindset

When I talk with Splunk customers, their challenges sometimes sound like trying to find a needle in a stack of needles. Feel the same way? The answers you need are out there, hidden in your data. Our job is to help you find them. Securing your networks, keeping them up and running and maximizing efficiency are key priorities. You also face the challenges of speeding up development and driving innovation to stay competitive.

Lightrun Is Now Available For Web IDEs

We’re delighted to announce that Lightrun for Web IDEs is now available for our beta users! Lightrun for the Web is now supported in three different IDEs, and each plugin has its own documentation article. Lightrun’s users are now able to connect to their live applications directly from the browser, without having to download one of our dedicated plugins – and enjoy the full suite of Lightrun features right in the browser.

The 4 Golden Signals for Monitoring Kubernetes: Everything You Need to Know

Kubernetes is currently the de facto container orchestration system on the market. Both small and large companies adopt it, and all major cloud providers offer it as a service. However, Kubernetes is a complex and layered platform, so you can’t just jump into it. There are three essential stages for each application: design, deployment, and operation. This blog post will focus on operation, where you need to monitor and troubleshoot your deployed applications.

5 Tips To Make Google Fonts Faster

Google Fonts is a fantastic tool for web designers and developers, but it is sometimes one of the slowest resources on your website. It’s frustrating and ironic that Google’s own font service is the long pole in so many web performance reports, but it doesn’t have to be! Here are 5 ways to supercharge your install of Google Fonts to make it download less, load faster, and reduce layout shifts of your website.

Introducing native support for OpenTelemetry in Jaeger

The latest Jaeger v1.35 release introduced the ability to receive OpenTelemetry trace data via the OpenTelemetry Protocol (OTLP), which all OpenTelemetry SDKs are required to support. This is a follow-up to the previous announcement to retire Jaeger’s “classic” client libraries. With this new capability, it is no longer necessary to use the Jaeger exporters with the OpenTelemetry SDKs, or to run the OpenTelemetry Collector in front of the Jaeger backend.

Cloud Data Fusion: Concepts of Networking

To understand implementation, security, and connectivity in Cloud Data Fusion, it’s helpful to first understand the networking concepts. In this episode of Networking End to End, Lorin Price goes over what Cloud Data Fusion is, Public IP vs Private IP, and more. Be sure to look out for more videos on the Concepts of Networking for Google Cloud’s managed services!

Netdata Machine Learning Meetup

This video livestream meetup by Netdata takes a deep dive into the fundamentals of Machine Learning in DevOps Infrastructure Monitoring. It also covers the Netdata way of approaching Machine Learning. The Anomaly Advisor major update to Netdata is introduced as a valuable troubleshooting tool for any DevOps or Site Reliability Engineer looking for anomalies in their infrastructure. The hosts share real-world infrastructure monitoring & troubleshooting examples, as well as early feedback from the community on the Anomaly Advisor.

The 5 Levels of MSP Operational Maturity

Paul Dippell is the co-founder and CEO of Service Leadership, Inc., a company that measures IT and managed service provider (MSP) performance across the industry and annually publishes the results as the Service Leadership Index®. In this interview, we ask about his work using operational maturity in benchmarking managed service providers.

How to Troubleshoot Routing Problems

Routing problems tend to emerge when you’re first setting up a new piece of network equipment, and when something has failed. Usually, routing problems are caused by some sort of configuration or design error. Troubleshooting routing problems is tricky because the usual tools like ping and traceroute don’t always tell you what you need to know.

Monitor PlanetScale with Datadog

PlanetScale is a serverless, MySQL-compatible database platform powered by Vitess. PlanetScale handles database scaling while also providing you with the tools to increase your development velocity, such as branching, non-blocking schema changes, automatic backups, built-in connection pooling, as well as a helpful interface and CLI. Datadog’s new integration gives you deep visibility into your PlanetScale databases, so you can optimize your usage and costs.

How To Perform Sentiment Analysis using NLP and RDA Bots

Movie reviews usually play a big role in whether audiences who haven’t seen a movie yet decide to watch it. Reviews reflect how a movie has been interpreted, but it is hard to track the value of those interpretations without some analytical method for doing so.

Cribl Search: Powering the Future of Observability

Cribl Search turns the traditional search process on its head, allowing users to search data in place. No longer must data be collected and moved to storage before being examined. With Cribl Search, administrators can search data at the edge, moving through an observability pipeline, stored in a data lake, or even stored in their existing solutions like TSDBs or log stores.

Is hyperautomation the key to digital world success?

In this article, we look at the core concepts of SAP digital transformation, its role in enhanced decision making, and how hyperautomation could be the secret to creating a competitive advantage for your organization. Once seen as a costly legacy system for many organizations, SAP is no longer the ‘unwieldy’ application it once was.

Celery worker | Tutorial on how to set up with Flask & Redis

Celery is a simple, flexible, and reliable distributed system to process vast amounts of messages while providing operations with the tools required to maintain such a system. In this tutorial, we will learn how to implement Celery with Flask and Redis. Before we dive deep into the tutorial, let’s have a brief overview of Celery workers.

How to: Comprehensive Monitoring for WordPress Sites

It’s funny how even today, in 2022, people still ask me if WordPress is a good choice for a high-traffic website. There are few website-building solutions on the internet as tested as WordPress. And brands like TechCrunch and The New Yorker wouldn’t be hosted on WordPress if traffic were an issue. But here’s the thing: WordPress is self-managed. As a result, you as an individual or organization using it for your business are responsible for monitoring your WordPress site.

5 surprising things you might not know about StatusCake

Yep, you read that right, THE fastest. Like the Usain Bolt of check rates, just without the Olympic gold medal to back it up (but it’s safe to say that if there was a gold medal for quickest check rates, we would win gold). So what does this actually mean? It means that your website will be checked almost constantly; every 30 seconds depending on the plan that you pick.

Highlights from KubeCon + CloudNativeCon 2022

After two years of virtual editions, KubeCon + CloudNativeCon Europe returned as a hybrid event, with its in-person portion held in Valencia, Spain, from May 16-20. As platinum sponsors of this year’s conference, Datadog held a booth where we showcased the latest updates to our Kubernetes monitoring solution, including the new Kubernetes resources overview, improved OpenTelemetry support, and the latest version of the Datadog Operator for Kubernetes.

Network Fault Monitoring vs. Network Performance Monitoring

Every IT administrator knows that users typically complain of two things: the network doesn’t work or it’s slow. When your network isn't working, it’s usually because something is down and we can rely on Network Fault Monitoring tools to notify us. But where do we start when users complain of poor performance? And what tools are available to help us? In these situations, Network Performance Monitoring tools might be just what you need.

10 WordPress Errors That Can Tank Your Website (+ How to Fix Them)

WordPress powers more than 455 million websites globally (some 37% of the total) and holds a staggering 62% share of the CMS (content management system) market. It also offers more than 54,000 plugins to customize your site. However, as with any new process or tech, such as your new business text messaging tool, things may not always go right. What are the most common errors that can occur with WordPress and adversely affect your website?

Lumigo + JetBrains

Lumigo uses IntelliJ IDEs everywhere. The back-end developers love their PyCharm and we frontend developers use WebStorm all the time. No doubt that it’s one of the most popular IDEs out there. One of the perks at Lumigo is that as employees, we can use 10% of our working time to invest in personal projects or do cool things for self-development and innovation.

Auto-Instrumenting NestJS Apps with OpenTelemetry

In this tutorial, we will go through a working example of a NestJS application auto-instrumented with OpenTelemetry. In our example we will use a simple application that outputs “Hello World!” when we call it in the browser. We will instrument this application with OpenTelemetry’s Node.js client library to generate trace data and send it to an OpenTelemetry Collector. The Collector will then export the trace data to an external distributed tracing analytics tool of our choice.

5 Common Amazon Kinesis Issues

Amazon Kinesis is the real-time stream processing service of AWS. Whether you have video, audio, or IoT streaming data to handle, Kinesis is the way to go. Kinesis is a serverless managed service that integrates nicely with other services like Lambda or S3. Often, you will use it when SQS or SNS is too low-level. But as with all the other services on AWS, Kinesis is a professional tool that comes with its share of complications.

Ease the Transition: 5 Tips for Taking Over your Team's Uptime.com Account

Eleven basic checks and one status page. That’s all you see when looking over the account usage of the Uptime.com account you are now managing for your company. When you logged in for the first time you saw a dashboard with cards and metrics, labeled with titles that don’t obviously connect to services you offer. Your first clicks were to navigate to view the subscription – maybe your plan details can give you some guidance. Does this sound like you?

OpenMetrics vs OpenTelemetry - A guide on understanding these two specifications

OpenMetrics and OpenTelemetry are popular standards for instrumenting cloud-native applications. Both projects are part of the Cloud Native Computing Foundation (CNCF) and aim to simplify how we generate, collect and monitor services in a modern cloud-native distributed application environment. Let's have a look at how both the standards are aiming to help solve the observability conundrum.

Top 30+ Best DevOps Tools in 2022: A Comprehensive List of Automation Technologies You May Not Be Using in Your Pipeline

Software is getting more and more complicated, and so is the infrastructure behind it. It is no longer a single web or application server with a database backing it up. Over the years, we have added multiple databases, queues, datastores, search engines, and configurations. We want to incorporate continuous delivery and automated testing and deploy everything easily.

Synthetic Monitoring With Sematext | Release and Features | The Best Website Monitoring Solution

Sematext’s Synthetic monitoring tool is a website monitoring solution that lets you track the availability and performance of your websites. Monitor your entire website or an individual HTTP request, including 3rd party APIs. Get the best website monitoring tools with Sematext’s synthetics and Real User monitoring tools.

What's new in Sysdig - May 2022

Welcome to another edition of What’s New in Sysdig in 2022! The “What’s new in Sysdig” blog is now under my control! Hello, I’m Wes MacKay, a Sales Engineer based out of Dallas, TX working with the Sysdig US West Corporate team. I’m way too passionate about containerization, personal cloud storage, and automating my home life. In my spare time, I’m always looking for better Thai and Sushi restaurants in my area.

How to deploy Grafana Enterprise Logs on Red Hat OpenShift

Here at Grafana Labs, we’re always looking for ways to provide our customers with a choice of platforms where they can run Grafana Enterprise Logs (GEL). As part of that mission, we’re pleased to announce that we’ve added Red Hat OpenShift 4.x support to GEL. GEL, as you may know, is a leading enterprise logs solution.

Herrenknecht AG Powers IIoT Platform and Edge Data Collection for Tunnel Boring Machines with InfluxDB

Herrenknecht AG is a technology leader in mechanized tunneling systems. Engineers at Herrenknecht set out to build an industrial internet of things (IIoT) platform that provided insight into live and historic data for all their tunnel boring machines (TBMs). These machines have thousands of sensors generating high velocity data, sometimes in remote areas with limited connectivity.

Monthly Product Update - New Developer Experience for InfluxDB Cloud

We love to write and ship code to help developers bring their ideas and projects to life. That’s why we’re constantly working on improving our product in sync with developer needs to ensure their happiness and accelerate time to awesome. This is the first in a blog series covering our product’s latest features — features that we think will save you time and effort when building with time series and InfluxDB.

This Month in Datadog: Episode 11

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. To learn more about Datadog and start a free 14-day trial, visit the Datadog website. This month we put the Spotlight on automated root cause analysis with Watchdog RCA.

How to configure Netdata's all-new Anomaly Advisor, powered by ML, for real-time troubleshooting

Netdata's Lead Machine Learning Engineer, Andrew Maguire, walks through how to configure the all-new Anomaly Advisor. This new feature lets you troubleshoot in real-time, at scale, by identifying periods of time with raised anomaly rates across your entire infrastructure. In this guided video, Andrew will explain how to enable Netdata's ML functionality, then how to set up unsupervised anomaly detection with minimal configuration, and lastly how the Anomaly Advisor works to speed up troubleshooting when an incident occurs.

Charting a Course to Clearer Visibility | Discovering Observability: Session 1

As a monitoring professional, you’re responsible for tracking the elements in your infrastructure – regardless of where they live. You have the metrics, you’re collecting the logs, and you’re tracing the most important business processes, so you already have the critical parts required for true infrastructure visibility. But correlating the data, detecting anomalies, and being informed when things go awry is the challenge. Knowing a value is outside a threshold is great, but does that constitute a problem? Thankfully, SolarWinds® Hybrid Cloud Observability is here to give you that next level of insight.

How to Prepare for Peering Partner Business Review

Peering is more than just setting up sessions with any AS that will accept one. Peering can involve long-term relationships that require reviews and joint planning to grow synergy. A critical milestone in any peering relationship is the business review; and when it comes to business reviews, it’s all about preparation. Learn how Kentik can help you get ready to ace business reviews with peering partners.

Role-Based Access Control (RBAC) in eG Enterprise

eG Enterprise is a single pane of glass that provides monitoring and oversight of every layer and tier of your IT infrastructure. However, this does not mean every user in the organization has the same view of the IT infrastructure or needs to use the tool in the same way. In fact, controlling a user’s view of the monitored applications and infrastructure, as well as their privileges to perform tasks and access data via Role-Based Access Control, is a critical feature for our customers.

Splunk in the Financial Services Industry Today

In the late 1960s, there was a rock band called Ten Years After and I liked the name the first time I heard about them. I wanted to use "Splunk and the Financial Services Industry: Ten Years After" as the title of this blog entry, but it’s been more than ten years since I wrote the first Splunk Blogs entry on Splunk and the Financial Services Industry. As you can tell, a lot has changed since then and more than a decade is an internet lifetime in technology.

Do you also live in Cloud 9? Cloud installators (agents, server)

In this workshop we will see the new unattended cloud installation model in Red Hat 8 (compatible with Rocky Linux 8), for the quick use of PandoraFMS environments. For this, it will only be necessary to have an activated instance of Red Hat 8 and an Internet connection to run the installer. Another point that we will see will be the agent installer in cloud format for Red Hat and Debian based systems.

3 key benefits that prove OpenTelemetry is the future of APM

Application performance monitoring (APM) solutions were designed to catch anomalies in an application or website's backend and provide meaningful insights to rectify issues in real time. Lately, though, APM solution providers have been left playing catch-up to be more inclusive of newly emerging technologies and the operational challenges they bring. OpenTelemetry (OTel) simplifies the issues caused by the demands of modern applications.

Logz.io's New Integration with AWS Kinesis: Send Metric Data Without a Single Line of Code

After creating your Logz.io account, the first step for onboarding is to send your log, metric, and trace data. Logz.io makes this flexible – allowing for multiple ways to get data into your Logz.io account depending on your use case and technology stack. Today, we’re excited to announce another easy and fast way to get AWS metric data into Logz.io: by setting up a CloudWatch metrics stream and a Kinesis Firehose.

BIG things are Happening at Graylog!

Did you hear the news? Graylog is on a mission to help make your IT environment and data more efficient and secure by making it super easy to uncover the answers stored in your machine data. At Graylog, coming up with solutions to problems faced by IT and Security professionals is what drives us. Our teams are always working on ways to add meaningful functionality that increases productivity so you can focus your resources on the innovation and core competencies that you’re known for.

Create Bitcoin Buy and Sell Alerts with InfluxDB

This article was originally published in The New Stack. Given how volatile the Bitcoin price is, an automated alerting system can be valuable for preserving our attention and sanity. We can pay attention to Bitcoin only when the price action is interesting. Momentum, that is, buying an asset that has done well in the past, has been one of the most persistently effective trading strategies; see Cliff Asness: “Value and Momentum Everywhere” and Tzouvanas (2019).

How to monitor an Umbrel server running a Bitcoin node with Grafana Cloud

Most people in the world are familiar with legal tender paper money — also known as fiat currency — and how to access it online through a bank website, ATM, or mobile app. The idea of “digital money” or cryptocurrency — such as Bitcoin — remains a relatively new concept.

5 Key Requirements of Modern Enterprise Monitoring & Observability Platforms

Monitoring is an essential function of enterprise SRE teams and a critical component of business service deliverability. Its importance has only grown as enterprise environments and technologies continue to evolve at a rapid pace. Unfortunately, traditional monitoring is no longer enough.

How to Monitor Active Directory with OpenTelemetry

We’re excited to announce that we’ve recently contributed Active Directory Domain Services (abbreviated Active Directory DS) monitoring support to the OpenTelemetry collector. You can check it out here! You can use this receiver with any OTel collector, including the contrib collector, observIQ’s distribution of the collector, and Google’s Ops Agent, to name a few examples.

Introducing Anomaly Advisor for troubleshooting at scale

Troubleshoot at scale with our all-new, lightweight Anomaly Advisor, powered by machine learning. The Anomaly Advisor finds periods of time with elevated anomaly rates across your entire infrastructure faster than ever before. This new feature works along with our ML unsupervised models on the edge, making your troubleshooting trouble-free! Even better, the Anomaly Advisor requires minimal configuration and is extremely lightweight. No need to worry about exhausting your CPU usage.

On-premise vs. On the Cloud

Since its emergence in the mid-2000s, the cloud computing market has evolved significantly. The benefits of reliability, scalability, and reduced set-up costs have created a demand to fuel an ever-growing range of “as-a-service” offerings, resulting in an option to suit most requirements. But despite the advantages, the question of cloud or on-premise remains valid.

Best practices to collect, customize and centralize Node.js logs

Node.js is an established platform for developing server-side applications in JavaScript. One of the most fundamental concerns that arise during the development of Node.js apps is how to carry out proper logging that will be safe, secure and performant. While there are several options for configuring Node.js logging, a few specific engineering best practices still apply, no matter which option you choose.

Kafka Summit 2022

We had a fantastic time at the Kafka Summit in London this year. It was so great to be meeting everyone face to face again after such a long break. It was interesting to see how people had progressed with their Kafka implementations. At the 2019 event, people were just getting started trying out Kafka in small test environments, but no one had enough production experience to understand their needs for management at scale and production support.

SEO 101: Keyword Research and Strategy

While SEO (search engine optimization) includes a variety of different elements to deliver measurable results, one of the foundational pieces of a good SEO strategy is keyword research. Since keywords and phrases are the actual targets that connect potential visitors to your website, finding ways to ensure your site places as highly as possible for relevant search terms is the linchpin for all other elements of SEO strategy.

What is Full-Stack Observability and Do I Really Need It?

Monitoring and visibility are dead. If you don’t have Full Stack Observability (FSO) you may as well just pack up and go home. Your business will fail, and you will be unemployed with no hope for the future. At least, that is what vendors currently pitching FSO would have you believe. But what is full-stack observability? Observability is the current buzzword in the monitoring industry, and full-stack observability is what vendors are currently focusing on.

NGINX Logging Configuration: How to View and Analyze Access and Error Logs

NGINX is one of the most widely used reverse proxy servers, web servers, and load balancers. It has capabilities like TLS offloading, can do health checks for backends, and offers support for HTTP/2, gRPC, WebSocket, and most TCP-based protocols. When running a tool like NGINX, which generally sits in front of your applications, it’s important to understand how to debug issues. And because you need to see the logs, you have to understand the different NGINX logging mechanisms.

Tracking On-Call Health

If you have an on-call rotation, you want it to be a healthy one. But this is sort of hard to measure because it has very abstract qualities to it. For example, are you feeling burnt out? Does it feel like you’re supported properly? Is there a sense of impending doom? Do you think everything is under control? Is it clashing with your own private life? Do you feel adequately equipped to deal with the challenges you may be asked to meet? Is there enough room given to recover after incidents?

4 Best Practices for Root Cause Analysis

Failures are a common part of any system’s lifecycle, so what would root cause analysis look like for this type of problem? If you build and deploy a system, chances are high that you'll have to deal with a failure in the near future. However, what matters is how you handle such failures. As an organization, you need to have pre-formulated strategies to handle failures as and when they occur.

A Layman's Guide To HTTP/2

HTTP stands for Hypertext Transfer Protocol and is the backbone of the World Wide Web. HTTP/2 is the second major version of the protocol and offers a performance improvement over its predecessor. The new protocol was in development for a long time: the first draft was published in 2012, and it was finalized in 2015. Today, almost all networks rely on HTTP.

Round-Trip Time (RTT) - An Overview

A notable measure of a website's health is round-trip time (RTT), also known as round-trip delay. RTT is the time it takes for a network request to reach its destination and for the response to return to the original source. It is measured in milliseconds and can be determined by pinging a specific address.

From Baud to Awed: The History of the Modem

From 300 baud to multiple gigabits per second, it’s time to celebrate the history of the modem. It occurs to me that we will soon be entering a period where no one will remember the ear-shredding screech of a dial-up modem connecting their computers to the internet—all the while hoping no one picks up the phone and wrecks it. The humble modem is, at least as a device sitting on your desk alongside your computer, largely consigned to history—and more than a few recycling centers.

How to install a Site24x7 APM Insight Java agent in Tomcat Server 6.x and above on Linux?

This video walks you through the steps for installing the Site24x7 APM Insight Java agent on an Apache Tomcat server. With the Site24x7 APM Insight Java agent installed, you can monitor your entire application, track every transaction that occurs, discover transaction errors, and optimize transactions before your end users are impacted.

5 Executive Blindspots Around Hybrid IT Observability

For tech leaders, staying on top of hybrid and multi-cloud complexity with traditional monitoring tools is no easy task, and it can create distinct visibility gaps across your environments. SolarWinds Hybrid Cloud Observability can help put you on the path to better business outcomes.

The Power of Elastic

As the leading platform for search-powered solutions, we help everyone — organizations, their employees, and their customers — find what they need faster, while keeping mission-critical applications running smoothly, and protecting against cyber threats. When you tap into the power of Elastic Enterprise Search, Observability, and Security solutions, you’re in good company with organizations like Netflix, Uber, Slack, Microsoft, and thousands of others who rely on us to advance their business. We’d love to help you, too. Together, let’s accelerate the results that matter.

Scaling Grafana Mimir to 500 million active series on customer infrastructure with Grafana Enterprise Metrics

At Grafana Labs, we’ve seen an increasing number of customers who are scraping hundreds of millions of active time series but need a solution to reliably store and query such a huge amount of data. So in March, we announced our new open source TSDB, Grafana Mimir, the most scalable, most performant open source time series database in the world.

How Downtime Can Affect Morale, And What You Can Do About It

Does the worst case scenario for your company include alert fatigue from false alarms? Maybe it should. No one likes a false positive when it comes to infrastructure monitoring, and false flags are especially irritating because you have to respond to a problem that doesn’t actually exist. Just how bad are false positives? Let’s break down what these annoying little mistakes add up to for your team. You might be surprised to learn just how much they are hurting your DevOps pipeline.

Cribl Search: Redefining Search Around Today's Reality

As CEO of Cribl, one of my greatest privileges is to spend time on the road and on calls with our customers hearing about their needs and challenges. Cribl is a focused company. We build software for observability and security. With this lens, it becomes clear the industry is neglecting to address the unique needs of our users. There are many reasons, most of which are simply that vendors tend to come at a user’s problem through the lens of their existing technology.

Cribl Raises $150M in Series D Funding, Announces New Search Product, & Innovation Lab

We are pleased to announce we have closed our Series D round of funding, led by Tiger Global with participation from existing investors IVP, CRV, Redpoint Ventures, Sequoia, and Greylock Partners. In this round, we raised $150M, bringing our total raised to over $400M. This new fundraising round further validates the value we provide to customers.

Understanding the Priorities of Data Behind Tomorrow's Business Opportunities

Many CXOs believe that Web3 will power the next paradigm shift and transform the world. As a result, they are accelerating their learning curve to spot opportunities and leapfrog to next-gen business models that will catapult their organizations to new heights. But is there really an urgency to explore what Web3 can offer?

Instrumenting your webpack-bundled JS code

OpenTelemetry (OTel) is an emerging industry standard that dev teams use to instrument, generate, collect, and export telemetry to better understand software performance and behavior. At Helios, we leverage OTel to provide developers with actionable insights into their code within distributed systems. We give them visibility into how data flows through their applications, enabling them to quickly identify, reproduce and debug issues in their flows.

Featured Post

Network topology software for your enterprise networks

Every network has a topology. It is the task of the network admin to discover and build upon it. So it's vitally important you have an extremely detailed understanding of your network topology. A network topology diagram graphically depicts the devices, connections, and paths of a network so you can see how the different components interact and communicate with one another. A network topology diagram is essential for creating and managing a network. Without it, even basic troubleshooting can become unnecessarily difficult.

Interview With CTO, Leonard Trigg

For the next interview in our series speaking to technology specialists from around the world, we’ve welcomed experienced CTO, Leonard Trigg. Leonard is a member of the Harbourfront executive management team and serves as the firm's Chief Technology Officer. Having joined the industry in 1995, Leonard has a background in enterprise technology, finance and operations. Leonard previously served as the Chief Operating Officer and Director at Vertex One Asset Management Inc.

Particle's Fleet Health Feature, Powered by InfluxDB, Delivers Device-Specific Data for IoT Deployments

The Particle platform enables companies to manage and program their IoT devices, bringing them to market quicker. To accomplish this, Particle developers needed to be able to collect telemetry data from a large number of edge IoT devices to measure performance. A key benefit of the Particle platform is its ability to scale from prototype to enterprise. Developers often prototype IoT devices using microcontrollers like Arduino or Raspberry Pi.

Ramp Up | Ep 05 | What is David Beck doing at a patching shop in Bangkok? | Patch Automation

In Episode 05, David Beck finds interesting parallels between a patching shop in Bangkok and Patch Automation. Also, follow us on social media channels to learn about product highlights, news, announcements, events, conferences and more.

Enhance Microsoft 365 and Microsoft Teams Performance

We have all seen multiple customers deploying Teams without adequate network capabilities and without any proper way to measure, troubleshoot and improve the user experience. That is why solutions like Martello are highly recommended to our Enterprise size customers.

How to run Checkly in your infrastructure - our new private locations

The monitoring and testing of public applications and APIs is challenging by itself. What should you test? How often should you run your tests? And who should be alerted? A scalable monitoring setup includes many hidden details, but technically it’s straightforward. Call public APIs and see if they do what they’re supposed to.

Collect and visualize MySQL server logs with the updated MySQL integration for Grafana Cloud

Today, we are excited to announce that the MySQL integration has received an important update, which includes a new pre-built MySQL logs dashboard and the Grafana Agent configuration to view and collect MySQL server logs. The integration is already available in Grafana Cloud, our platform that brings together all your metrics, logs, and traces with Grafana for full-stack observability.

Enlightning: What Is Observability?

Is Observability really just logging, metrics, and distributed tracing? Are we done? Mission accomplished? Can we go home for the week even if it is just Tuesday? You can often hear about the "Three Pillars of Observability" but having access to logs, metrics, and traces does not necessarily mean more observable systems. In this session, you'll learn what Observability is, what problems the three pillars solve, what problems they generate, and how deep the rabbit hole goes behind them. We will explore the basics of the three pillars and what Spring has to offer to implement them.

The Present and Future of Arm and AWS Graviton at Honeycomb

As many of you may have read, Amazon has released C7g instances powered by the highly anticipated AWS Graviton3 Processors. As we shared at re:Invent 2021, we had the chance to take a little sneak peek under the Graviton3 hood to find out what even more performance will mean for Honeycomb and our customers.

Testing Metrics using MetricFire

We offer a free trial period to ensure that our customers use MetricFire successfully. But often when someone signs up for a trial they may not know where to start. This is a guide on how to use your trial to test your data and see if MetricFire is right for your use case. Each use case is different so it is important to book a call with our technical support team so we can help you utilize your free trial optimally for you. Create a MetricFire account for free and see how our system works.

How to use VSCode to debug a Node.js application

Debugging is an essential step in software development, as it allows developers to fix errors before releasing the software to the public. Debugging tools can be integrated into code editors, making the debugging process more efficient. This tutorial will show you how to debug Node.js in Visual Studio Code.

Building automations with the step library

With the release of Avantra 21.11.4, we’ve introduced a new step library feature to allow our Avantra Professional Edition customers to build complex and reusable automations faster. I think the best way to describe the benefits of a step is to create a simple implementation to guide you through the process. The step library allows users to create individual steps outside of a workflow that behave just as the standard, pre-delivered steps do when creating a workflow.

Prescient Devices Makes Managing IoT Edge Devices Easy with InfluxDB

Prescient Devices focuses on changing the way businesses think about and use edge data and industrial IoT. The company built Prescient Designer, an edge data solutions platform that gives organizations the boost they need to transform their IT/OT processes. Prescient’s products empower data engineers, system integrators, and innovators to easily design and orchestrate edge-to-cloud data solutions.

Improve Performance in your iOS Applications - Part 3

Although modern iOS devices are capable of handling a wide range of intensive and complex tasks, the device may seem slow if you are not paying close attention to how your application operates. Performance improvements mentioned in this article are intended to make your code more readable and performant; however, select cautiously as per your needs. Oftentimes, altering or improving architecture and code refactoring takes more time and effort.

The 6 Fundamental Steps in a Network Monitoring Process

Network monitoring is vital for operating an IT environment at optimal performance. As a result, organizations can improve operational efficiencies with a well-managed network while proactively maintaining a secure network. While remote work has made the network monitoring process more challenging, new cloud-based tools have extended IT teams' reach into home and remote offices to ensure employees are secure and productive.

Introducing the macOS integration for Grafana Cloud

Today, we are thrilled to share that the macOS integration has finally arrived for Grafana Cloud! Thanks to the joint efforts of Grafana Labs’ multiple teams, you can monitor your Mac and gather and visualize metrics and logs with ease. The integration is available in Grafana Cloud, our platform that brings together all your metrics, logs, and traces with Grafana for full-stack observability.

Managing Microsoft Teams Services: Redesigning IT Silos for Cloud-Based Services

IT silos, which originated from operational processes that made sense in an on-premises world, are still a fact today. It’s easy to understand how these silos, which segregate and isolate IT data and operations from other points in the infrastructure, came about.

Top 5 Reasons for "Why AppNeta?"

AppNeta by Broadcom Software is a SaaS platform that enables large enterprises to gain visibility into their business-critical applications as experienced from remote offices and to understand how the networks that drive them operate. Because of our network focus we often get compared to traditional monitoring solutions, but with a quick overview it should be easy to explain to others in your organization how we differentiate.

How AppNeta Drives Business Value

While network and application performance grow increasingly business critical, IT’s ability to track and control service levels continues to be diminished. The shift to hybrid and remote work means users are now highly reliant upon public internet connections, which require additional security at the network edge. Plus, the majority of apps, internal or external, are now cloud hosted.

Get more insights with the new version of the Node.js library

We’re thrilled to announce the release of a new update to the Cloud Logging Library for Node.js. The key new features are improved error handling and writing structured logs to standard output, which comes in handy if you run applications in serverless environments like Google Functions!

Kubernetes vs Docker vs Docker Swarm Differences | Pros & Cons Explained - Sematext

“Kubernetes vs. Docker” is one of the most commonly asked questions by new developers. But what is Docker? And what is Kubernetes? If you want to become a full-stack developer, you will need to understand both of these technologies. In this video, we will take a look at what Docker is and how it fits into a developer's workflow. We will then look at what Kubernetes is and how it relates to Docker. Finally, we will see what the difference is between Docker and Docker Swarm.

The Success of Hybrid Work Hinges on Technology

We’ve been talking about it for more than a year now, but the last few months have only affirmed the fact: hybrid work is here to stay. The myriad research that’s been conducted so far proves that not only do employees want the flexibility of hybrid work, but also that hybrid work has a huge potential to drive better business results.

VLAN monitoring in OpManager

A local area network, what we know more commonly as a LAN, is a network that comprises devices based out of the same geographic location, enabling communication between them. The virtual counterpart of a LAN is a virtual LAN, or VLAN. A VLAN augments a LAN, offering flexibility in making changes, higher scalability, and better security.

Website Testing: 6 Tips to Help You Get it Right

A website's optimization for search engines and SERP ranking is one thing, but its quality assurance is another thing entirely. What's the use of ranking first on search engines but having a website that doesn't function? Website testing monitors if the website performs its functions as intended, assessing if the final output is good enough to present to end-users.

Do More with Less! Use our New GraphQL Query Bot for Cato Networks, Monday.Com, and More

One of our partners in the LATAM region is working with an end customer to implement a custom predictive maintenance dashboard by pulling in and correlating data from multiple sources (like Zabbix, Jira Cloud, Cato Networks, Extreme Networks CloudIQ, etc.) and sending out the data to update a Grafana dashboard, which can read data from OpenSearch. The partner is using our data bots and low-code/no-code pipelines to implement this project.

What's new in Avantra 21.11.5

As Product Manager for Avantra it gives me great pleasure to announce that we’ve just released our newest version of Avantra, 21.11.5, packed full of features, fixes and security updates. For Avantra 21.11.5, we’ve especially been focusing on our automation engine and working with a number of our customers to incorporate their feedback and requirements into how Avantra works. Let’s dive into a few of the newest features that are the most exciting.

Complete guide to GraphQL in Angular [with example]

GraphQL is a query language and server-side runtime for APIs developed by Facebook in 2012. In this guide, we will implement an Angular Apollo GraphQL client with the help of a sample To-Do app. Before we demonstrate how to implement the Angular GraphQL client, let’s have a brief overview of the Angular framework and GraphQL.

2022 Unified NetOps Explainer Video

We continue to unify the DX NetOps platform, built on decades of expertise in the network monitoring space, along with best-of-breed components you have used to manage the performance of your networks for years. As we welcomed AppNeta into the Broadcom Software NetOps family in 2022, we expanded our monitoring coverage into unmanaged networks like ISP, Cloud and SaaS environments.

Monitoring Performance and Errors in a Django Application with Sentry

Sentry is a monitoring platform that allows developers to track errors and performance data. In this tutorial, we’ll show you how to add Sentry to a Django application so that you can track and resolve any errors or performance issues that occur while your application is in production.

Monitoring Azure and Your Entire Hybrid Infrastructure with DX UIM

While you often read about the move to “the cloud,” the reality is that most organizations aren’t moving to a single cloud, but multiple cloud environments from multiple providers. There can be a range of reasons for companies to use cloud services from more than one provider today.

How to monitor MongoDB with OpenTelemetry

MongoDB is a document-oriented and cross-platform database that maintains its documents in the binary encoded JSON format. Mongo’s replication capabilities and horizontal scaling through sharding make MongoDB highly available. An effective monitoring solution can make it easier for you to identify issues with MongoDB such as resource availability, execution slowdowns, and scalability. observIQ recently built and contributed a MongoDB metric receiver to the OpenTelemetry contrib repo.

Windows 11 Preparation: Is Your Organization Ready?

When Microsoft rolled out Windows 11 last fall, the announcement included what seemed like a relatively distant deadline: October 14, 2025, the date when Windows 10 support will end. Considering how many unexpected changes we’ve all experienced over the past few years, we understand if you’re hesitant to prepare for anything that’s not scheduled to take place until the year 2025.

How Elastic powers speed, security, and connectivity in capital markets

Speed is everything in capital markets. Success in the front and back office is dependent on the ability to provide accurate, fast responses to challenging questions. Over the past several decades, there has been a tremendous increase in the amount of information available to market participants, and trade transactions are now being carried out at a very rapid pace. In parallel, the technology which capital markets firms are developing is becoming increasingly complex.

AWS Service Observability using OpenTelemetry

Efficient use of observability statistics is essential to any microservice architecture. OpenTelemetry is a project supported by the Cloud Native Computing Foundation (CNCF) to enhance the observability of microservice projects. AWS Distro for OpenTelemetry (ADOT) is an AWS-supported distribution of the OpenTelemetry project specifically designed to improve the observability of AWS projects.

Observing Chaos: Is It Possible?

Most Series A and B companies are born in the cloud. Instead of the traditional mainframe architecture, they use AWS, Kubernetes and the like to run their production environments. While striving to do things faster and better, we must address the other side of the coin: How do you support the constant shifts inherent in these environments? Chaos engineering allows you to observe your environment continuously and reliably.

NEW Citrix ADC Management Pack for SCOM available

Citrix ADC plays a central role in Citrix environments and its performance impacts the user experience. Therefore, the availability and performance of the Citrix ADC appliance are crucial. The new Citrix ADC Management Pack is a complete monitoring solution providing deep visibility into ADC performance. It extends the end-to-end service monitoring capabilities of Microsoft SCOM to include the Citrix ADC infrastructure.

Karapace Schema Registry add-on for Apache Kafka is now in public preview

We noticed this new announcement for Kafka users which may be worth looking into: “Instaclustr is pleased to announce the immediate availability of a new Kafka add-on, Karapace Schema Registry, in public preview, on our managed platform. This follows our earlier announcement of Karapace as part of our comprehensive Apache Kafka support solutions in mid March.

Uptrends launches new solution for SMS-based 2FA transaction monitoring

As web applications and other digital solutions become more prevalent in everyday life, securing access to these apps against cyber threats becomes an integral part of their design rather than a separate line of thought. Global cybercrime costs are expected to grow by 15% per year over the next five years, reaching $10.5 trillion annually by 2025.

How can observability help telecom providers accelerate 5G monetization

The telecom industry is at an inflection point today, where the endless possibilities of 5G meet the growing challenges of accelerating 5G monetization. This is particularly true for telecom providers who have pumped billions of dollars into building 5G networks. The telecom cloud market is expected to cross USD 74 billion by 2026.

ScienceLogic Earns TrustRadius Top-Rated Status for Third Consecutive Year

There is no more valuable confirmation that your business is doing the right thing than positive feedback directly from your customers. That’s why we are honored and grateful to have achieved TrustRadius “Top-Rated” status across an astounding eight categories.

Introducing Anomaly Advisor - Unsupervised Anomaly Detection in Netdata

Today we are excited to launch one of our flagship ML assisted troubleshooting features in Netdata – the Anomaly Advisor. The Anomaly Advisor builds on earlier work to introduce unsupervised anomaly detection capabilities into the Netdata Agent from v1.32.0 onwards.

Why You Need Full-Fidelity Flow Data For Faster Threat Response

Every second counts when it comes to data theft. So when a breach or an attack occurs, network security teams need to determine what’s happened as quickly as possible. The more time spent in root cause analysis, the more time an attacker has to burrow deeper into or across the network. The greater the overall loss from the attack is likely to be too. That may be financial, customer impact, or brand reputation.

Top 7 Real User Monitoring (RUM) Tools and Software for Better User Experience

As a software-based company, the most critical thing you can do is maintain control over your users' digital experiences and satisfaction levels. However, this is impossible without a monitoring plan and technologies that allow you to see how customers interact with your application or website from their perspective. These tools provide you with the information you need to determine how well your webapp or website is operating and to avoid slow pages or screens that drive customers to your competitors.

Exception Handling in Java (with Real Examples)

Java has been one of the most widely used programming languages among developers worldwide for years. So naturally, it is a popular choice for those beginning their careers in development. Learning Java requires more than just knowing the proper syntax and effective code hygiene. Any developer who hopes to use Java for commercial development must be able to quickly and competently identify and recognize errors in their code.

A Developer's Take On Solving Hard-to-Replicate Performance Problems In-Production

Causal is the business and financial planning platform that allows users to build models effortlessly, connect them directly to their data, and share them with interactive dashboards and visuals. Teams rely on them to build complex models with real business impact, so the UX needs to be fast and reliable, and for the team to guarantee that, they need detailed visibility into the performance of their application.

Ensuring Microsoft Teams Service Quality for the Hybrid Workforce

Managing Microsoft Teams and Microsoft Office 365 application performance requires deep insight into the real user experience as well as coordination between IT departments and service providers that each work with different tools with different objectives. IT teams often struggle to qualify and solve application performance or voice quality issues because available data from user feedback and traditional monitoring tools don’t provide substantial insight into the user experience.

Quickly Turn ALB/ELB Status Codes into an Issue-Seeking Heatmap

More often than not, as developers, when we get a report that a large customer is hitting 502 errors, there's a flurry of activity. What's wrong? Is something deeply broken? So you start digging through AWS logs to see what you can find, but it's hard to reproduce. Sometimes, there's no clear answer, and you move on without any resolution. What if I told you it doesn't have to be this way?

Monitoring COVID-19 virus levels in wastewater using Grafana, Databricks, and the Sqlyze plugin

The new Sqlyze data source plugin (in beta) allows you to connect your Grafana instance to all your favorite SQL databases, many NoSQL databases, and many other non-SQL data sources — from document databases, to ERP systems, to even Slack. You don’t have to know the native query syntax for these data sources; you can just use SQL. The Sqlyze plugin uses ODBC at its core. Hundreds of ODBC drivers are available for various databases/data sources.

Retrace Power User Tips and Tricks - Error and Log Management

The explosive growth of ecommerce has slowed in the last year. But the need for businesses to deliver a great digital user experience continues to grow. Companies that don’t rely on online customer purchases can still suffer blows to revenues due to a poor online experience. Market conditions are raising the importance of Application Performance Monitoring (APM) tools to ensure every digital interaction with your company is positive. APM tools vary by design, features and functionality.

Announcing OpenTelemetry Metrics are Now Available as Release Candidates

Splunk is all-in on OpenTelemetry, as exemplified by our native support for it in Observability Cloud, Splunk Enterprise and Enterprise Cloud’s usage of the OpenTelemetry Collector with Splunk Connect for OpenTelemetry Kubernetes, our long-term ambition to use OpenTelemetry as the main way that all Splunk Products capture data from customers’ infrastructure and applications for analysis, and our massive level of contribution to the project.

What is Cloud Financial Management?

There are few organizations left today without some of their business operating in the cloud. A recent IDG Cloud Computing Study found that 92% of businesses globally moved to the cloud. According to Gartner, cloud adoption spending will surge to about $482 billion by the end of 2022. Most companies make the move to take advantage of the speed, innovation, and flexibility offered by cloud computing solutions. Operating in the cloud can also provide cost savings and improved productivity.

Log Observer Connect: Leverage the power of Splunk Enterprise data in Splunk Observability Cloud

With Splunk Log Observer Connect it’s easier than ever to correlate all of your metric, trace and log data to deliver better customer experiences! Available now for existing Splunk Enterprise and Splunk Observability Customers. Log Observer Connect lets observability users explore the data they’re already sending to their existing Splunk instances with Splunk Log Observer’s intuitive no-code interface integrated in Splunk Observability, for faster troubleshooting, root-cause analysis and better cross-team collaboration.

Top 5 Highlights From Cloud Networking Summit

It’s a fact of life that most multi-cloud newcomers will experience sticker shock at some point. The Infrastructure and Operations leaders we work with at Teneo often tell us that their multi-cloud environments came with hidden costs and complexities way beyond what they’d originally estimated. But it’s the day-to-day management that they and their teams say is most complicated and daunting.

Introducing our Zapier Integration

Now you can get your alerts on incidents anywhere you want! It's very probable that you already know Zapier, but if you don't know it, here's a brief intro. Zapier is one of everyone's favorite tools on the internet. It connects everything! You can easily connect systems that don't speak directly with each other or automate repetitive tasks with no code. That's one of the reasons we decided to create an integration. It was the easiest way to connect IsDown with the world 😁.

New and improved Python error grouping

Raygun has a long history of continually improving the quality and capability of our Crash reporting grouping logic. Across thousands of customers, it’s essential to help teams quickly discern where to allocate their resources to fix bugs, resolve performance issues, and create better experiences for customers. Today, we’re pleased to launch a new and improved grouping system for Python.

AVD - Azure Virtual Desktop - Usage Trends and Statistics

Technology advancements have now made it possible for organizations to adopt virtual desktop technologies in a hassle-free manner, from the cloud. Microsoft’s multi-session Windows desktop technologies supported on Azure cloud allow admins to provision virtual desktops on-demand. Integration with Azure Active Directory and FSLogix ensures secure access with efficient profile management.

5 Things That Should Be Part of Your Next Website Redesign Process

Change is inevitable in this world, and that includes your website. The best way to stay current with the latest trends and technology and meet all of your user experience, speed, performance, conversion, and optimization needs is through a website redesign. A well-designed website will be more user-friendly and conversion-optimized. On the other hand, a website that is not updated regularly can quickly become dated and lose its impact. A poorly designed website will do more harm than good.

Serverless: The Future of the Internet

Serverless is a technique for executing operations and running cloud compute services on an as-needed basis. Serverless computing is the latest trend in the cloud computing world. It has made it much easier to develop, deploy, and scale applications. Serverless computing means that developers don't need to worry about anything other than their code. They don't need to provision a server or install software to run their code.

Top 7 Key Features Your Network Monitoring Software Should Have

With the COVID-19 pandemic, the demand for broadband communication services has soared, with a 60% increase in internet traffic compared to before the crisis. However, with prolonged time spent at home during the lockdown, continuous network connectivity has become the need of the hour for the entire world.

AlertOps Partners With Cisco AppDynamics to Enhance Major Incident Resolution

Chicago, IL – May 17, 2022: AlertOps, a major incident response management platform, announced today a new technology integration partnership with Cisco AppDynamics, the leading Application Performance Monitoring (APM) and full-stack, business-centric observability solution. This new relationship empowers joint AlertOps and AppDynamics users with intelligent alerting, escalation policies, workflows, and scheduling to rapidly remediate major incidents.

How New Relic uses Kentik for network observability

New Relic is known for empowering the world’s leading engineering teams to deliver great software performance and reliability. And the network that delivers that service to New Relic’s users plays a critical role. Hiccups in the performance of the network between New Relic’s mission-critical service and their users can create a cascade of problems.

SAP security audit: How to ensure your SAP system is secure

Because customized applications and modules vary widely from one organization to the next, highly sensitive data, if not secured properly, can be extremely vulnerable. In this article, we’ll discuss SAP security, the process of conducting an SAP security audit, and steps to take to improve your SAP security processes. Take an in-depth look at how your SAP Basis teams can manage SAP system security and compliance with comprehensive SAP security audits.

How to correlate logs and metrics with the Linux Node integration for Grafana Cloud

We are pleased to announce that an upgraded version of the Linux Node integration is available in Grafana Cloud, including the capability to visualize logs that are correlated with previously existing metrics. It also includes a new pre-configured dashboard based on the USE method, which focuses on showing resources utilization, saturation, and errors.

The Hidden Magic of Extensions

The AWS Lambda execution lifecycle has three main phases: initialization, invocation, and shutdown. In the initialization phase, Lambda creates the runtime environment, downloads the code, imports everything needed, and runs the function's initialization code. In the invocation phase, Lambda gets an input, processes it, and produces an output. After the invocation phase, Lambda goes to an idle state and waits for the next input.
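The split between initialization and invocation can be sketched in Python. This is a minimal local simulation, not Lambda itself: module-level code stands in for the initialization phase, `handler(event, context)` follows the conventional Python handler signature, and `INIT_COUNT`, `STATE`, and the event shape are hypothetical names used only for illustration.

```python
import time

# Initialization phase: module-level code runs once per execution environment.
INIT_COUNT = 0  # hypothetical counter, only here to make the phase visible


def _initialize():
    """Stand-in for expensive setup such as loading config or opening clients."""
    global INIT_COUNT
    INIT_COUNT += 1
    return {"started_at": time.time()}


STATE = _initialize()


def handler(event, context):
    """Invocation phase: receive an input, process it, and return an output."""
    name = event.get("name", "world")
    return {"message": f"hello, {name}", "init_count": INIT_COUNT}


if __name__ == "__main__":
    # Two invocations reuse the same initialized environment.
    print(handler({"name": "first"}, None))
    print(handler({"name": "second"}, None))
```

Both calls report the same `init_count`, which is the point of keeping setup work outside the handler: it is paid once per environment rather than once per request.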

How to monitor your Microsoft Azure data via MetricFire

In this article, we will explain what Microsoft Azure is and how you can monitor its data. We will also learn how to integrate Microsoft Azure with MetricFire. Last, we will look at the benefits of using MetricFire’s monitoring solution with Azure metrics. Try the free trial version of MetricFire and enjoy all the benefits of using our system. Book a demo with our team and discuss in detail the process of integrating Microsoft Azure and MetricFire.

Deep Learning Toolkit 3.7 and 3.8 - What's New?

We are excited to share the latest advances around the Deep Learning Toolkit App for Splunk (DLTK). Earlier this year, Splunk’s Machine Learning Toolkit (MLTK) was updated with some important changes. Please refer to the blog post Driving Data Innovation with MLTK v5.3 and the official documentation to learn more about what changes were made and most importantly how they may affect you, especially if you run MLTK models in production.

Benefits of Logging Agents

You probably have heard of logging agents, such as Logstash or Fluent Bit, if you’ve been investigating log analysis, monitoring, and observability. If so, and you’re wondering what logging agents are and why you might need them, you’ve come to the right place. This article will look at what logging agents are for, their advantages, and what you can use instead of a logging agent.

SEO 101: How to Run a Site Audit

SEO, or search engine optimization, has fundamentally transformed the way that companies manage and maintain their digital presence in the modern globalized economy. With the power of Google to direct potential customers to your business entirely based on your relevance to key search terms around your operations, knowing your SEO strengths and weaknesses is a must to keep up with your competition.

Better Together: Combine Real User Monitoring with Synthetics

Synthetic Transaction Monitoring (STM) and Real-User Monitoring (RUM) are two different approaches for monitoring end-user experiences with SaaS, Desktop, and networked applications. But in today's digital world, response time, availability, and work-from-anywhere initiatives are becoming closely aligned. Employees need the flexibility to collaborate from any location without disruption. Organizations must look towards a holistic monitoring strategy to make this feasible every day.

Ingest OpenTelemetry traces and metrics with the Datadog Agent

OpenTelemetry is a Cloud Native Computing Foundation (CNCF) initiative that provides open, vendor-neutral standards and tools for instrumenting services and applications. Many organizations use OpenTelemetry’s collection of APIs, SDKs, and tools to collect and export observability data from their environment to their preferred backend. As part of our ongoing commitment to OpenTelemetry, we are proud to have contributed our distributed tracing libraries to the CNCF community.

Visualize relationships between your Kubernetes resources with Datadog Live Containers

A Kubernetes environment includes a wide range of resources—such as clusters, nodes, and pods—that work together to run dynamic applications at scale. In order to monitor a Kubernetes application effectively, you need a multi-dimensional view into your clusters’ health that encompasses the complex dependency relationships among these resources.

SCOMathon 2022 | Recap

SCOMathon 2022, hosted by SquaredUp and Cookdown on May 10, delivered massive SCOM content packed into 21 sessions, 3 keynotes, and 3 panel discussions. The Microsoft SCOM team, MVPs, experts, and customers discussed all things SCOM, including the latest updates, best practice tips, specific use case scenarios, and lessons learned during their SCOM migrations. 245 attendees joined the sessions to learn and chat with their peers. As every year, a donation is made based on participants per session.

MQ application compatibility across a quarter century

I was working on something recently where I had to upgrade various components in the tooling. And I was getting more and more annoyed that the upgrades broke my existing programs and scripts. None of that was MQ’s fault and I’ll write more about the project once it’s available alongside the newly-announced MQ 9.3. But it got me thinking about the efforts we’ve made to keep MQ application compatibility across its lifetime.

Accelerate Kubernetes troubleshooting with Sysdig Advisor

Troubleshoot Kubernetes problems up to 10X faster. Want to know how? Let’s dig in! Kubernetes is really difficult to operate at scale. When organizations have a problem in Kubernetes, they must use command-line tools, dashboards, and logs to figure out what went wrong. It can take hours to inspect the situation and identify the root cause before you can take action. Problems like CrashLoopBackOff and pending-pod errors can be especially frustrating since there are so many things that can cause those conditions.

Troubleshoot Kubernetes 10x Faster With Sysdig Advisor

Troubleshoot Kubernetes problems up to 10X faster. Want to know how? Let’s dig in! Kubernetes is really difficult to operate at scale. When organizations have a problem in Kubernetes, they must use command-line tools, dashboards, and logs to figure out what went wrong. It can take hours to inspect the situation and identify the root cause before you can take action. Problems like CrashLoopBackOff and pending-pod errors can be especially frustrating since there are so many things that can cause those conditions.

New release: SquaredUp 5.5

We’ve just released SquaredUp 5.5! As always, you’ll find some great new additional features as well as enhancements in the SquaredUp Community, Azure, and SCOM Editions. Plus, the new 5.5 release works with SCOM 2022. Here’s a quick overview of what’s new: Check out the release webinar here or keep reading for all the juicy details.

Sysdig Advisor: Making Kubernetes troubleshooting effortless

The cloud, Kubernetes, CI/CD, DevOps, GitOps… the last five years have seen a huge transformation in how organizations are architecting and shipping applications. It’s hard to keep up with the pace and learn all of this new tech! Nearly 55% of respondents to Canonical’s 2021 Kubernetes and cloud native operations report highlighted how the lack of sufficient in-house skills and people power is the biggest challenge that Kubernetes brings to businesses.

New release of Sysdig Open Source leverages Falco plugins

Sysdig maintainers are thrilled to announce the latest release of our beloved OSS tool for analyzing and/or recording the activity of processes and containers on a Linux system. You can find the full CHANGELOG in the GitHub repository, but here are some highlighted features: Note: Version 0.29.1 was released with a bug fix shortly after we started writing this post.

The future of automation - from slow to hyperflow

Automation has been the harbinger of change since the start of industrialisation. From Roman aqueducts to conveyor belts, and from water wheels to modern computing, automation has represented the pinnacle of human innovation of its time, making it an essential catalyst for change and a critical step to the next industrial revolution. But with mission-critical software applications at the heart of modern industries, automation is not only a pinnacle but a base layer for the fourth industrial revolution.

Tracing a Ruby application with OpenTelemetry for performance monitoring

Ruby on Rails is a popular MVC framework for creating web applications. It is necessary to monitor your Ruby applications for performance issues. In today’s cloud-native and microservices-based architecture, it is difficult for engineering teams to troubleshoot performance issues. Tracing your application can give the much needed context required to troubleshoot performance issues.

Alerting on error log messages in Cloud SQL for SQL Server

With Cloud SQL for SQL Server, you can bring your existing SQL Server on-premises workloads to Google Cloud. Cloud SQL takes care of infrastructure, maintenance, and patching so you can focus on your application and users. A great way to take better care of your application is by monitoring the SQL Server error log for issues that may be affecting your users such as deadlocks, job failures, and changes in database health.

Introducing a high-usage tier for Managed Service for Prometheus

Prometheus is considered the de facto standard for Kubernetes application metrics, but running it yourself can strain engineering time and infrastructure resources when your usage grows. In March, we announced the general availability of Google Cloud Managed Service for Prometheus to help you offload that burden, and today, we’re excited to announce a new low-cost, high-usage pricing tier designed for customers who are moving large volumes of Kubernetes metrics over to the service.

All things logs: best practices for logging and Grafana Loki

What’s the saddest log line in the multiverse? A log line without context. That’s according to Grafana Labs software engineer and Grafana Loki tech lead Ed Welch, who joined Grafana Labs VP of Culture Matt Toback and Engineering Director Mat Ryer for the latest episode of “Grafana’s Big Tent," our new podcast about people, community, tech, and tools around observability.

Event Reduction in Four Easy Ways with Cribl Stream

One of Cribl Stream’s selling points is the reduction of ingested log volume, which helps our customers control costs and improve system performance. This can be accomplished in two ways – either by eliminating duplicate or unnecessary fields and null values within the events, or controlling the number of specific events that actually get sent to the destinations through strategic filtering.
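The two reduction approaches described above can be illustrated with a generic Python sketch. It does not use Cribl Stream's own configuration or API; the function names, the event fields, and the "drop debug events" rule are all assumptions chosen for the example.

```python
def drop_empty_fields(event):
    """Remove fields whose value is None or an empty string."""
    return {k: v for k, v in event.items() if v is not None and v != ""}


def reduce_events(events, keep):
    """Filter events with the `keep` predicate, then trim empty fields."""
    return [drop_empty_fields(e) for e in events if keep(e)]


events = [
    {"level": "debug", "msg": "cache hit", "user": None},
    {"level": "error", "msg": "timeout", "user": "", "host": "web-1"},
]

# Hypothetical filter rule: debug events never reach the destination.
reduced = reduce_events(events, keep=lambda e: e["level"] != "debug")
print(reduced)  # [{'level': 'error', 'msg': 'timeout', 'host': 'web-1'}]
```

Filtering first and trimming second keeps the work proportional to the events that are actually forwarded.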

Cloud SQL: Concepts of Networking

Cloud SQL provides a managed service for MySQL, PostgreSQL, and SQL Server databases as well as backups, high availability, maintenance, and so much more! In this episode of Networking End to End, Lorin Price discusses networking concepts from implementation and security to connectivity on Cloud SQL. Watch along to learn about the options for deploying Cloud SQL and tips on how to determine who and what can access your Cloud SQL instance.

Observability - for your test runs too

“Cloud native” – working in distributed systems using microservices and DevOps – has promised a lot, and indeed delivered a lot. Among the biggest benefits, in a cloud-native distributed architecture it’s easier and more cost-effective to scale parts of an application. When one part fails, it is less likely to impact other services and the services can still communicate with each other.

Uncovering the Manufacturing Security Risks in Today's Hybrid Working and Industry 4.0

Managing cyber risk is now a business imperative for manufacturing organizations to enjoy the benefits of Industry 4.0 and/or the move to long term hybrid working without falling victim to debilitating, expensive and public cyberattacks. As sophisticated threats including ransomware gangs and state-sponsored actors identify manufacturing as a preferred target, manufacturers struggle to respond to this threat with their current security tools and practices.

How to Monitor Microsoft IIS with OpenTelemetry

The OpenTelemetry members at observIQ are excited to add Microsoft IIS metric monitoring support to OpenTelemetry! You can now easily monitor your IIS web servers with the oIQ OpenTelemetry Collector. You can add the IIS metric receiver to any OpenTelemetry collector. This post demonstrates just one configuration for shipping metrics with OpenTelemetry components. This configuration and many other observIQ OpenTelemetry configurations are available in the oIQ OpenTelemetry Collector.

Monitor FoundationDB with Datadog

FoundationDB is a distributed NoSQL database designed to support fully ACID transactions. FoundationDB uses an unbundled architecture that consists of an in-memory transaction management system, a distributed storage system, and a built-in distributed configuration system. This enables developers using FoundationDB to manage and configure each part of their database layer separately to ensure desired scalability, high-availability, and fault tolerance.

Now Monitor with Even More Granularity

StatusGator is a status data platform: We ingest data from almost 2,000 services by extracting and normalizing their official, public status page information. To monitor a service, you simply search through our list of thousands and it's added to your dashboard. From there, you can filter to specific components of a service, such as products or regions. Now, you can subscribe to the same service more than once on a single StatusGator dashboard. What does this mean in practice?

Riverbed Webinar Demo - Dependency Mapping

The worst thing about moving applications and services to the cloud is the hidden costs, as a direct result of degraded application performance and lost productivity. By mapping dependencies prior to a cloud migration, you’ll know which users and services will be affected. That means you can: This technical demo shows you how easy dependency mapping should be before you move applications and services to the cloud.

Riverbed Webinar Demo - Infrastructure Monitoring

Without knowing the status of your infrastructure, how can you troubleshoot P1 incidents quickly and reduce the cost of downtime? A slow response could disrupt your ability to monitor operations remotely and respond instantly to field conditions, meaning you can’t for example: In this technical demo, find out how we make infrastructure monitoring and troubleshooting easy with Riverbed SteelCentral NetIM.

Riverbed Webinar Demo - Security Workflows

Managing aging infrastructure that’s geographically dispersed brings unique challenges to the Oil & Gas industry because obsolete technologies and automation make it particularly vulnerable to cyberattacks. To combat these attacks, leveraging integrated and full-fidelity data can help you to identify the unknown, avoid business disruptions, and help you save time by: This technical overview of Riverbed’s NetProfiler Flow Monitoring solution demonstrates those security workflows, and introduces a helpful threat feed that includes SUNBURST Backdoor.

Securing SD-WAN in a Cloudy World

Today’s cloud-first enterprise must securely connect its workers to their applications, no matter where those applications may live. Only by transforming both the WAN edge and security architectures can the promise of the cloud be fully realized. SD-WAN is your opportunity to re-architect, design and build a network that’s secure and fit for the future. Teneo Inc. and Silver Peak can help to make this a clear reality in our cloudy world.

Node.js Performance Monitoring

Many software developers utilize Node.js to create high-performance backend web applications. It has numerous advantages, including ease of application deployment, asynchronous request handling, great performance, and more. Integrating a solid monitoring solution into your Node.js application is critical since it gives you visibility into what's going on in your application at any given time or over a specific time.

Why Database Monitoring Tools are Important for Senior Business Leaders | Interview with Matt Gordon

Matt Gordon is a Microsoft Data Platform MVP and the Director of Data and Infrastructure at Rev.io. In this short video, he talks us through the 3 key reasons database monitoring tools are essential for him as a Director, and how they help him to lead a team successfully and productively.

How the growing Grafana Observability team restructured themselves successfully

Over the past year, Grafana Labs has grown from 300 to 700 Grafanistas. Moving forward, we expect to continue to maintain a high rate of change, and to sustain that, we need to ensure there is flexibility in how our teams* are set up. The majority of our Engineering squads have changed in size and structure — and the same goes for the Grafana Observability team, where I work.

New observability features for your Splunk Dataflow streaming pipelines

We’re thrilled to announce several new observability features for the Pub/Sub to Splunk Dataflow template to help operators keep a tab on their streaming pipeline performance. Splunk Enterprise and Splunk Cloud customers use the Splunk Dataflow template to reliably export Google Cloud logs for in-depth analytics for security, IT or business use cases.

The Return to the Office: Major Companies Investing in Flexible Workplaces

For much of the past two years, businesses have been looking ahead to an eventual return to the office – a return that has been frequently delayed and disrupted by the unpredictable nature of the COVID-19 pandemic. But with some employees resistant to returning, companies are looking for new options to entice employees back. It may not be a return to the office at all, but instead a movement towards an entirely new way of working together.

Docker Log Rotation Configuration | Container Logging for Beginners - Sematext

Docker logs are one of the primary sources of information developers use to spot problems with their apps. However, Docker log files can get huge in a short amount of time. This is why you absolutely must configure Docker log rotation! 🔥Today, we will explain how to free up space on the host machine of your Docker containers. We will also look at how to set up a centralized logging solution for free to get the most out of your Docker logs.
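As a rough illustration of what a rotation setup can look like, the Python sketch below builds the contents of a Docker daemon configuration file that caps log size for the `json-file` logging driver. The `max-size` and `max-file` options belong to that driver; the specific values (10m, 3 files) and the helper name `log_rotation_config` are assumptions for the example, not recommendations from the post.

```python
import json


def log_rotation_config(max_size="10m", max_file=3):
    """Build a daemon.json-style dict that enables json-file log rotation.

    Values in "log-opts" are written as strings, so max_file is converted.
    """
    return {
        "log-driver": "json-file",
        "log-opts": {"max-size": max_size, "max-file": str(max_file)},
    }


if __name__ == "__main__":
    # On Linux this content conventionally lives in /etc/docker/daemon.json.
    print(json.dumps(log_rotation_config(), indent=2))
```

With these illustrative values, each container would keep at most three log files of roughly 10 MB each before the oldest is discarded.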

10 things you didn't know about LogQL

For this edition of my ongoing Grafana Loki how-to series, I wanted to offer up some helpful — and perhaps surprising — facts about using LogQL, Loki’s query language. In case you’re new to Grafana Loki, it’s a log aggregation system created in 2018, and the Loki team has worked with the community ever since to introduce new features and make it easier to deploy.

5 Common Amazon SQS Issues

Amazon Simple Queue Service (SQS) was one of the first services AWS offered. It’s a managed queuing service that lets you take pressure off your downstream services. You put your items on the queue, and other services can pull them whenever they have the capacity to work on them. It’s a managed service, so you don’t have to install or maintain the software yourself; you just configure a queue and start pushing to and pulling from it. So SQS is very simple to get started with.
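The push-and-pull pattern can be sketched locally in Python. This example uses the standard library's in-process `queue.Queue` purely as a stand-in; it does not call SQS, and the ordering, visibility, and delivery semantics of the real service are not modeled. The function names and the `order-N` items are hypothetical.

```python
import queue


def produce(q, items):
    """Upstream service: put work on the queue without waiting for consumers."""
    for item in items:
        q.put(item)


def consume(q, capacity):
    """Downstream service: pull at most `capacity` items when it has room."""
    pulled = []
    for _ in range(capacity):
        try:
            pulled.append(q.get_nowait())
        except queue.Empty:
            break
    return pulled


work = queue.Queue()
produce(work, ["order-1", "order-2", "order-3"])

first_batch = consume(work, capacity=2)
second_batch = consume(work, capacity=2)
print(first_batch, second_batch)  # ['order-1', 'order-2'] ['order-3']
```

The producer never blocks on the consumer, which is how a queue absorbs bursts that the downstream service could not handle directly.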

How to prepare for a peering-partner business review

Peering is more than just setting up sessions with any AS that will accept one. Peering can involve long-term relationships that require reviews and joint-planning to grow synergy. A critical milestone in any peering relationship is the business review – and when it comes to business reviews, it’s all about preparation. So where to start?

Get visibility into AWS Lambda serverless functions with Elastic Observability

Adoption of AWS Lambda functions in cloud-native applications has increased exponentially over the past few years. Serverless functions, such as the AWS Lambda service, provide a high level of abstraction from the underlying infrastructure and orchestration, given these tasks are managed by the cloud provider. Software development teams can then focus on the implementation of business and application logic.

Webinar Recap: Best Practices for Right-Sizing and Overhauling Your Architecture

Last week, we hosted a webinar on the easiest way to right-size – and safest way to overhaul – your architecture. One of the scenarios we’re seeing come up more and more with prospects and customers is the need to update your architecture, and particularly your security architecture, as new needs and threats arise. As I’m sure you all know, that can be a real hassle, put a strain on your resources, and put your security posture at risk if it isn’t done well.

Ask Miss O11y: Not Your Aunt's Tracing

Dear Miss O11y, How is modern observability using tracing, such as Honeycomb, different from the previous distributed tracing software I'm familiar with, like Dapper, at my company? I haven't really been able to wrap my head around Dapper. Does "advanced" observability mean that it's even more complicated than Dapper is? Auntie Alphabet.

Effective Dashboards to Jump Start Your IT Monitoring Initiatives

Earlier this year ‘MIT Center For Information Systems Research’ released a new research report on effectively using dashboards to jump-start your digital transformation initiatives. The research team found that companies with top-quartile dashboard effectiveness significantly outperformed bottom-quartile companies on internal and external measures of performance. The topics discussed in this report also apply well to IT monitoring initiatives.

4 Different Ways to Ingest Data in AWS OpenSearch

AWS OpenSearch is a project based on Elastic’s Elasticsearch and Kibana projects. Amazon created OpenSearch from the last open-source version of Elasticsearch (7.10), and it is part of the AWS system. The key differences between the two are topics for another discussion, but the most significant point to note before running either distribution is the difference in licenses. Elasticsearch now runs under a dual-license model, while OpenSearch remains open-source.

Icinga integration cases: OpsBridge and ServiceNow

The health of your systems and applications is fundamental for your organization’s infrastructure. Monitoring them indicates if there are any issues that need to be handled before they become serious and affect your customers. This is why companies often use a plethora of monitoring tools that can spot any irregularities as early as possible. Icinga allows you to monitor a lot of different metrics throughout your ecosystem, with various plugins that are ready-to-use.

Circonus Simplifies Monitoring & Observability for Large Enterprises with Spring '22 Release

We’re excited to announce today Circonus’ Spring ’22 Release, which adds capabilities to further support the observability demands of modern-day enterprises. The Spring Release adds features across all four components of the Circonus platform – Ingestion & Alerting, Visualizations and Reporting, IronDB time series database, and the Circonus Analytics Query Language (CAQL) – with a focus on extending coverage and improving ease-of-use.

Implementing OpenTelemetry in a Rust application for performance monitoring

OpenTelemetry can be used to trace Rust applications for performance issues and bugs. OpenTelemetry is an open-source project under the Cloud Native Computing Foundation (CNCF) that aims to standardize the generation and collection of telemetry data. Telemetry data includes logs, metrics, and traces. Rust is a multi-paradigm, general-purpose programming language designed for performance and safety, especially safe concurrency.

The Hybrid Workplace Requires a New Approach to SaaS Observability

Since we were first parachuted into distributed work in 2020, most companies have now adopted a sustained hybrid workplace. According to a recent Gallup survey, there’s no reason to expect this to change anytime soon. Meanwhile, the last couple of years have simultaneously seen an explosion in the use of SaaS. In fact, 2020 was the first year in which the cloud market became larger than the non-cloud market, and SaaS was the leading cause of this.

Technical Metrics to Measure Observability in Marketing

A website's performance can be measured using metrics. Metrics provide information on what is working, what is not, and where improvements are needed. Unlike raw numbers, spreadsheets, or data, metrics aren't as complicated or as time-consuming to use. For observability in marketing, website metrics that measure user engagement are vital. By analyzing metrics, the marketing department will be able to determine which web pages are not providing the company with value.

A Guide to PostgreSQL Performance Tuning

PostgreSQL is the most powerful and versatile SQL database available today. There is a problem that comes with this level of power and versatility. How do the developers of PostgreSQL fine-tune the default configuration for everyone? Well, they can't do it. The issue is that each database differs not only in design but also in needs. Some systems are used to store massive amounts of data that are rarely searched.

Status pages can now be viewed as JSON or XML

Next to a large collection of checks, Oh Dear offers the ability to easily create beautiful status pages. This way, you can communicate the status of your sites and services to your users. Take a look at the Oh Dear powered status pages for Flare and Laravel. Today, we added the ability to view the status page as JSON (or XML if that is your thing). You just have to append /json (or /xml) to the status page URL. So for the Laravel status page, you'll find the JSON at status.laravel.com/json.
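The URL rule above is simple enough to capture in a small Python helper. This is only a sketch: `status_feed_url` is a hypothetical name, the `https://` scheme is an assumption, and the function just builds the address without fetching it.

```python
def status_feed_url(status_page_url, fmt="json"):
    """Append /json or /xml to a status page URL, as described above."""
    if fmt not in ("json", "xml"):
        raise ValueError(f"unsupported format: {fmt!r}")
    return status_page_url.rstrip("/") + "/" + fmt


print(status_feed_url("https://status.laravel.com"))          # https://status.laravel.com/json
print(status_feed_url("https://status.laravel.com/", "xml"))  # https://status.laravel.com/xml
```

Stripping a trailing slash first avoids producing a double slash when the page URL is copied straight from a browser.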

An introduction to trace sampling with Grafana Tempo and Grafana Agent

Greetings friends, one and all! Over here on the Field Engineering team, we’re often asked about tracing. Two questions that come up frequently: Do I need to sample my traces? and How do I sample my traces? The folks asking are usually using tracing stores where it’s simply not possible to store all of the traces being generated. Those are great questions and the answers depend on a few different factors.

Time Saved Monitoring Deployments Is Time Spent Building Better Products

Bigeye is the data observability platform that teams at companies like Zoom and Instacart use to keep their data pipeline fresh, high quality, and reliable. Their customers depend on them to detect problems in their data pipelines 24/7 and to keep data reliable enough for production use cases of analytics and machine learning. In this environment, margins for error are razor thin and waiting for a user to let you know that something isn’t working means it’s already too late.

Up the Creek Without a Paddle: Easing the Strain on Your Analytics Systems

When it comes to your analytics tools, would you say they’re getting easier to manage overall, or is it increasingly difficult? Can you easily scale to meet new compliance requirements, or is there so much custom work required that the pace of change is too much for your team to handle? Do you feel in control over how and where your observability data flows, or do you feel beholden to your vendors? This blog post will shed light on how you can ease the strain on your downstream systems.

TL;DR InfluxDB Tech Tips: Handling JSON Objects and Mapping Through Arrays

There are multiple ways to use Flux to bring in data from a variety of different sources including SQL databases, other InfluxDB Cloud Accounts, Annotated CSV from a URL, and JSON. However, previously you could only manually construct tables from a JSON object with Flux as described in this first example. We’ll describe how to work with three examples with increasingly complex JSON types. First we will describe how to work with these JSON types with metasyntactic examples.

Garbage Collection in Java

Garbage collection in Java is a familiar term in the coding world. You will come across it when learning the Java programming language. Because it’s built into Java memory management, the garbage collector is one of Java’s crucial features. It helps prevent serious errors and allows programmers to create new objects without worrying about unwanted objects.

Supercharge Your SBC Call Detail Records

As Teams Phone becomes the norm in the Enterprise space, managing the quality of service delivery and user satisfaction, whether it’s cloud or connected to the PSTN, is mission critical. Teams PSTN calls are used for just about every type of meeting as well as for Contact Centers, Customer service, town halls and client pitches. Because of this ubiquitous usage, Enterprise IT needs analytics to understand how this service is performing for users and when problems are occurring.

How to Monitor Host Metrics with OpenTelemetry

OpenTelemetry is at the core of standardizing telemetry solutions. At observIQ, we’re focused on building the very best in open source telemetry software. Our relationship with OpenTelemetry began in 2021, with observIQ contributing our logging agent, Stanza, to the OpenTelemetry community. Now, we are shifting our focus to simplifying OpenTelemetry solutions for its large base of users.

5 features that help you power up AWS observability

Before we take a deep dive into the ways to achieve observability, it is important to understand what observability is and how it is achieved. Frequently, observability is confused with monitoring. Observability provides end-to-end visibility into a system’s internal health by using the data it generates: logs, traces, and metrics. In a multi-cloud environment, observability enables you to detect and resolve anomalies.

New: Protect Your Status Page with a Password

We’ve just released a highly requested feature for our public, aggregated status pages: Password protection. Now, StatusGator’s customizable, brandable, aggregated status pages can be protected behind a password. This feature is now available on our Venture plan. StatusGator is a status page aggregator. We make it easy to publish a single page with the status of all your cloud vendors in one place.

The one where the Lloyds Banking Group suffered downtime

In a world where we are so reliant on technology for everything, from doing our weekly grocery shopping to online banking, it’s no surprise that when something goes wrong, it has a huge domino effect. The pressures on apps and online platforms in 2022 are so high because we depend on them almost entirely for all of our day-to-day activities. It’s no surprise, therefore, that when the banking apps suffered partial downtime in March, it felt like Armageddon.

IllegalArgumentException in Java

Let’s look at IllegalArgumentException, which is one of the most common types of exceptions that Java developers deal with. We’ll see when and why IllegalArgumentException usually occurs, whether it’s a checked or unchecked exception, as well as how to catch and when to throw it. We’ll use a few examples based on common Java library methods to describe some of the ways to handle IllegalArgumentException.

Monitor your JumpCloud directory with Datadog

JumpCloud is a cloud-based directory platform that provides a unified approach to Active Directory and LDAP services centered around user authentication and network management. Using JumpCloud, companies can manage and provision user access to software, systems, and networks; enforce compliance with audit trails; and provide a unified login experience through single sign-on (SSO).

Monitor your Elixir application with OpenTelemetry and SigNoz

OpenTelemetry can be used to instrument your Elixir applications to generate telemetry data. The telemetry data can then be visualized using an observability tool to monitor your Elixir application performance. In this tutorial, we will use OpenTelemetry Elixir libraries to instrument an Elixir application and then visualize it using SigNoz. Somewhere during the lifetime of an application, it's inevitable that it will have some performance issues.

Sponsored Post

The Future of AIOps

According to Insight Partners, the AIOps platform market size is expected to grow from $2.83 billion in 2021 to $19.92 billion by 2028 at a compound annual growth rate of 32.2%. This skyrocketing growth is fueled by the IT data deluge outpacing what humans can handle and by the need for resource optimization. Every organization is increasingly producing more IT data, whether in a siloed or unified form.
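
As a quick sanity check on the figures quoted above, the growth rate can be recomputed from the two market sizes. This is a minimal sketch that assumes a seven-year window (2021 to 2028) and annual compounding; the function names are illustrative only.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate implied by a start and end value."""
    return (end_value / start_value) ** (1 / years) - 1


def project(start_value, rate, years):
    """Value reached after compounding `rate` annually for `years` years."""
    return start_value * (1 + rate) ** years


YEARS = 2028 - 2021  # assumed seven compounding periods

# Growth rate implied by the two quoted market sizes (in billions of USD).
implied = cagr(2.83, 19.92, YEARS)

# Market size reached by compounding the quoted 32.2% rate from 2021.
projected = project(2.83, 0.322, YEARS)

print(f"implied CAGR: {implied:.1%}")
print(f"projected 2028 size: ${projected:.2f}B")
```

Under these assumptions, both numbers land close to the quoted ones, so the forecast is internally consistent.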

Updates Paused: How are MSPs Navigating Today's Supply Chain Issues?

The last few years have thrown just about everything they could at the status quo. Shifting climates, political instability, and a global pandemic have all contributed to a broad host of network device supply chain issues. Consumers all over the globe are still affected by computer chip shortages and delays to many other items caused by supply chain issues. And it’s not expected to end anytime soon.

How to collect Prometheus metrics with the OpenTelemetry Collector and Grafana

OpenTelemetry is a set of APIs, SDKs, tooling, and integrations that are designed for the creation and management of telemetry data such as traces, metrics, and logs. One of the main components of OpenTelemetry, or OTel for short, is the OpenTelemetry Collector. The OpenTelemetry Collector, or OTel Collector, is a vendor-agnostic proxy that can receive, process, and export telemetry data.
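
The receive, process, and export stages described above can be pictured as a simple chain. The sketch below is a conceptual toy in plain Python, not the Collector's actual API or configuration; every function name, field name, and the list-based backend are assumptions made for illustration.

```python
def receive(raw_points):
    """Receiver stage: accept incoming telemetry records from a source."""
    for point in raw_points:
        yield dict(point)  # copy so later stages never mutate the input


def process(points, environment):
    """Processor stage: drop malformed records and enrich the rest."""
    for point in points:
        if "name" not in point or "value" not in point:
            continue  # skip records missing required fields
        point["environment"] = environment
        yield point


def export(points, backend):
    """Exporter stage: hand processed records to a backend (here, a list)."""
    count = 0
    for point in points:
        backend.append(point)
        count += 1
    return count


# Hypothetical input: one valid metric point and one malformed record.
incoming = [{"name": "http_requests_total", "value": 42}, {"value": 1}]
backend = []
exported = export(process(receive(incoming), "staging"), backend)
print(exported, backend)
```

Because each stage only depends on the shape of the records, the source and the backend can be swapped independently, which is the sense in which such a proxy is vendor-agnostic.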

Concurrency in Golang: Building a Data Pipeline for Monitoring Microservices from Scratch

Time and resource consumption have become the driving forces of developing modern applications. While building cloud-native applications, it’s important to ensure that you have the most optimized code in place, and oftentimes that means leveraging concurrency. While writing concurrent code may sound overwhelming at first, Golang makes it extremely easy to get a handle on.

Automation and transformation in IT infrastructure with Jordan Lowe

Co-founder and CEO of Deft, Jordan Lowe stops by Network AF to talk to host Avi Freedman about all things IT infrastructure. Previously known as ServerCentral, Deft continues to innovate on its services to make managing IT infrastructure a better experience for the business and those who run it.

What Is Telemetry and Why Is It Important?

Properly leveraging telemetry is a true game-changer for any IT department looking to optimize and stabilize its systems. Telemetry provides the first step to answering the all-important question, “What’s happening in my network?” It’s your eye into the inner workings of your system, giving you a view into how different components are performing.

How Sumo SREs manage and monitor SLOs as Code with OpenSLO

At Nobl9’s annual SLOconf—the first conference dedicated to helping SREs quantify the reliability of their applications through service level objectives (SLOs)—Sumo Logic shared our contribution of slogen to the OpenSLO community, as well as our commitment to OpenSLO as an emerging standard for expressing SLOs as Code. slogen is an open source, SLO-as-code CLI tool based on the OpenSLO specification.

Improve Performance in your iOS Applications - Part 2

The performance of your iOS app is crucial when building and publishing it for any number of users. Your users expect it to be delightful, fast and responsive, so if your app seems sluggish or unresponsive, it will affect your reviews and you might lose valuable users. While solving this for your apps, it’s easy to overlook the influence of the choices made on performance throughout development.

How to Move Ahead of the Three Pillars of Observability

The acceleration in digitalization, also due to the pandemic, has brought an organization’s business and IT teams and strategic goals closer together than ever before. Chief Information Officers (CIOs) have existed since the early 1980s and, until recently, their typical role was primarily focused on managing the technology infrastructure. But in a post-pandemic world, that role is now expanding beyond traditional IT responsibilities.

Building a Stack Overflow browser as a VS extension

I have been writing a couple of integrations with the Stack Overflow API for both the elmah.io app and some public exception pages that we launched recently (like System.DivideByZeroException). For this post, I want to show you how to pull data from Stack Overflow with C#. For demo purposes (and TBH because I wanted to play more with Visual Studio extensions), the sample code for this post will end up in a small Visual Studio extension (VSIX).

A practical approach to Active Directory Domain Services, Part 5: Replication in Active Directory

This blog series on Active Directory Domain Services (AD DS) is designed to help you gain a good working knowledge of what Active Directory (AD) is. Each successive blog sheds light on some aspects of AD. All blogs are curated to include the right mix of AD theoretical basics along with some valuable hands-on exercises. Through the earlier parts of the blog series, it has become clear that AD DS installed in a Windows environment opens up a host of benefits to organizations.

Monitor model performance with Superwise's offering in the Datadog Marketplace

Superwise is a monitoring platform that provides model observability for high-scale machine learning (ML) operations. Superwise provides teams with out-of-the-box (OOTB) metrics on their models’ production behavior, so they can effectively address drift, data quality issues, and other problems before they negatively impact business.

VMware Management Pack Update Release (22.3.2731.0)

VMware administrators can breathe a little easier today with the release of the latest OpsLogix VMware management pack for System Center Operations Manager. The new update includes a new feature for Ransomware vulnerability monitoring, designed to decrease the attack surface for your VMware infrastructure and keep your data safe. This new addition to the popular OpsLogix VMware Management Pack is free for anyone using our solution today.

4 Signs You Have a Cloud Migration Planning Problem

Cloud migration is a complex process, and the more up-front planning you put into it, the more likely you’ll be able to avoid challenges and setbacks during execution. And yet, challenges and setbacks seem to be the norm, not the exception, with almost three-quarters (72%) of those surveyed stating that they’ve run into problems so big they were forced to move migrated applications back on premises or jump into firefighting mode to figure out how to fix them quickly.

New in Grafana 8.5: How to jump from traces to Splunk logs

The recent release of Grafana 8.5 marks the start of enabling the jump from traces directly to Splunk logs. It’s a big leap that now allows you to draw a straight line from your traces (whether they are coming from Tempo, Zipkin, or Jaeger) to even more third-party logging data, all from the comfort of your traces view. Previously, the Grafana trace to logs enablement included only Loki logs.

Demystifying automation in SAP operations

With SAP Automation through Avantra, your business can improve productivity, reduce costs, and reduce risk while improving quality and compliance. But navigating the topic of SAP automation can get tricky sometimes. With a wealth of information available out there, we often ignore the basics. In this session, Tyler Constable, Executive Director - Solution Engineering, Avantra, and Tim Reiss, Technical Customer Success Manager, Avantra, get down to the basics and demystify the world of SAP Automation for you.

Experience-Driven NetOps from Broadcom Software

Experience-Driven NetOps from Broadcom Software delivers the unified end-to-end network visibility needed to understand, manage and optimize the performance of your digital services - whatever network they may be running on. The solution extends your monitoring reach into edge services, multi-cloud and SaaS, home wireless and ISP networks, letting you see every communication path and degradation point for the entire end-user experience.

12 free mini-tools for website owners

Not everyone knows that for quite some time now, as part of our website, we have been providing free tools designed for anyone who runs and promotes their own website. No matter if it’s a blog, a company website, an online shop or a SaaS application – everyone will find something useful here. Over the years we have gathered more than 10 tools, so we decided to remind you about them – or inform you if you don’t know them yet.

Make Meetings Work in the Modern Workplace: Introducing Microsoft Teams Meeting Room Monitoring

Today’s organizations are adapting office spaces and technology to meet the needs of a hybrid workforce. Microsoft is leading the way, bridging the gap between people working remotely and those in the office with the Microsoft Teams Meeting Room solution that allows everyone to be seen, heard, and fully participate from anywhere.

How Offishall Uses DigitalOcean and Papertrail to Simplify Hybrid Work

Paris-based tech startup Offishall is all about simplifying and streamlining modern hybrid work. CTO Bruno Ronzani and his team rely on reliability, speed, and simplicity from DigitalOcean Droplets and Papertrail™ log management. This foundation helps ensure Offishall delivers the seamless web experience their customers—and regional manager Dwight K. Schrute—demand.

Debugging Java Collections Framework Issues in Production

The Java Collections Framework was a huge leap forward when it was introduced as part of Java 2 (JDK 1.2). Thanks to the included collection classes we finally moved beyond the limits of Vector and Hashtable to more mature and generic solutions. With the introduction of streams and functional concepts in Java 8 the framework took everything to the next level. One of the core principles underlying the framework is coding to the interface.

Auth, Org management, Exceptions monitoring & a team workation - SigNal 12

This is our 12th monthly product newsletter, and every month our team has shipped code to make SigNoz better for our users. Our latest release is special! It is not only packed with much-awaited user-requested features, but it is also the first time our team met in person to ship a release together. Yes, after 12 months of seeing each other on Zoom, we finally got a chance to see each other in person during our week-long workation.

Observability Vs Monitoring: What's The Difference?

Clients expect prompt implementation of changes to their software, and this requirement motivates site reliability engineers to incorporate reliability into applications. The healthy practice of observability and monitoring can improve the reliability and security of software systems. Monitoring is the recording and interpreting of data from software systems to keep track of their performance.

Kubernetes Networking: How to monitor Kubernetes using synthetic testing

Kubernetes enables DevOps efficiency by streamlining application and service deployment and management. While this gives greater control, it also makes it harder to monitor the health of the applications and services. Synthetic monitoring simulates network conditions and user actions by running continuous tests from global locations before adverse conditions impact end users.

Troubleshooting Slow Azure Virtual Desktop (AVD) Logons

In order to troubleshoot slow Azure Virtual Desktop logons, it is helpful to have a complete understanding of the application and how it operates. Here, we will outline the architecture and key components of AVD, the impact of DNS on the connection flow, and how eG Innovations offers time-saving tools to troubleshoot problems with AVD before they negatively impact your organization’s productivity.

Monitoring next-generation maritime vessels at Royal IHC with Grafana Cloud

With a storied past in Dutch maritime history, Royal IHC is known for delivering reliable, integrated solutions for their customers. These clients rely on sophisticated vessels to create new ports, maintain navigable waters, clean up pollution, and slow shoreline erosion through the process of dredging, which involves removing sediment and debris from the water.

Monitoring Azure Stack HCI with SCOM 2022

SCOM 2022 has been recently made GA and I, coming from a SCOM background, was pretty interested in checking out the latest SCOM product. Luckily, Azure Stack HCI management packs were recently released too, so it was as if the stars had lined up for me to drop everything else and check out this new management pack with SCOM 2022. And that is exactly what I did!

Instrumenting Your Custom Application Code with OpenTelemetry

Application Monitoring, or Application Tracing, is an important piece of Observability within your application and stack. Application tracing involves installing an API and/or SDK in your application, which then instruments, or wraps, your application code with other code that measures the time spent in certain areas of your code and adds important contextual information to the traces.
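
The idea of wrapping application code with timing and context can be sketched with a plain Python decorator. This is a minimal illustration using only the standard library, not the OpenTelemetry API; `traced`, `RECORDED_SPANS`, and `load_user` are hypothetical names, and the in-memory list stands in for a real tracing backend.

```python
import functools
import time

# Hypothetical in-memory store standing in for a tracing backend.
RECORDED_SPANS = []


def traced(name, **attributes):
    """Wrap a function so each call records a timed span with context."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            span = {"name": name, "attributes": dict(attributes), "error": None}
            try:
                return func(*args, **kwargs)
            except Exception as exc:
                span["error"] = repr(exc)  # keep failure context on the span
                raise
            finally:
                # Runs on success and failure, so every call is measured.
                span["duration_ms"] = (time.perf_counter() - start) * 1000
                RECORDED_SPANS.append(span)
        return wrapper
    return decorator


@traced("load_user", component="database")
def load_user(user_id):
    time.sleep(0.01)  # stand-in for real work such as a database query
    return {"id": user_id}
```

Calling `load_user(7)` returns its normal result while appending one span to `RECORDED_SPANS` with the call's duration and the `component` attribute, so the wrapped function's callers do not need to change.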

Resolving Teams User Experience Problems with Data Rather than Guesswork

Vitrolife Saves More than 10,000 Hours of Microsoft Teams Troubleshooting with Vantage DX. Sweden-based Vitrolife has approximately 1,200 employees spread across 20 countries worldwide. In their hybrid workplace, the IT team felt as if they were working with a blindfold on when it came to figuring out the source of a problem impacting Microsoft Teams performance.

Workflow 2.0

This is a conversation we had with our Engineering, Product, and Design (EPD) organization. We are publishing it as we believe it’s important to our customers and fundamental to our open source approach. You can join the conversation on GitHub. Lately, I am spending a lot of my time thinking about Sentry and its core developer story. I also consider why we haven’t been able to overcome the main challenges we recognized years ago.

Building Oh Dear's new design: Implementing the design

In the previous blog post I gave an introduction about the project setup for the redesign of the new Oh Dear frontend. In this blog post I would like to show you how we are implementing the redesign of the Oh Dear frontend. Feel free to provide feedback on the design choices and statements made in this and future blog posts. We’d love to hear what you think of it.

Run Synthetic tests in your CI/CD pipelines with the Datadog CircleCI orb

CircleCI is a CI/CD service that allows organizations to rapidly build, test, and deploy within their pipelines on a single platform. If you are using CircleCI for your CI/CD pipelines, you can now leverage the Datadog Synthetics CircleCI orb to implement Synthetic tests as part of shift-left testing. CI/CD testing is a widely adopted DevOps standard that helps teams mitigate any potential issues that could arise as a result of faulty code deployments.

Kubernetes Logging with Elasticsearch, Fluentd and Kibana

Kubernetes, a Greek word meaning pilot, has found its way into the center stage of modern software engineering. Its in-built observability, monitoring, metrics, and self-healing make it an outstanding toolset out of the box, but its core offering has a glaring problem. The Kubernetes logging challenge is its ephemeral resources disappearing into the ether, and without some 2005-style SSHing into the correct server to find the rolled over log files, you’ll never see the log data again.

[Webinar] Unlock self-service infrastructure monitoring with the Sensu Integration Catalog

Introducing the Sensu Integration Catalog, a marketplace-like UX for simplifying new user onboarding and deploying production-ready monitoring in a matter of minutes. The Sensu Integration Catalog is also an open marketplace that new and existing users can contribute to by sharing Sensu configurations. Backed by an industry-leading monitoring-as-code solution, Sensu provides new users with a point-and-click interface to get started quickly, while facilitating DevOps and SRE automation best practices.

Python MQTT Tutorial: Store IoT Metrics with InfluxDB

MQTT is a standard messaging protocol used for the Internet of Things (IoT) because it requires minimal resources and can be executed by small microcontrollers found in connected devices. IoT devices have a real need for this type of lightweight protocol because it guarantees fast and reliable communication with minimal hardware requirements, keeping power consumption and manufacturing costs low.

The Online Travel Landscape

The travel industry was hit harder than any other industry throughout the pandemic. Frequent changing of rules and restrictions resulted in multiple setbacks. Travel companies had significant difficulties navigating the field and it wasn’t without devastating losses. The Office for National Statistics (ONS) data shows that turnover in travel and tourism businesses declined to 26.0% of February levels, compared with 73.6% in all other industries.

Flowmon and WhatsUp Gold: Automatic Threat Detection Through Single Pane of Glass

Network Detection & Response (NDR) is a key element that provides an additional level of security across the company-wide network through detection of threats that bypass traditional security measures and materialize in the company’s digital environment. Progress Flowmon ADS (Anomaly Detection System) is a typical representative of an NDR system that combines various detection techniques to ensure that malicious activity is recognized and flagged as a security incident.

Prometheus 2.35 - What's new?

Prometheus 2.35 was released last month, focusing on better integration with cloud providers. It also improved service discovery, performance, and resource usage. One key change was the migration to Go v1.18. It has brought some changes in the support for TLS 1.0, 1.1, and certificates signed with the SHA1 hash function. Welcome to this first edition of What’s new in Prometheus. We love Prometheus, the de-facto open source standard monitoring tool!

How to Effectively Optimize Your Website For Voice Search

Talking to a device, a watch, or a virtual assistant is becoming as everyday an activity as chatting on the phone. While many of us are used to taking part in a live webchat with a real person, speaking to a computer rather than typing on a screen has been an adjustment. But it's not nearly as unfamiliar an experience as it once was, and it has many advantages. Voice search is a convenient, hands-free way to access information, news, entertainment, and of course, websites.

Manufacturing smarter, faster with SAP hyperautomation

Smart factories have made manufacturing smarter. Mission-critical IT applications like the SAP ERP processes are the nerve center of manufacturing industries, ensuring that everything operates like a well-oiled machine, from supply chains and factory floors to finance and payroll systems. A winning manufacturing strategy is therefore dependent on successful management of the SAP landscapes: one that releases capacity from the drag of managing these complex systems and redirects resources to fuel innovation.

Going Beyond CloudWatch: 5 Steps to Better Log Analytics & Analysis

CloudWatch is great – if you require very basic logging and monitoring for the Amazon Web Services (AWS) cloud, at least. However, the reality is that most teams need more than basic logging and monitoring. They may also need to perform log analytics on data sources from outside AWS, which CloudWatch doesn’t support. That’s why, although CloudWatch may be one tool in your log analytics strategy, it probably should not be the only one.

Introducing the official ClickHouse plugin for Grafana

We are delighted to introduce the new first-party ClickHouse plugin for Grafana, developed by Grafana in collaboration with ClickHouse. Grafana is committed to continuing our partnership and maintaining this plugin, and we’re excited to add more features and to grow with ClickHouse. But why Grafana + ClickHouse?

How Moneytoring Improves Your Daily Digital World

Imagine finding yourself doing everyday activities such as buying a soda pop in your favorite grocery store or analyzing your finances on your mobile banking app. You may drink your beverage and continue to enjoy your day. You may finish your task in an exceptionally ordinary fashion. You may also discover that your investments grew a tiny percentage since the last time that you reviewed them.

Azure Cost Management with LogicMonitor

As the future of hosting in the cloud continues to evolve and expand, LogicMonitor continues to work to empower admins with useful insights and valuable data to make smart business decisions. Azure Cost Management with LogicMonitor helps optimize cost when planning future rollouts, auditing others already in-flight, and capturing shareable items on a recurring basis or with live access for accounting or management teams to get quick insight into Azure costs.

Getting Better Sysmon Data Using Cribl Stream

System Monitor, better known as Sysmon, is one of my favorite security datasets. The data is crazy detailed and offers a great way to power security detection and response, since it gives cyber security teams a roadmap to understand exactly what systems or people are doing while they use any Windows operating system. The avalanche of data is the downside, and it is why observability engineers need tools like Cribl Stream to manage and enrich Sysmon data to make it more useful and more cost-effective.

Using synthetics to get the big picture

Nobody actually cares about the network. Provocative words coming from a network visibility company, you might be thinking. However, consider what you’re doing right now. You’re reading a blog on a website, maybe clicking around other tabs, possibly streaming some music, and likely keeping an eye on your work chat. These are all applications, and that’s what we all truly care about, not the plumbing that delivers them.

The Real Benefits of HTTP Monitoring for Businesses

HTTP is one of the most widely used protocols on the internet. Most user-facing applications expose HTTP APIs or apps of some form. HTTP is the basis for the World Wide Web, the tangible, visible part of the internet. However, you can also utilize this technology to test the performance and availability of your web apps.
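
A basic availability and response-time probe of this kind can be sketched with Python's standard library. This is a minimal example under stated assumptions, not a full monitoring tool: `check_http` is a hypothetical helper, and treating any status below 400 as "up" is an illustrative choice.

```python
import time
import urllib.error
import urllib.request


def check_http(url, timeout=5.0):
    """Probe a URL once and report availability and response time."""
    start = time.perf_counter()
    status = None
    error = None
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            status = response.status
            response.read()  # include the body download in the timing
    except urllib.error.HTTPError as exc:
        status = exc.code  # the server answered, but with a 4xx/5xx status
    except (urllib.error.URLError, OSError) as exc:
        error = str(exc)  # DNS failure, refused connection, timeout, etc.
    elapsed_ms = (time.perf_counter() - start) * 1000
    return {
        "url": url,
        "up": status is not None and status < 400,  # assumed threshold
        "status": status,
        "elapsed_ms": elapsed_ms,
        "error": error,
    }
```

Running `check_http` against one of your own endpoints on a schedule and alerting when `up` is false or `elapsed_ms` exceeds a chosen threshold gives a simple form of HTTP monitoring.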

The value of performance mesh testing for ISPs, CDNs, telcos and cloud service providers

As a network service provider, you want your customer to see that you consistently deliver excellent performance. You send your customers periodic reports — but those only provide a snapshot. With synthetic tests, you can present your customers with a dynamic report through a public web page, linkshare or a customer branded portal. Watch this webinar replay to learn how Kentik’s API can be used with network performance meshes in Kentik's synthetic monitoring solution to build a live latency report. Kentik’s Anil Murty and Martin Machacek will show you.

ElasticON Solution Series Keynote: Celebrating 10 Years of Elastic

Learn more about Elastic's origin story and how the world's most popular search engine evolved into the leading platform for search-powered solutions. Since the release of Elastic 7.0 there have been 17 additional 7.x releases. In less than 20 minutes you’ll hear the highlights from two years of Elastic 7.x and explore the latest news from 8.x and what’s to come in the future. Speaker: Mike Nichols, Product Lead - Elastic Security, Elastic

ElasticON Solution Seminar Customer Conversations: Putting Data to Work

Hear firsthand accounts from Elastic customers on how they are using the power of search to solve for unique challenges and to reach new levels of success. Speakers: Ali Nazemian, Chief Technology Officer, Brolly; Kevin Serafin, Director of Incident Response, Ecolab; Matt Riley, General Manager, Enterprise Search, Elastic

6 Ways Topology-Powered Observability Gives Back Time to Your Organization

Having enough time available is a struggle we all experience. Technological innovations enable us to develop and deploy software at lightning speed: Sometimes we can push more to production than our organizations’ IT environments can handle. At the same time, we want to increase customer satisfaction by reducing downtime. But how are you going to keep customer satisfaction rates high if a large majority of incidents are caused by changes?

Digital Resilience a Top Priority as Cloud Spending on the Rise

Investments in cloud computing services have steadily increased over the past few years, largely a result of the rise of the digital workplace and the challenges brought on by remote and hybrid work. But there’s another reason businesses are investing more money into cloud solutions: driven by the chip shortage and subsequent hardware crisis, businesses are looking to build their digital resilience.

What is Kibana? (Updated Guide For 2022)

Kibana is a popular user interface used for data visualisation and for creating detailed reporting dashboards. This piece of software notably makes up a key part of the Elastic Stack alongside Elasticsearch and the extract, transform and load (ETL) tool, Logstash. In this comprehensive introduction to Kibana, we are covering all of the basics that you will need to know as a user considering using Kibana for your log data visualisation and reporting needs.

Monitoring Applications Declaratively with Terraform

Running infrastructure at scale almost always guarantees dizzying complexity and anxiety-inducing pressure to maintain systems in a production environment. This is further exacerbated when multiple delivery teams require slight variations of the same infrastructure components, across several cloud providers, each with a different set of observability requirements. Gradually, production environments become large, unmanageable, difficult to change, and perhaps resembling the figure below.

Kubernetes throttling? It doesn't have to suck!

Kubernetes has a bad habit of throttling CPU resources, with the result that you can suffer severely degraded performance or find yourself paying a fortune for extra, unnecessary infrastructure. Watch this video to learn how K8s clusters protect themselves from what they see as heavy CPU usage, and how you can monitor and troubleshoot the problem. We demonstrate how you can use Netdata to reduce API response times by a factor of 7, and expect to reduce infrastructure resource requirements by 60-75%.

Using SAP in manufacturing for increased productivity

As the landscape continues to become more challenging, it's safe to say automation is the future of manufacturing. Changing customer demands, supply chain constraints and pressure for faster turnaround are some of the realities that companies face. This has created an industry-agnostic need for a hands-off approach to manufacturing. SAP for manufacturing offers a transformative end-to-end manufacturing process for more resilient, future-ready enterprises.

Micro Lesson: Troubleshoot an Incident Using Root Cause Explorer

The video uses a scenario to demonstrate how to use Root Cause Explorer to analyse and troubleshoot an incident faster. The video shows how Root Cause Explorer helps you dig deeper into the relevant logs and traces in order to isolate the root cause using various dashboards.

Survey Review: Key Challenges of Scaling Observability with Cloud Workloads

When you migrated critical infrastructure to the cloud, what were your goals and expectations? Odds are, you hoped leaving on-premises infrastructure would produce significant organizational benefits. You probably figured you’d streamline operations and reduce management overhead. You felt you’d have an easier time meeting business goals. Perhaps most important of all, you likely expected your environment would become less complex, and even cost less to operate.

How to capture Spring Boot metrics with the OpenTelemetry Java Instrumentation Agent

In a previous blog post, Adam Quan presented a great introduction to setting up observability for a Spring Boot application. For metrics, Adam used the Prometheus Java Client library and showed how to link metrics and traces using exemplars. However, the Prometheus Java Client library is not the only way to get metrics out of a Spring Boot app. One alternative is to use the OpenTelemetry Java instrumentation agent for exposing Spring’s metrics directly in OpenTelemetry format.

How to observe your Asterisk instance with Grafana Cloud

Observability and monitoring are a fundamental part of the contact center environment. When there are thousands of live voice and other multi-channel interactions happening, it is crucial to keep a close eye on the system because any issue in service deals an instant blow to the customer experience. Asterisk is a free and open source framework for building communications applications and is sponsored by Sangoma.

Shifting From ITOps to AIOps: Capgemini's Transformational Journey

In a hyper-digital world, business transformation is vital for all organizations to compete and deliver against business and customer demands. If you aren’t innovating and moving forward, you are falling behind as the competition and new market entrants are surely adopting technologies that give them the agility to meet needs and expectations. That means the challenge isn’t just about keeping pace, but about leading the way.

Guide To APM For Marketers

With an increasing number of applications and populated data, monitoring becomes crucial since businesses are challenged to satisfy millions of users simultaneously. In order to detect performance problems in a timely manner, companies require APM tools to collect and process app and user data that is being generated continuously.

Smart, Secure and Sustainable Manufacturing - How Splunk and Google Cloud Are Helping Manufacturers to Skate Where the Puck is Going

Co-author: Alexander Okl, Sr. Partner Development Manager EMEA | Google Cloud at Splunk. “The way we look at manufacturing is this: the strategy should be to skate where the puck is going, not where it is.” - Tim Cook, CEO, Apple Inc. So where is the puck going for manufacturers in 2022 and beyond?

A Pound of Cure - Why Sentry Matters

Benjamin Franklin was a smart dude. Among the many wonderful things he produced was an eternal bit of wisdom - an ounce of prevention is worth a pound of cure. To avoid a bigger problem, spend time early on the things that help you avoid it. That wisdom applies everywhere. Avoid nasty stuff with doctors by exercising and making healthier choices. Avoid getting hit by a car by looking both ways. Avoid sunburn by wearing sunscreen. You get it. So what does this have to do with Sentry? A lot.

When Disaster Strikes: Production Troubleshooting

Tom Granot and I have had the privilege of Vlad Mihalcea’s online company for a while now. As a result, we decided to do a workshop together talking about a lot of the things we learned in the process. This workshop would be pretty informal and ad hoc, just a bunch of guys chatting and showing off what we can do with tooling.

Looming 2022 (and Beyond) Network Security Threats

Every year hackers grow in numbers, aggressiveness, organization, and sophistication. And every year there are new attack types and new areas of IT infrastructure that cybercriminals target. 2022 is no different. We are about a third of the way in already and IT pros and security specialists already have their hands full with new attacks and new issues.

Monitor your .NET apps with the Datadog extension for Azure App Service

Azure App Service is a cloud-based platform-as-a-service (PaaS) for deploying functions, web apps, mobile apps, and other resources. It allows developers to deploy code—using common languages and frameworks—in minutes without worrying about provisioning or managing infrastructure. Developers can then use Azure App Service to scale their services dynamically to meet demand.

Sponsored Post

What is IBM MQ Monitoring?

In order to answer this question, it is best to first explain what IBM MQ is and the benefits that it can bring to a business. IBM MQ (Messaging and Queuing) is a messaging system that enables applications running on different computers to communicate quickly with each other in real-time. This is achieved by exchanging messages using queues, which are processed as and when computing resources and internet bandwidth allow. IBM MQ has been designed to provide high availability and reliability and can be used in a variety of different environments, including cloud computing.

Sponsored Post

How to implement a Blameless Postmortem (part two)

This is Part 2 of a two-part series on Blameless Postmortems. The previous article went into why blameless postmortems are so effective; this second part goes into detail on how to build your own postmortem process and kick it into overdrive. Read Part 1 here. So you've read our first installment and recognized the value of the blameless postmortem for efficiency, culture, and output. Now you're ready to get off the blame train and kickstart a blameless postmortem process of your own. Where to begin?

Monitor Flutter application performance with Datadog Mobile RUM

Flutter is a popular open source framework that allows you to build, test, and deploy high-performance, multi-platform applications with a single codebase. Developed by Google, Flutter is backed by a robust developer community and is compatible with the latest native functionalities, including iOS Metal.

Where's Open Source Observability Headed in 2022?

For the last five years, Logz.io has tracked and measured the pulse of DevOps, as well as adoption of key trends and technology, through our DevOps Pulse survey and report. One of the obvious focus areas for us, as a company whose products are based on industry-leading open source, is the increased rise of incredibly useful open-source observability solutions, in general.

Design choices in ingesting 1 million events/s using Opentelemetry and SigNoz

In this video, Pranay will walk through different design considerations which should be taken into account when ingesting huge amounts of data using OpenTelemetry into SigNoz. He also presents some performance benchmarks we were able to achieve in ingesting around 1mn events/s. This talk was originally presented at Kubernetes Community Days Bangalore 2022.

Book Review: Digital Employee Experience for Dummies (A Wiley Brand)

Wiley’s Dummies series is best known for repackaging technical, nuanced material into practical and accessible lesson books. In partnership with Nexthink, the company’s latest addition, Digital Employee Experience, delivers on this same reputational goal. Most ‘for Dummies’ books are written either from a purely technical perspective or from a higher-level management perspective.

Protect the Business with Cribl Packs: Webinar Recap

The second in our Feature Highlights webinar series, Protect the Business with Cribl Packs, highlights Packs and security use cases. Packs enable you to share complex Stream/Edge configurations across multiple Worker Groups/Fleets, between Stream/Edge deployments or with the Cribl Community. Packs roll up best practices to ensure Site Reliability Engineering (SRE) teams have the required data to protect the business.

Agents of Transformation: Talking about cloud migration with AppDynamics GM Linda Tong and Match.com's Garrick Linn

Finding true love is never easy — nor is migrating to the cloud during a pandemic. Learn how Match.com, with help from AppDynamics, deftly managed its digital transformation while helping humans make meaningful connections.

You build it, you own it - Microservices operations with Datadog Service Catalog (Brooke Chen)

Managing microservices requires understanding many dependencies, both technical and operational. Join Brooke Chen of Datadog as they introduce Service Catalog, a new view combining telemetry, performance, topology, and metadata to enable at-a-glance understanding of even the most complex microservices architectures.

CI/CD Detection Engineering: Dockerizing for Scale, Part 4

Splunk builds innovative tools which enable users, their teams, and their customers to gather millions of data points per second from an ever-growing number of sources. Together, Splunk helps users leverage that data to deliver, monitor, improve, and secure systems, networks, data, products, and customers with industry-leading solutions and expertise.

How to Import/Export Orion Custom Query Widgets

Advanced Orion Platform users are familiar with the power of the Custom Query widget, but getting started can be difficult. Thankfully, you can download pre-existing widgets directly from THWACK to get you started. Then, after you've crafted some of your own, you can return the love and share yours with the community.

Elastic Observability 8.2: Tail-based sampling, plus more serverless visibility for AWS

As more organizations adopt cloud-native technologies and microservices-based architectures, application troubleshooting is becoming increasingly complex. With so many moving parts in an environment that is both dynamic and distributed, it is difficult to get the full picture. Yet complete visibility is crucial in order to find and fix issues quickly — especially ones that impact the bottom line.

Elastic Enterprise Search 8.2: Relevance controls for Elasticsearch

Elastic Enterprise Search 8.2 introduces new ways to ingest, search, and monitor data, giving developers the productivity benefits of using out-of-the-box capabilities along with the power and flexibility inherent in Elastic Stack tools. Operators also gain even more transparency for managing search experiences and observing search performance. For a visual walkthrough of some of the key capabilities in 8.2, check out the latest installment of What’s new in Enterprise Search on YouTube.

Whats new in Elastic Enterprise Search - 8.2

Elastic Enterprise Search 8.2 introduces new ways to ingest, search, and monitor data, giving developers the productivity benefits of using out-of-the-box capabilities along with the power and flexibility inherent in Elastic Stack tools. Operators also gain even more transparency for managing search experiences and observing search performance.

How to Monitor Riak Metrics with OpenTelemetry

observIQ’s OpenTelemetry members contributed Riak metric monitoring support to OpenTelemetry! You can now monitor your Riak agent performance with OpenTelemetry, and deploy simply with the oIQ OpenTelemetry Collector. You can add the Riak metric receiver to any OpenTelemetry collector. This post demonstrates a configuration for shipping metrics to Google Cloud Operations with OpenTelemetry components.

What Is a CMDB and What Role Does It Play in IT?

Organizations of all sizes have a complex array of hardware, software, staff, and vendors. Each of those assets comes with complex configurations and relationships between them. Visualizing and tracking these configurations and relationships over time is critical to quickly responding to incidents. Plus, it helps inform business decisions, especially regarding future IT components and upgrades.

Proactive Monitoring vs. Reactive Monitoring

Monitoring is a fundamental pillar of modern software development. With the advent of modern software architectures like microservices, the demand for high-performance monitoring and alerting shifted from useful to mandatory. Combine this with an average outage cost of $5,600 per minute, and you’ve got a compelling case for investing in your monitoring capability.

Kubernetes Throttling Doesn't Have To Suck. Let Us Help!

In the Kubernetes (K8s) community, there is a huge misconception about CPU allocation and utilization. Even highly experienced SREs find themselves struggling with the way Kubernetes allocates CPU resources, leading to misconfigured CPU allocations and extremely negative outcomes. For starters, this results in significant quality degradation on important service components, introduced by behind-the-scenes CPU limiting (or throttling).

The MSP Provider's Guide to Getting Hired

In the years that I’ve worked with large technology vendors and an MSP provider, there were a few common denominators between the ones that were successful. Partners that were consistently at the top of the food chain were the ones consistently providing value to their customers in some area other than just one specific product or service that was being offered.

Website downtime: The one where Google Maps went down

March saw many of the big tech companies have technical issues with their products and services. The biggest by far involved the colossal Google: Google Maps experienced the much-dreaded website downtime, impacting thousands of users across the globe. It was reported online that Google Maps had suffered a partial outage, meaning that many couldn’t access the location tool. But why and, more importantly, how?

Sponsored Post

WiFi Observability to Boost Employee Digital Experience

We are all moving towards a digital workplace - or a hybrid work scenario. Whatever the case, you can expect end-users to call and complain about a poor WiFi experience. That's because network monitoring needs to be done from their standpoint, not from the enterprise end. Without the correct WiFi observability data, it's challenging to narrow down the root cause of the problem affecting remote employees. And those problems - poor WiFi performance leading to poor digital experience - can be pervasive and persistent.

Aligning Dell's Industry-Leading Storage and Data Infrastructure Portfolio With a Comprehensive Monitoring and Optimization Platform

Dell Technologies World is upon us, and in addition to being a welcome return to the in-person format, it’s also an opportunity for me to reflect on Virtana’s long history with Dell, our integration points, and the synergies with the Dell portfolio.

5 Executive Blindspots Around Hybrid IT Observability

For tech leaders, staying on top of hybrid and multi-cloud complexity with traditional monitoring tools is no easy task -- and can create distinct visibility gaps across your environments. SolarWinds Hybrid Cloud Observability can help put you on the path to better business outcomes.

New in Grafana Tempo 1.4: Introducing the metrics generator

Grafana Tempo 1.4 has been released and features a new optional component: metrics generator, which automatically generates RED metrics and service graphs from your traces. We’re actively rolling out the metrics-generator service to our own Grafana Cloud offering and are looking for Grafana Cloud Traces customers wanting early access. If interested, you can email our support team for more details.

Equipping Developers with the Tools to Succeed at Scale

Forethought is a leading AI company providing solutions that transform the customer experience. As a high-growth startup with 2x annual growth in their engineering team, they faced increasingly complex processes and found that what had worked in the past wasn’t going to cut it anymore.

Modernize Legacy Code in Production - Rebuild your Airplane Midflight without Crashing

I spent over a decade as a consultant working for dozens of companies in many fields and pursuits. The diversity of each code base is tremendous. This article will try to define general rules for modernizing legacy code that would hopefully apply to all. But it comes from the angle of a Java developer. When writing this, my primary focus is on updating old Java 6-era J2EE code to more modern Spring Boot/Jakarta EE code.