Operations | Monitoring | ITSM | DevOps | Cloud

January 2022

The whats, whys, and hows of Windows network monitoring

Microsoft Windows is one of the most widely used operating systems and is preferred by users all around the world. A Windows device is associated with a lot of processes, services, and events that often need to be tracked from a single console. This is where a Windows network monitoring tool comes in handy. A Windows network monitoring tool is used to monitor the availability and performance of Windows devices in a network.

What's new in Sysdig - January 2022

As you already know, the “What’s new in Sysdig” blog team is involving more and more people, and this month is up to me, Giulio Puri. I’m based in Milan, Italy, and I’ve been part of the Sysdig EMEA team since May, 2021 as a Sales Engineer. I’m passionate about technology, innovation and cybersecurity, and in my free time, I love to cook, experiment with recipes and – not always successfully – surprise my friends with new dishes.

The Next Big Thing for CSP Network Monitoring - Autonomous Remediation

CSPs aiming to achieve zero-touch networks are looking for monitoring solutions that go beyond autonomous detection to autonomous remediation. Ira Cohen, Co-Founder and Chief Data Science at Anodot explains the principles behind autonomous remediation and details the needed building blocks and capabilities to achieve it.

A Guide to Service Request Management

It is common for end-users of service providers or vendors to request services to assist them with the services or the products. It is a classic use case wherein service request management enters the picture. The Information Technology Infrastructure Library (ITIL) framework includes service request management as a critical component. In view of the same, we shall walk through the following topics in this blog to understand the fundamentals of service request management.

Sponsored Post

New Security Reference Stack For Modern Enterprise

The security stack is a crucial part of any company’s IT infrastructure. However, Security teams increasingly report that traditional SIEM solution approaches are “costly, complex, and resource-consuming,” according to a recent ESG survey. Fortunately, there has been significant innovation in how firms approach cybersecurity with new cloud-native technologies stack and breaking free from vendor lock-in and giving themselves more flexibility, cost advantage, and future-proofing.

How to build a SaaS application?

SaaS is a new trend that is transforming how companies manage their software. Today, SaaS is one of the most popular ways to deliver your software. SaaS is cloud-based software deployed from a website and is used to provide software solutions at a cost lower than the price of traditional software licensing, including the ability to build and deploy a solution. It is gaining popularity because of its ease of use and flexibility to be updated frequently.

Best Practices in Java Logging for Better Application Logging

Examining Java logs is usually the quickest way to figure out why your application is experiencing trouble, so it's critical to have it in place. Best practices for Java logging can help you troubleshoot and address issues before they affect your users or business. In many circumstances, this entails utilizing a Java logging tool capable of automating your processes and delivering faster and more accurate results than manual logging.

How traceroute in the Synthetic Monitoring plugin for Grafana Cloud helps network troubleshooting

One of the powerful tools available in Grafana Cloud is Synthetic Monitoring, a black box monitoring solution that can provide insights that are hard to get in other ways. It provides a different view of your application by observing performance and uptime externally and from all over the world. As a result, you can build an understanding of what your end users are actually experiencing. However, as great as it is, synthetic monitoring does have limitations.

What Are the Main Points of Failure in the PSTN Supply Chain?

In this series of articles, we are reviewing in detail the PSTN routing for Microsoft Teams voice. In the previous article, we reviewed the challenges IT teams can face when mixing Microsoft Teams, PBXs and PSTNs. Now, let’s take a deeper look at the complexity and potential points of failure in the PSTN supply chain.

What is Network Observability and Why Should You Care

If you haven’t heard of network observability yet, you will very soon. And you’ll be hearing it a lot. Some say it is just marketing hype. Some say networks have always been observable. But network observability is the most important new concept to hit the network performance monitoring space in years. Join Kevin Woods, Kentik director of product marketing, to learn.

[Infographic] AWS Elastic Load Balancing from a Serverless perspective

Load balancing is a significant part of every internet-facing software, and with Elastic Load Balancing (ELB), AWS offers a set of load balancers for every use case. Since our latest update, Dashbird also gives you insights into these ELB services; let’s look at them and see how they can be used in a serverless environment.

From Kálmán to Kubernetes: A History of Observability in IT

You know that observability plays a crucial role in helping to manage today’s distributed, cloud-native, microservices-based applications. But you may be surprised to learn that – despite its close association with modern applications – observability as a concept was born more than a half-century ago. Its origins stretch all the way back to the late 1950s, long before anyone was talking about microservices and the cloud.

How to Simplify Your Out-of-the-Box Alerting with NEW! AutoDetect

Over 85% of global organizations will be running containerized applications in production by 2025 say Gartner, with 4 in 5 enterprises expected to move their workloads from on-premises infrastructure to the cloud. Migration to the cloud has IT admins and/or SREs managing an increasingly complex, hybrid IT environment, with an uphill battle of trying to monitor and troubleshoot their infrastructure components and services in real time.

Logit.io Featured On eChannelNews For New Partner Program Launch

We are excited to announce that Logit.io has recently been featured on e-ChannelNews where our founder Lee Smith was interviewed by the President of TechnoPlant, Julian Lee about our partner program. In this interview, Lee explains more about how the Logit.io platform can assist channel partners to grow their ability to offer enterprise-ready logging and metrics analysis.

Data Visualizations with InfluxDB: Integrating plotly.js

One of the great features of the InfluxData cloud platform is that it comes out of the box with all the tools you need to quickly read and write your data to the database. Here, we’ll walk through creating data visualizations with InfluxDB and plotly.js, a JavaScript graphing library built on top of d3.js and stack.gl.

Dashboard Fridays: Sample Excel Business KPIs Dashboard

The Management Team at SquaredUp needed a dashboard to visualise key metrics related to the different sectors of business to keep an eye on the KPIs and track targets. This dashboard keeps track of the different key targets set for the business over three time periods and highlights variations. The dashboard also plots the trend on different values hit over the time and helps identify fluctuations.

How DX NetOps by Broadcom Software Helps Enterprises Deliver Reliable Digital Services

Tim Diep, Head of Network Operations Solutions at Broadcom discusses the current challenges with deploying and managing new software-defined network architectures and the need for SDN-enabled network observability solutions in order to realize the full investments of your modern network deployments. For more info, visit broadcom.com/netops

How Broadcom Software Helps The Enterprise Secure and Protect the Network Edge

Kieran Taylor, Head of Marketing for Broadcom Software interviews Tim, Diep, Head of NetOps solutions at Broadcom Software on the current challenges of modern architectures like edge networking, and how Broadcom can help the enterprise secure and protect the edge. For more info, visit broadcom.com/netops

How to Monitor SD-WAN Migrations

Monitoring the migration from MPLS to SD-WAN networks is one of the most common use cases for monitoring network performance. When taking on any new migration, it’s important to monitor network performance before, during, and after for complete network visibility. Keep reading to learn how to monitor SD-WAN migrations using Network Monitoring.

The NetOps Expert - Episode 4: Ensure Successful Upgrades With the Broadcom Weekend Upgrade Program

Jeremy Rossbach, Head of DX NetOps Product Marketing and Matt Johnson, Head of Global Support for Broadcom Software, Agile Operations Division, discuss the Weekend Upgrade Program. Broadcom is dedicated to ensuring you are on the latest software releases with the most critical fixes, security enhancements and integrations. That’s why Broadcom Support has developed a Coordinated Upgrade Program that guarantees a successful upgrade of your DX NetOps platform.

What is Distributed Tracing? Key Concepts and Definition

Back in the day, monitoring applications from end to end was—for the most part—significantly easier than it is today. Though the basics of instrumentation and observability for metrics like CPU, memory, and I/O throughput haven’t changed, the way applications are built has changed significantly. There were, at most, a handful of application servers and likely a single database server.

Top 10 Monitoring Features for Multi-Tenant Managed Service Providers (MSPs)

eG Innovations works with Managed Service Providers (MSPs) across the world, who use eG Enterprise to deliver value-added services to improve their customers’ resilience and business outcomes. Many of these service providers choose eG Enterprise for its secure and granular role-based multi-tenancy support. The service provider does not have to configure and maintain one instance of eG Enterprise for each customer.

Introduction to (Performance) Monitoring Metrics

To ensure the reliability and stability of your services, it’s essential to understand the overall health of your infrastructure and systems. That variety of information from your systems helps you to get a proper context during your root cause investigation and react in real time. But also gives you the ability to make changes with confidence, so you don’t encounter the same problem in the future.

How banking giant ING is future-proofing payment processing with Elastic

ING Group is a Dutch-based multinational banking and financial services corporation serving more than 38 million customers globally. It’s one of the biggest banks in the world, and consistently ranks among the top 30 largest banks globally. Our 20-year-old COBOL-based financial messaging system — which provides electronic instructions to enable financial transactions between banks and customers — is slowly becoming obsolete and difficult to integrate.

Amazon S3 Cost Optimization Best Practices

Amazon Simple Storage Service (S3) is an essential cornerstone of AWS and among its most popular service offerings. S3 allows tenants to store, secure, and retrieve data from S3 buckets on demand. It is widely used for its high availability, scalability, and performance. It supports six storage classes and several use cases, including website hosting, backups, application data storage, and data lake storage. There are two primary components of Amazon S3: Buckets and Objects.

Five Key Capabilities You Need to Deliver Modern Managed Services

Enterprises use managed service providers (MSPs) to handle their distributed IT environments in a flexible, responsive, and cost-effective manner. Service providers that help enterprises embrace digital business models and deliver outstanding customer experiences will see faster revenue growth and better profitability.

Broadcom Software Announces Designated Weekend Upgrade Program for DX Unified Infrastructure Management 20.4

Broadcom Software is dedicated to ensuring you are on the latest software releases with the most critical fixes, security enhancements, and integrations. That’s why Broadcom Support has developed a DX Unified Infrastructure Management (DX UIM) Designated Weekend Upgrade Program to assist you with a successful upgrade to DX UIM 20.4.

5 hacks to maximise your website's potential

There are reportedly over 1.7 billion websites, making it almost impossible for yours to stand out unless your name is Jeff Bezos and you started a little-known company called Amazon. That’s why, when our customers asked us what we would suggest to help their website perform better, we thought we’d whip together 8 quick and easy hacks that can easily be implemented but make a big difference.

Top 5 challenges in Ethernet monitoring and how to simplify them with OpManager

An Ethernet connection helps businesses with critical communication, and even a slight interruption can irritate users or result in costly downtime. On top of this, the larger the network, the more complex the Ethernet network becomes.

Elevate AWS threat detection with Stratus Red Team

A core challenge for threat detection engineering is reproducing common attacker behavior. Several open source and commercial projects exist for traditional endpoint and on-premise security, but there is a clear need for a cloud-native tool built with cloud providers and infrastructure in mind. To meet this growing demand, we’re happy to announce Stratus Red Team, an open source project created to emulate common attack techniques directly in your cloud environment.

Monitor CDN performance within your Synthetic tests

Content delivery networks (CDNs) reduce latency by delivering cached data (e.g., JavaScript files, stylesheets, images, and videos) from a network of linked proxy servers to end users around the globe. CDNs help reduce the load on your origin servers and shorten the distance that data needs to travel, thus improving the end-user experience.

Be in charge of your cloud costs with these 2021 releases: CloudSpend recap

The widespread adoption of digital transformation triggered by the global health crisis has created a boom. For some businesses, the transition was smooth, but for others, it was an aggressive shift. Out of all the challenges posed by the transition to digital environments, messy cloud cost management and rocketing cloud bills are the most taxing. According to Gartner, through 2024, 60% of infrastructure and operations leaders will bear cloud costs that hurt their on-premises budgets.

Struggling with blurry website imagery? You're not alone. Here's how to optimize for better image clarity across different browsers.

When it comes to your website, visual content plays a huge role. In a world where it takes our brains only 13 milliseconds to process an image – visuals help narrate your brand story in a quick and visually captivating way. This ability to process images so quickly places even more importance on the need for crisp high-quality content. A website tainted with fuzzy and blurry images can affect engagement and lead to an overall negative experience for visitors.

Improvements Made to AppSignal for Node.js in 2022

During the last few months, we've been working hard on improving our Node.js integration. We've released loads of quality fixes and improvements to our diagnose command, configuration, and general package structure. Today, we'd like to highlight some of the enhancements and fixes that we've recently released.

What Should A Board Know About Tech In 2022? | Splunk & Accenture

There is so much happening in the technology space let alone each individual global market, how does an organisation keep up? What trends do they need to keep an eye on and which ones do they need to invest in? We will discuss some of these issues today. Join Brian Berg, Principal Director at Accenture, Blanca Galletero, Splunk’s GVP EMEA GTM Ecosystem and Mark Woods Chief Technical Advisor EMEA at Splunk as they discuss the topic ‘What should a board know about Tech in 2022?’.

[2022] Best Software Deals and Discounts for Schools and Educational Organizations

Since the pandemic struck, education has moved even further online. Many schools are now actively transforming their programs to ensure uninterrupted remote education of high quality. This means device deployments have commonly grown to 1-to-1 — every student now has a laptop. At the same time, budget constraints don’t allow buying subscription plans for all the software schools and educational institutions need. Sounds familiar?

Video: How to build a Prometheus query in Grafana

Once you have set up your Prometheus data sources in Grafana, it’s time to put them to work. In the one-minute tutorial video below, we show you how to build a query in Grafana 8.3 with Grafana’s easy-to-use Explore mode. Prometheus uses a query language called PromQL. If you are already familiar with PromQL, you can simply enter your query in the text field and run the query.

Stackify vs. New Relic vs. Scout | APM Tool Comparison

Stackify Retrace primarily supports Java, .NET, PHP, Nodej.js, Ruby, and Python applications. New Relic supports Java, node.js, Python, Go, PHP, .NET, and Ruby. On the other hand, Scout APM supports Ruby, Python, Node.js, PHP, Elixir & Phoenix, in addition to Error Monitoring, Database Monitoring and External Services Monitoring.

What's New at observIQ

You may have noticed a few changes around here. If you explore our new website, you’ll notice new products, expansions to our open source libraries, significant contributions to our favorite open source project, OpenTelemetry, and new integrations with Google Cloud. You might just think we’re taking “new year new me” a little too seriously, but in fact we’ve been planning some of these changes for a long time. It all stems from our firm belief in open source technology.

SQL Monitor: Time For a 2nd Look

I’m inordinately proud to work for Redgate Software. One of the biggest reasons for my pride is because I can say, without equivocation, we make fantastic software that will help you do your job better, easier, and faster. However, there was one piece of software, many years ago now, that I wasn’t so proud of. Let me put it this way: I tried to get rid of SQL Monitor.

3 Ways LogStream Can Improve Your Data Agility

Four months into this new gig at Cribl, I wish I could bottle up that “lightbulb” moment I get when walking people through how Cribl LogStream can help them gain better control of their observability data. So I hope the scenario walkthroughs below will capture some of that magic and shed some light on how LogStream can improve your organization’s data agility – helping you do more with your data, quickly, and with less engineering resources.

Cloud Technology Adoption Trends

In the second half of 2021, eG Innovations partnered with the DevOps Institute to conduct an online survey of more than 900+ individuals from Sys Admin, DevOps, SREs, and other IT backgrounds. We asked questions about: Some of the results included: You can download the full survey results here: Cloud Technology Adoption Trends | eG Innovations If surveys and statistics on technology adoption are of interest, we have some other recent ones available, conducted in the last 12 months,.

The Big Takeaways from Cyber 5 2021

If you did your holiday shopping online this year, you’re not alone. Cyber 5, the five days between Thanksgiving and Cyber Monday, represented one-fifth of all eCommerce sales for November and December in 2021 (despite a slight decline in overall spending since last year). Americans shelled out $8.9 billion on Black Friday deals and $10.7 billion on Cyber Monday specials.

A Splunk Approach to Baselines, Statistics and Likelihoods on Big Data

A common challenge that I see when working with customers involves running complex statistics to produce descriptions of the expected behaviour of a value and then using that information to assess the likelihood of a particular event happening. In short: we want something to tell us, "Is this event normal?". Sounds easy right? Well; Sometimes yes, sometimes no. Let's look at how you might answer this question and then dive into some of the issues it poses as things scale-up.

Talent Shortage 2022: Stretching Your Lean DevSecOps Team

The cybersecurity talent shortage is real. As of December 2021, a job-tracking database from the U.S. Commerce Department showed nearly 600,000 unfilled cybersecurity positions. And a 2021 study found that 57% of cybersecurity professionals worked at organizations that have been directly impacted by the cybersecurity talent shortage. Even so, many organizations want to “shift security left” or build security best practices earlier into the software development lifecycle (SDLC).

"What's in it for us?": Putting Users in the Driver's Seat of VDI w/ VMware

Back in 2007, when the VMware team was outlining the benefits of virtual desktop infrastructure (VDI), our presentations included a very specific use case: “global pandemic”. No, we didn’t have a crystal ball through which we could foresee the COVID crisis, more than a decade in advance. But even back then, we were looking at the security benefits of VDI, if global health crisis did suddenly force workforces to go remote.

Techstrong Predict 2022: Future of Observability

In a customer-centric world, observability is mission-critical for delivering great digital experiences. Join the "The Future of Observability" panel discussion to learn: What drivers create the need for observability What steps you should take to reach your observability goals What elements are necessary for an effective observability strategy What the future of observability looks like Panelist: Mitch Ashley - Techstrong Research Lodewijk Bogaards - StackState Brian Dawson - Dawson and Dawson, Inc. Cyrille Le Clerc - Elastic

GripMatix Citrix MP most used SCOM 3rd-party Management Pack for Citrix

The Big SCOM Survey 2021 results are available and we are happy to see that at organizations the use of SCOM is still expected to grow according to more than half of the respondents. It is a survey executed by SCOMathon. But above all, our Citrix Ready Management Pack solution for monitoring Citrix Virtual Apps and Desktops scored as second-most used 3rd-party Management Pack of all 3rd-party Management Packs out there.

Tip: DNS Speed Performance Test

Have you checked your DNS speeds lately? When it comes to online performance, speed is king! The faster your service resolves queries, the quicker end users get to their destination and the happier your customers will be. Here’s the thing: When it comes to DNS speed, every millisecond over your competitors’ speeds could mean a lost customer. The good news is, there’s a way to test your speeds and to see just how your DNS speeds stack up against the competition.

Run Datadog Synthetic tests in your Jenkins pipelines

Continuous integration (CI) has become the mainstream approach to software development as it enables organizations to iterate quickly while minimizing the risk of releasing faulty code. To implement CI, many organizations rely on Jenkins—one of the most mature and widely used automation servers on the market. Jenkins comes with hundreds of community-backed plugins to help you easily integrate it with other tools in your development workflow.

How to Troubleshoot Your Network Using Traceroute

Anyone who has investigated the cause behind a network issue knows how daunting it can be to pinpoint the problem. With so many different variables, the answer can be elusive. In our last blog, we explained how you could test your DNS speeds in PerfOps. Today, we’ll be showing you how to use our Traceroute tool to troubleshoot network issues and improve DNS speeds.

11 of the most costly software errors in history

The mere mention of a serious software error can strike fear into the heart of any developer, project manager or tech leader. The wrong error in the wrong system can be incredibly expensive and difficult to resolve, not to mention humiliatingly public. Catastrophic software errors are mercifully rare these days, but the potential for chaos, PR disasters and spiraling costs still remains.

Broadcom and AppNeta Deliver Industry-Leading Network Monitoring and End-User Experience Monitoring

On December 7, Broadcom announced its intent to acquire privately held AppNeta Inc. headquartered in Boston, MA. AppNeta’s SaaS-based solutions provide enterprise IT teams with precise, end-to-end visibility into network performance from the end-user’s point of view. Combined with DX NetOps by Broadcom Software, AppNeta monitoring capabilities will help enterprises and service providers to more efficiently diagnose and improve network performance for end-users, independent of what network they use to access applications. For more info, visit broadcom.com/netops

The State of AIOps SaaS and the Vacuum Left by Traditional Solutions

Modern workflows are primarily aimed at one thing—reducing operational complexities so that stakeholders can focus on initiatives that boost business and innovation. For IT teams, Artificial Intelligence and Machine Learning play key roles in bringing this goal to life. And even though AIOps is considered to be not yet in mature stages, there is no denying that IT teams that do not adopt AI processes will be left behind. By 2023, the market for AIOps tools is predicted to reach $11.02 B.

What's Next for AIOps? 4 Trends for the Future of AIOps

As an idea conceived by Gartner four years ago, AIOps is already a mature practice. But it is also one that continues to evolve as businesses turn to AIOps to support new use cases, and as AIOps vendors build better and more efficient AIOps tools. That fact begs the questions: what’s next for AIOps? What are the relevant trends that will shape the future of AIOps over the next several years, and how will AIOps use cases evolve going forward?

Prevent Data Downtime with Anomaly Detection

A couple months ago, a Splunk admin told us about a bad experience with data downtime. Every morning, the first thing she would do is check that her company’s data pipelines didn’t break overnight. She would log into her Splunk dashboard and then run an SPL query to get last night’s ingest volume for their main Splunk index. This was to make sure nothing looked out of the ordinary.

Top 5 Tools to Test Your Website in 2022

The reliability of a website affects its earning potential, as every second in the digital world counts. According to a study by BCG and Ryte, every second of loading speed costs from $3,000 to $9,000, depending on the eCommerce industry. That shows that your website has to perform optimally all year round. It's the only way to avoid losing money. Aside from outstanding performance, you need to work on your website's design. Certain tools help you improve it to get more conversions.

Raygun Alerting: Monitor your latest deployment

Modern development teams are shipping code faster than ever before. Having visibility into the issues that will inevitably get introduced into your software is crucial for the development process. Latest deployments for Raygun Alerting helps with just that. Now, you can tick the latest deployment checkbox on all Raygun alert types to only monitor your latest deployment and resolve issues before your customers ever even notice.

4 reasons why network visualization is integral to successful network management

Businesses in today’s world use networks for almost all their operations. As businesses grow and expand with time, so do their needs. As a result, their networks can become increasingly complex and sophisticated. This can result in network administrators having a harder time monitoring devices and identifying faults. These bottlenecks can be circumvented with network visualization.

What is Observability? Benefits, Use Cases & More

The year is over, and the word ‘Observability’ has been one of the buzzwords that kept everyone checking throughout the year for deserving reasons. The organizations do not want to leave any stone unturned to maintain performance and offer robust services from ‘monitoring’ practices to ‘observability’, ‘telemetry’, and visibility capacities. So let’s get into the meaning of each term and understand how they are vital for business growth.

Grafana Tempo 1.3 released: backend datastore search, auto-forget compactors, and more!

Grafana Tempo 1.3 has been released! We are proud to add the capability to search the backend datastore. This feature will also appear soon in Grafana Cloud Traces. If you want to dig through the nitty-gritty details, you can always check out the v1.3 changelog. If that’s too much, this post will cover the big ticket items. You can also register for our upcoming webinar “Distributed tracing in Grafana: From Tempo OSS to Enterprise” on Jan.

Automating Notification & Response with Notification & Collaboration Tools

With the ScienceLogic SL1 platform correlating and contextualizing data to generate actionable events that accurately reflect the issues that need attention, how do you make sure all your engineers and system admins are on the same page?

Datadog Cloud Security Platform

Datadog's Cloud Security Platform—consisting of Cloud SIEM, Posture Management, and Workload Security—delivers real-time threat detection and continuous configuration audits across your applications, hosts, containers, and cloud infrastructure. Datadog derives security insights from your observability data, enabling security and DevOps teams to work together to detect, investigate, and remediate threats.

How managed service provider CTAC onboards customers at lightning speed

As a managed service provider, CTAC provides all kinds of IT services for its clients. Efficient monitoring is crucial for them: that way, they can stay on top of performance issues within their customer's IT environments and, eventually, keep their customers happy. However, because CTAC works with many different clients and platforms (such as Microsoft and SAP), their monitoring is often very siloed - which makes it difficult to get an overall view of the performance and health of their customers' IT services.

Goliath Technologies Announces Release of Citrix Logon Duration Scorecard

Philadelphia, PA – January 25, 2022 – Goliath Technologies, a leader in end-user experience monitoring and troubleshooting software for hybrid cloud environments, announced today the release of their new Citrix Logon Duration Scorecard, expanding on its industry-leading end-user experience reporting and analytics suite. 

An introduction to the Avantra SUSE hardening Add in

Included with Avantra Enterprise edition, Avantra Add ins are pre-packaged best practice scenarios that accelerate your business time to value using our expertise. One such Add in is SUSE hardening and is based on the hardening guide from the makers of SUSE Enterprise Linux. This Add in is a collection of eight custom checks that are designed to be extensible by you to match your organizational requirements.

Using Oracle Cloud as a Data Lake Made Simple With Cribl LogStream

All Cloud providers such as AWS, Azure, Google Cloud Platform, and Oracle Cloud offer Object Storage solutions to economically store large volumes of data and retrieve it on demand. It’s far cheaper to store one petabyte of data in object storage than in block storage. As AWS S3 has become the standard, many on-premise storage appliance vendors have incorporated S3 APIs to store and retrieve data. Oracle wisely continued that trend to OCI (Oracle Cloud Infrastructure).

The Architecture of AIOps: 4 Best Practices

Artificial intelligence for IT operations (AIOps), although the new kid on the DevOps block, is here to stay. It’s the future of DevOps and the future is here with us now. AIOps is just the infusion of artificial intelligence to the practice of DevOps. This practice has shown that it’s of great value to a lot of software development firms. Firms that have already implemented AIOps have recorded a massive boost in overall IT productivity.

5 Ways to Improve Your Application Performance Monitoring

The world and technology keep evolving. Over time, applications with functions ranging from buying and selling online to holding meetings to keeping up with friends and family have progressed. Now, we are able to automate actions that used to be manual or at least perform them in the most efficient way possible. This automation is made possible through the use of our applications. Now imagine one of these applications stops working for just 10 minutes.

The Five Tenets of Observability

A new year is a chance to have a new start, and one thing that it’s a great opportunity to think about is the monitoring and observability platform you’re using for your applications. If you’ve been using a legacy monitoring system, you’ve probably heard about observability all over the ‘net and want to figure out if this is really something you need to care about.

Preventing Network Configuration Drift

A cynical network engineer might say, “configuration drift happens when you take the day off.” Someone changed something they shouldn’t have, and didn’t tell anyone. As a result, the network gets just a little less secure, and a little harder to troubleshoot. And then it happens again. And again. And over time, all those little changes that people thought would never mean anything suddenly add up to a network looking a lot different from what it’s supposed to be.

Does Your MSP Have a Single Point of Failure?

When I’m speaking to any IT solution provider or managed service provider (MSP), one of the most common questions I’m asked is, “What’s the one mistake you’d recommend any IT businesses avoid?” My answer is always this: Not documenting your business. The reason is simple. If you don’t document your business, you’ll inevitably find your business is prone to an SPF—a single point of failure.

Make the most of your observability data with the Data Volume app

As a DevOps, SecOps, or IT operations manager, you're surrounded by all the technology for the systems running the entire organization. This means legacy infrastructure, multi-cloud environments, services, tools, and applications. All of these components generate data—a huge amount of data—some of which you need to leverage for full-stack observability to ensure those systems supporting the business are running efficiently.

Algist Bruggeman Uses Insights from InfluxDB to Optimize Industrial Processes and Production

Founded in 1884 and located in Ghent, Belgium, Algist Bruggeman supplies fresh, liquid, and dried yeast to industrial, semi-artisanal, and artisanal bakeries, as well as to the beer, wine, and pharma industries. Algist Bruggeman is part of the Lesaffre Group, a key global player in fermentation for more than a century. Even with more than a century of industrial production behind it, Algist Bruggeman continues to evolve its manufacturing processes.

Tanzu Observability Brings Full-Stack Monitoring of Kubernetes Clusters for OpenShift

Red Hat OpenShift is an enterprise Kubernetes platform that provides users with a unified cloud experience wherever it’s deployed. VMware Tanzu Observability by Wavefront offers observability and analytics for multi-cloud Kubernetes environments. Now these two products work even better together.

Observability Pipelines for Dummies

How do you get the data out of your infrastructure and applications in order to properly observe, monitor, and secure their running states while minimizing overlap, wasted resources, and cost? Many business folks need a broad category of tools in all their environments to solve challenges such as up and down monitoring, metrics, a time series database (TSDB), log analytics, event streaming, security information and event management (SIEM), user behavior analytics (UBA), and data lakes. The answer to the proposed question to solve these hurdles is using an observability pipeline.

10 Microsoft Teams Performance Use Cases for IT Admins

Dependence on Microsoft 365 and Teams has never been greater, and the pressure is on for IT teams to deliver exceptional user experiences - anytime, anywhere. The modern workplace sees users connecting from the office, home and pretty much any place in between. This hybrid work model has a significant impact on IT, the network and the overall quality of service perceived by the users.

Designing production-ready AWS serverless applications

Serverless has become an increasingly popular paradigm among organizations looking to modernize their applications as it allows them to increase agility while reducing their operational overhead and costs. But the highly distributed nature of serverless architectures requires developers to rethink their approach to application design and development. AWS-based serverless applications hinge on AWS Lambda functions, which are stateless and ephemeral by design.

Best practices for building serverless applications that follow AWS's Well-Architected Framework

In part 1 of this series, we looked at common design principles and patterns for assembling microservices in serverless environments. But when it comes to building serverless applications, designing your architecture is only part of the challenge. You also have to ensure that each of your individual functions and services are secure, reliable, and highly performant—without incurring enormous costs.

StatusIQ: A roundup of our journey in 2021

In between online meetings and chat conversations, we've all embraced the digital way of life and work, and it is here to stay. We may not know any other way to operate businesses in a few years' time except the digital space. This way of life will require clear communication channels for businesses to connect with their users. Keeping that in mind, as well as your feedback in our community, we've shaped StatusIQ to help ease the incident communication process.

Press Release: Kubernetes Management Pack Announcement

Today OpsLogix announces the upcoming release of their new Kubernetes Management Pack. This product is designed to help organizations monitor their Kubernetes clusters using System Center Operations Manager (SCOM). The management pack provides comprehensive monitoring of all aspects of your Kubernetes environment, from individual nodes and pods to entire clusters.

Ask the Product Experts | THWACK Livecast

We were all new once, stumbling in the dark, feeling our ways along the walls looking for the light. Searching blindly isn’t a good way to approach any technology project, so hearing from the experts is one of the better ways to expound on your knowledge. It doesn’t matter if you’re new to monitoring or a long-time SolarWinds professional; we’re sure there are questions you’ve got for our seasoned veterans.

A (de)bug's life: Diagnosing and fixing performance issues in Grafana Loki's read path

Beep, beep, beeeeeeeep. Read path SLO page, again. And I’ve almost found the noisy neighbor! That was me. And will probably be me again at some point in the future. As we continue to scale up the team that builds and runs Grafana Loki at Grafana Labs, I’ve decided to record how I find and diagnose problems in Loki.

Getting Ready for a smooth, speedy migration to the Splunk Cloud Platform

This video shows you how a little bit of preparation before you kick off your cloud migration can lead to a speedy, smooth ride. Additionally, this video will help you decide on your migration strategy that is best for your environment and show you how to assess the efforts required for migrating your environment to the Splunk Cloud Platform.

How Reliability and Product Teams Collaborate at Booking.com

With more than 1.5M room nights booked per day, Booking.com requires a solid infrastructure that’s constantly monitored. And indeed, Booking.com now has a footprint of 50,000+ physical servers running across four data centers and six additional points of presence. The sheer size of this server fleet makes it viable for Booking.com to have dedicated teams specializing into looking only at the reliability of those servers.

Dashbird now integrates with 5 new AWS services

TL;DR: Dashbird launches observability for five new AWS services (ELB, SNS, RDS, OpenSearch, and HTTP API Gateway) to allow for a faster, more secure, and smoother serverless observability experience. Dashbird, the leading monitoring platform for serverless AWS applications, announces five new AWS integrations.

Continuous Service Virtualization, Part 2: Steps for Optimizing DevOps

In my prior blog, Continuous Service Virtualization, Part 1: Introduction and Best Practices, we offered an introduction to continuous service virtualization (SV) and discussed some key best practices. In this, the second and final post in the series, we will discuss the continuous SV lifecycle and how it helps to optimize DevOps and the continuous integration/continuous delivery (CI/CD) pipeline.

Continuous Service Virtualization, Part 1: Introduction and Best Practices

Service virtualization (SV) has evolved as a popular technique and technology over the last decade. Traditionally, SV has primarily been used by testers to simulate other application components that the application under test interacts with. Typically, virtual services have been created and maintained by center of excellence (COE) teams.

Leveraging AIOps to Enable Greater Customer Experiences

As time progresses and competition grows, being “good enough” means that you may be falling behind. Engineers will discover new ways to solve problems, which will enable rapid increases in availability and scalability. With these increases comes more complexity and the generation of more data. Rather than just monitoring the new data and letting the old data sit there collecting dust, you should consider using it to gain maximum insights into your environment.

19 Questions To Ask Your Cloud Cost Management Vendor

Not all cloud cost management tools are equal. Whether you’re in the process of evaluating cloud cost management vendors or already have a tool in place, here are 19 questions you should ask to ensure you have all the capabilities needed to maximize performance and minimize cost of your hybrid cloud deployment across the following.

Communicating to Users During Incidents

Imagine you're having a regular day at work, opening up your browser, double checking something for a client in that web app your team built for them, when suddenly, you see this screen: You hit refresh a few times, just to be sure. Nope. Still down. What happens next depends on how well your team has planned for incidents like this (some folks call it unplanned downtime).

New in StatusGator: Reordering Services

As our public status dashboards have become more popular, so has the ability to customize them. Over the next several weeks, we will be rolling out a series of features that allow more customization of your dashboard. Already we’ve added custom CSS capabilities. Today, we’re rolling out service reordering. Our new dashboard management page has a slimmed-down look.

Improving your team's on-call experience

Your engineers probably dislike going on-call for your services. Some might even dread it. It doesn't have to be this way. With a few changes to how your team runs on-call, and deals with recurring alerts, you might find your team starting to enjoy it (as unimaginable as that sounds). I wrote this article as a follow-up to Getting over on-call anxiety.

The Observability Pipeline

Today’s systems are more distributed, dynamic, and complex than ever before – plus, users have more expectations. Also, the historical reliance on an operations team to monitor, triage, and/or resolve issues has become untenable as the number of services increased. This means that many of the tools that were well-suited before might no longer be adequate.

What are CDN Logs and Why Do They Matter

Content Delivery Network produces numerous log files called CDN logs to deliver video across the internet to our homes and mobile devices. These logs contain crucial information about the CDN servers' performance and video streaming quality. Also, it contains terabytes of data, which has its own set of hurdles in terms of handling it in real-time and performing analytics to understand user experience and network concerns.

Dashboard Fridays: Sample Microsoft Teams Dashboard

Join SquaredUp's Adam Kinniburgh and Purdue University's Daniel Parrott as they showcase this sample Microsoft Teams dashboard used by Purdue to visualize key Microsoft Teams usage metrics for their online classrooms. Built using SquaredUp, this dashboard keeps track of the total number of Teams and which are empty, allowing them to pinpoint issues with the data load to create the teams or update issues. The dashboard also monitors for empty class sections to help identify issues with the class selection process.

Top 10+ Best System Monitoring Software & Tools [2022 Comparison]

It’s virtually impossible to manage today’s complex IT environments at scale without a comprehensive system monitoring solution that allows you to check the health of all your applications and services from a single pane of glass. When your end users are experiencing difficulties, you must have such a tool in place that lets you quickly ascertain and remediate the root cause of the slowdown or error.

Getting over on-call anxiety

You've joined a company, or worked there a little while, and you've just now realised that you'll have to do on-call. You feel like you don't know much about how everything fits together, how are you supposed to fix it at 2am when you get paged? So you're a little nervous. Understandable. Here are a few tips to help you become less nervous.

5 Ways the World of IT Operations Will Shift in 2022 (and Beyond)

The following first appeared on VentureBeat. Despite the economic uncertainty due to the global pandemic, worldwide technology spending in 2021 increased by nearly 9% to $4.2 trillion. The U.S. economy grew sharply across the past three quarters of 2021, leading to stronger consumer spending, higher price inflation, and persistent employee shortages amid the great resignation.

Datadog NPM now supports Consul networking

Consul is a service networking platform from HashiCorp that helps you manage and secure communication between microservices. You can use Consul with Kubernetes, and it supports on-prem, hybrid, and multi-cloud architectures. Consul service mesh provides a control plane which allows you to automate the management of traffic between your services via features like service discovery, DNS, load balancing, and routing.

Our Selenium Synthetic Monitoring Stack

We maintain a highly optimised browser automation stack in order to provide the most stable environment for our customers to run their Selenium scripts in. Our goal is to deliver the best user experience for writing and maintaining a synthetic script and configuring the browser environments it runs in. The synthetic monitor data we produce is used for simulating website processes such as form-based authentication, eCommerce transactions, and regulatory checks.

Harnessing AIOps to Improve System Security

You’ve probably seen the term AIOps appear as the subject of an article or talk recently, and there’s a reason. AIOps is merging DevOps principles with Artificial Intelligence, Big Data, and Machine Learning. It provides visibility into performance and system data on a massive scale, automating IT operations through multi-layered platforms while delivering real-time analytics.

Cloverleaf and Customized Management Packs

Every business is different, and every IT environment has its own set of challenges. Customized SCOM Management Packs are created to meet the monitoring and automation requirements of your company's critical applications. OpsLogix offers a wide range of off-the-shelf monitoring products, but for companies working with niche applications, these are not always applicable.

End-User Monitoring: How it Can Impact Your Business

After the global pandemic and lockdowns, most businesses look onward to online solutions for their applications. They want to take their business online by creating mobile or web applications. Companies are looking to monitor every aspect of the application, including deployment, bugs, API failures, etc. But the most important thing is how the application behaves when it goes into the hands of the end-users.

Network capacity planning made easy in 2022

Didn’t quite get to that task of capacity planning in December? Well, not to worry, this month Kentik has overhauled our Capacity Planning workflow and is introducing a slew of new features to make capacity planning easier and more intuitive than ever before. Those who are familiar with Kentik will know that one of our core offerings is the ability to monitor and plan actual network and interface utilization.

LogStream for InfoSec: VPC Flow Logs - Reduce or Enrich? Why Not Both?

In the last few years, many organizations I worked with have significantly increased their cloud footprint. I’ve also seen a large percentage of newly launched companies go with cloud services almost exclusively, limiting their on-premises infrastructure to what cannot be done in the cloud — things like WiFi access points in offices or point of sale (POS) hardware for physical stores.

How to save on your Azure Monitor and Log Analytics Costs

Thomas Stringer has a couple of great blog posts on how to understand your Azure monitoring costs and also on how to reduce your costs, see Azure Monitor Log Analytics too Expensive? Part 2 – Save Some Money | Thomas Stringer (trstringer.com). In the past I’ve blogged on How to calculate the Azure Monitor and Log Analytics costs associated with AVD (not an easy task!).

How the IT service provider q.beyond thrives with Icinga

We are proud of our many customers and users around the globe that trust Icinga for critical infrastructure monitoring. That´s why we´re now showcasing some of these enterprises with their Success stories. It´s stories from companies or organizations just like yours, of any size and different kinds of industries. Some of them are our long-standing customers, others have just recently profited from migrating from another solution to Icinga.

Ask Miss O11y: Long-Running Requests

You need not fear a long-lived streaming workload. A few simple tricks can transform a request that may not ever terminate for hours or days into something you can get regular health and status updates on. We in fact have one of those continuous processing services—Beagle, our Service Level Objective stream processor—which we’ve instrumented in this fashion.

AI-Powered Monitoring Could Have Saved Millions for Global Bank

As most people were preparing to celebrate the new year, the UK’s Santander Bank was dealing with a crisis. On Christmas day, roughly 75,000 people who received payments from companies with accounts at Santander Bank received a duplicate payment transaction. The total damage amounted to £130m, and recovery in these situations is a painful process for both the bank and its customers.

The Top 5 Use Cases for AIOps Today

By now, you’ve likely heard of AIOps, a technique that promises to inject new levels of efficiency into IT operations with the help of AI and machine learning. But what, exactly, does AIOps mean in practice? Which specific use cases can IT organizations enable or improve with the help of AIOps? Those may be more difficult questions to answer if you have yet to see AIOps at work in your organization.

Kickstart your Splunk App with @Splunk/Create

I’ve been contributing to, and creating, Splunk apps for the better part of the last 10 years. But never have I felt more excited to be a Splunk Developer than right now. One of the primary reasons why I am so excited is because of build tools like @splunk/create. At Splunk, we recognize that developers are so crucial to our entire ecosystem.

Monitoring AWS Spot instances using Sumo Logic

Spot worker nodes on EKS (Elastic Kubernetes Service) are a great way to save costs by allowing customers to take advantage of unused capacity. With Sumo Logic, we have experimented with and adopted spot worker nodes for some of our EKS clusters to see if we can pass along the same benefits. We decided to share some of the learnings, challenges, and caveats with using spot instances along with the monitoring setup.

How to use OpManager as an effective disk space monitor for your network monitoring environment

Disk space availability in servers is crucial. Applications that run on these servers save log files and write data to a database that is also installed on the server; if there isn’t enough disc space, the application may not work properly and may crash. Monitoring disc space is critical for IT administrators to maintain server performance and network availability by preventing a sudden and unexpected lack of server disc space.

8 Best Sentry Alternatives You Should Try

Selecting the best sentry alternatives for error monitoring is likely to be difficult. It might be difficult to sort between the features, benefits, and drawbacks of many software companies and sellers. Let's talk about how to make this process easier by looking at the eight best alternatives to Sentry. Sentry is open-source application performance and error-tracking tool that allows developers to track and fix errors in real-time.

APM Insight: 2021 in Review

2021 was the year of hybrid work. An interesting year full of hope and thoughtfulness, 2021 saw increased office-based collaborations complementing our diverse remote workforce. Through this, we sought to look within to sort our processes and deliver what it takes to ensure the best monitoring experience for our customers worldwide. Here is a quick recap of the APM features we rolled out last year and a brief note on our plans for 2022.

A beginner's guide to network monitoring with Grafana and Prometheus

Networks are the backbone of inter-communications within computer systems and applications. When networks go down or experience any interruption of service, the impact is widely felt and can result in significant service disruptions and lost revenue. This is why network monitoring is mission critical for organizations. Visibility into network performance is key to ensuring that network engineering teams can be more proactive and identify problems before those issues cause outages.

Webhook, Pub/Sub, and Slack Alerting notification channels launched

When an alert fires from your applications, your team needs to know as soon as possible to mitigate any user-facing issues. Customers with complex operating environments rely on incident management or related services to organize and coordinate their responses to issues. They need the flexibility to route alert notifications to platforms or services in the formats that they can accept.

Creating custom notifications with Cloud Monitoring and Cloud Run

The uniqueness of each organization in the enterprise IT space creates interesting challenges in how they need to handle alerts. With many commercial tools in the IT Service Management (ITSM) market, and lots of custom internal tools, we equip teams with tools that are both flexible and powerful. This post is for Google Cloud customers who want to deliver Cloud Monitoring alert notifications to third-party services that don’t have supported notification channels.

Introducing Flexible Subscriptions: Websites Are Dynamic, Monitoring Should Be Too

Have you ever felt limited or “locked into” a fixed SaaS subscription plan? Have you ever been forced into a Sales call only to struggle with the decision – and costs – of upgrading to a higher plan tier to add incremental features or usage you need? Are you subscribed to a SaaS plan today that’s chock-full of features or capabilities you’ve never used (or asked for!) – but are still paying for? If so, you’re not alone.

Modernizing Government Technology: How Federal Agencies Are Progressing on Technology Transformations

When the U.S. Congress passed the Modernizing Government Technology Act (MGT) of 2017 as part of the 2018 National Defense Authorization Act, it established both funding and a process intended to help bring aging federal IT systems and infrastructure up-to-date with state-of-the-art technologies common in the private sector. According to the legislation, the goals of MGT are to.

A Complete Introduction to SQL Server Transactions

One of the most fundamental concepts in any relational database management system (RDBMS), such as SQL Server, is the transaction. During my consulting career, I've seen many performance problems caused by developers not understanding how transactions work in SQL Server. In this tutorial, I'll explain what transactions are, why they're necessary, and how they work in SQL Server.

Wisdom of the Crowds: The Value of User Sentiment Observability

What’s the first thing most people do when they’re unhappy with a business? Take to social media to complain about it. Observing those comments – otherwise known as “user sentiment observability” – gives you a head’s up as to when problems become big enough to impact user experience. How can you monitor that voice of the customer? And why is it important to do so? Let’s take a deeper look at the issues.

Why you need network monitoring?

Network monitoring is a continious analysis of a network to detect and correct any performance issues. Network monitoring involves collecting network statistics to determine the quality of services offered by the network. With tools like Icinga, it’s possible to monitor hardware and software in your network. Espacially in the pandemic when many employees work from home, it’s good to have a tool which checks the network permanently.

ICYMI: Honeycomb Developer Week: The Partner Ecosystem

We know that you value collaboration. That’s why we share incident reviews and learnings—because we believe the entire community benefits by working together transparently. In the spirit of working better together, we invited ecosystem partners from ApolloGraph, Cloudflare, LaunchDarkly, and PagerDuty to present at Honeycomb Developer Week, a three-day event filled with snackable, time-efficient learning sessions to help you uplevel your observability skills.

Show it Off with Splunk TV! More Ways to Display Your Best Dashboards

Splunk TV lets you easily display your data on the big screen to visualize and monitor what’s going on in your business. Splunk TV is optimized for a hands-off experience, with slideshows and automatic scrolling so you can display the most important metrics securely and easily. We’re happy to announce that in addition to Classic (Simple XML) dashboards, we now support Studio Dashboards and IT Service Intelligence Glass Tables.

Monitoring Endpoint Logs for Stronger Security

The massive shift to remote work makes managing endpoint security more critical and challenging. Yes, people were already using their own devices for work. However, the rise in phishing attacks during the COVID pandemic shows that all endpoint devices are at a higher risk than before. Plus, more companies are moving toward zero-trust security models. For a successful implementation, you need to secure your endpoints.

Getting the best out of Samsung Knox management with Mobile Device Manager Plus

In case you missed it, Samsung Knox has verified Mobile Device Manager Plus as a Knox Validated Partner solution. This means that our EMM solution meets its business-level requirements for 2022, and that we support a wide range of features to help you get the best out of all your mobile devices that support Samsung Knox capabilities.

Monitor Dell EMC Isilon with Crest Data Systems' integration in the Datadog Marketplace

Dell EMC Isilon is a petabyte-scale network attached storage (NAS) system that allows you to archive unstructured data. Isilon operates in a cluster to provide high availability, and you can scale up its throughput, IOPS, and storage space by adding nodes to your cluster. Isilon automatically replicates your data throughout the cluster to ensure durability and provides caching to minimize data retrieval latency.

What is ServiceOps?

ServiceOps is a new business technology strategy that combines IT service management (ITSM) with IT operations management (ITOM). ServiceOps is fundamentally about connecting people, processes, and technology that are dependent on one another to enable successful service delivery and make user experiences better with automation, collaboration, and visibility across traditionally fragmented departments.

How to Build the Ultimate Database Monitoring Dashboard

Using the PerfStack feature in the Orion Platform, we will show you how to create the ultimate database dashboard that allows you to correlate performance across your entire IT stack. Quickly see if the problem truly is a database problem or resides somewhere else, such as the networking or system layers.

How to Import/Export Orion Alerts

The Out of the Box alerts on the Orion Platform are good, but alerts for specific needs are better. The flexibility of the alerting engine in the Orion Platform is one of the best ways to tune the platform to your needs. We'll show you how to import alerts from THWACK and how to share any of your excellent alerts with the SolarWinds community. Working with real-world examples is just another way to have your Orion Platform performing properly for your organization.

Have You Forgotten About Application-Level Security?

Security is one of the most changeable landscapes in technology at the moment. With innovations, come new threats, and it seems like every week brings news of a major organization succumbing to a cyber attack. We’re seeing innovations like AI-driven threat detection and zero-trust networking continuing to be a huge area of investment. However, security should never be treated as a single plane.

Patterns for better insights and troubleshooting with hybrid cloud logs

Hybrid and multi-cloud environments produce a boundless array of logs including application and server logs, logs related to cloud services, APIs, orchestrators, gateways and just about anything else running in the environment. Due to this high volume, logging systems may become slow and unmanageable when you urgently need them to troubleshoot an issue, and even harder to use them to get insights.

Uptime vs. Availability

Unlike physical stores and organizations that operate during set hours, the IT world never sleeps. In today’s highly connected digital environment, many believe that when an investment is made in technology, it should be accessible at all times — which is virtually impossible to guarantee. Since disruptions occur, organizations should evaluate the services needed to run operations smoothly. For example, what services are required during an IT service outage to ensure minimal disruptions?

Cheat Codes for Game Development with Sentry and Unity

Whether you’re building the latest FPS or a turn-based classic, you need visibility in how your game is performing on a gamer’s device. Unity is arguably the most popular engine used to develop games so there’s a pretty good chance you, the game developer, are using it. Join Joona Rahko, Principal Software Engineer at Unity, Sentry’s own Bruno Garcia, Mobile Engineering Lead, and Stefan Jandl, Unity SDK Engineer, as they wax poetic about game development, why monitoring matters, and what’s possible with Sentry’s new Unity SDK.

How to Troubleshoot Networks with Employees Working from Home

Since the start of the pandemic, many employees have been working from home, which has changed the way that companies manage their IT services. In this article, we’re running you through how you can use Obkio Network Monitoring to help IT Teams troubleshoot and solve a variety of network problems affecting users working from home.

Efficient Container Monitoring with Pepperdata

Container monitoring strategies and purpose-built container monitoring tools just may be the next hot topics swirling around the Kubernetes discussion forums this year. Over 77% of IT professionals expected to migrate 50% or more of their workloads to containers with Kubernetes by the end of last year. With the rise of container usage growing, having the ability to monitor the performance of your containerized workloads is critical.

Networks and interconnectivity with Hank Kilmer | Network AF Episode 9

In today's episode of Network AF, Avi interviews Hank Kilmer, Vice President of IP Engineering at Cogent. The two discuss Hank's career running major internet backbones, how he got into networking in the late 80's, and his thoughts on mentorship in the networking community. Watch now!

Tonga downed by massive undersea volcanic eruption

On Saturday, the pacific island nation of Tonga was decimated by a massive volcanic eruption that was visible from space. At 5:27pm local time, the underwater volcano Hunga Tonga-Hunga Ha’apai unexpectedly erupted, sending ash and debris for hundreds of miles. As of this writing, all internet and telephone communications between Tonga and the rest of the world are still down.

How to Monitor Calico's eBPF Data Plane for Proactive Cluster Management

Monitoring is a critical part of any computer system that has been brought in to a production-ready state. No IT system exists in true isolation, and even the simplest systems interact in interesting ways with the systems “surrounding” them. Since compute time, memory, and long-term storage are all finite, it’s necessary at the very least to understand how these things are being allocated.

Azure Active Directory (Azure AD) - 101

This is a multi-part series that covers monitoring Microsoft Azure Active Directory (AD). In this blog post, which is part 1 of the series, you will learn about and understand Microsoft Azure Active Directory (Azure AD) and how it is different from an on-premises Active Directory (AD). As technology keeps evolving, companies increasingly look to technologies like Cloud Computing to expand, modernize and stay competitive, and in doing so companies can expose themselves to risks.

Zendesk Plugin: New integration incorporated to Pandora FMS

It is always a luxury to show off a new plugin in Pandora FMS, and for that reason we decided to devote an article in style to this Zendesk plugin on our blog. We will discuss what it is and how it can help us. Step by step, and concisely, so that no one gets lost along the way.

Server Management Software: 5 Tools to Check Out

Around 30 years ago, a server was usually a standalone PC or mainframe that provided only one service. Think of a dedicated mainframe for emails, for instance. Things evolved from this to single standalone hardware that provides multiple services (email, http, ftp combined, for example) through virtual machines that can host multiple operating systems running dozens of services, and up to today, where servers are software defined and can run on anything from your washing machine to a drone.

5 Ways AIOps Tools Can Increase ITOps Productivity

There’s no doubt that the software development and engineering industry has become one of the fastest evolving industries in the world. This greatly affects the way things are done within the industry. The rapid change within the software industry can be said to be in an exponential form. These changes are evolving so quickly that you can measure and track them within daily timelines as compared to the growth of other industries.

Automic Automation Kubernetes Edition v21

Kubernetes has become a fixture in production for most IT Operations teams. The VMware “State of Kubernetes 2021 Report” shows a distinct shift towards a reliance on Kubernetes, with almost two thirds of respondents now saying they use it in production. Companies with over 500 developers are driving this adoption, with 78% reporting that they run mostly- or all-containerized workloads in production.

The Delicate Art of Monitoring Kubernetes

The process of monitoring servers and applications has undergone many transformations throughout the years. When it began, the main question was whether the server was up or down. Now, monitoring helps answer questions about the internal state of an application and infer its status (also called white box monitoring). Monitoring today's complex infrastructure systems can be just as much an art as a technical skill.

Distributed Healthcare Network Monitoring is Possible (with the Right System)

Healthcare SysAdmins haven’t just taken the concept of digital transformation to heart—they’re one of its largest proponents. Quality care improves patient outcomes, so healthcare organizations have embraced the idea of highly connected and digitally enhanced environments to help deliver that care. But while healthcare might often be at the forefront of the digital transformation and tech adoption wave, that doesn’t mean the process isn’t fraught with challenges.

Eight best practices for a successful cloud migration strategy

As a result of the pandemic, we are all navigating an unpredictable mix of virtual, hybrid, and in-person conditions in our business and personal lives. This situation isn’t going away any time soon. The pandemic has prompted businesses across all industries to accelerate their digital transformation initiatives, where the cloud is critical. On-demand self-service environments provide a reason for cloud migration as cloud architectures help businesses reinvent and address uncertainties.

Telegraf Best Practices: Config Recommendations and Performance Monitoring

Telegraf has reached the ripe old age of V1.21.2. Thanks to community feedback and contribution, there have been many features added over the years. Lately, I have seen these questions pop up. If any of these questions plague your mind, have no fear — this blog is here to help! Here are my golden rules for maintaining best practices when building your Telegraf solution.

The optimum page speed for your website and how to achieve it

Website page speed has become more important than ever, especially now Google’s Core Web Vitals are pushing mobile-first and penalising websites it deems to be “too slow”. This means that there is a direct link between slow page speed and lower SEO rankings, and therefore potentially, less traffic to your website.

DevOps State of Mind Episode 7: How AI and ML Can Empower Your Developers

Nick Durkin is the Field CTO and VP of Field Engineering at Harness, where he focuses on solving the technical challenges that are standing in the way of true innovation. Today, we're talking about how Harness uses AI and ML to remove the most annoying parts of your job so that you can focus on what you do best.

The growing demand for multi-cloud monitoring and Site24x7's featured releases: 2021

Catalyzed by the pandemic, 2020 and 2021 witnessed an exponential growth in digital transformation and cloud computing. According to Forbes, this growth is expected to continue in 2022. Site24x7 added more integrations and features to its cloud monitoring portfolio to keep up with the business trends and enterprise demands. Let's dive deep into Site24x7's multi-cloud monitoring releases in 2021 and shed some light on what you can expect from us in 2022.

Implementing SRE at the largest online retailer of NL and Belgium w/ Bart Enkelaar (bol.com) | EP #5

For the fifth episode of the StackPod, we invited Bart Enkelaar. Bart is a lead SRE at the largest online retailing platform in the Netherlands and Belgium: bol.com. He's been a backend engineer for 13 years and is now responsible for setting up site reliability engineering across more than a hundred DevOps teams. In this episode, Bart and Anthony talk about.

How to Diagnose Internet Problems in your Network | Obkio

Many Internet problems are intermittent which means that they appear in your network for a short time, and then reappear when you least expect them to. So you need the right tools to diagnose Internet problems. Choose a Network Monitoring Tool Obkio continuously monitors end-to-end network performance with synthetic traffic using Network Monitoring Agents. Monitoring Agents monitor network performance from the source up to the destination to identify network issues, diagnose connection problems, and collect information to help you troubleshoot.

Lightrun For Application Security - Detecting, Investigating and Verifying Fixes for Security Incidents Using Lightrun

Cover major milestones in app security: finding the issue, evaluating a breach, proving it and validating the fix. We didn’t design Lightrun for this task, but it rises to the challenge. I’m not a security expert. I’d like to think of myself as a security conscious developer, but this is a vast subject with depth and breadth. What I understand is Lightrun and Debugging. In that capacity, I can show some creative ways you can use it as a security tool.

8 Signs You Have a Cloud Optimization Problem

In a survey we conducted last year, we discovered that many enterprises overestimate their ability to optimize their hybrid cloud infrastructures. More than three-quarters of respondents gave themselves high marks for a range of capabilities. Here’s the exact breakdown of the number who rated their abilities a 4 or 5 out of 5: This sounds like great news, except there’s a problem. When we asked about how easy it is to get a global view of cloud costs, 42% said that it takes some effort.

Benchmarking Prometheus-compatible time series databases

Some time ago, Aliaksandr Valialkin published a medium post about comparing VictoriaMetrics and Prometheus resource usage when scraping metrics from thousands of targets. He used node_exporter as a source for metrics to scrape, which is very close to most real-world scenarios. However, the benchmark itself was just a bunch of scripts and a lot of manual work for every test.

Percepio Joins the Zephyr Project

Last summer, we added support for the open source Zephyr RTOS to Tracealyzer. Today, in recognition of the potential of Zephyr to become the leading independent platform for small IoT devices, Percepio joined the Zephyr Project as a Silver sponsor. Percepio made significant contributions to the Zephyr source code, extending the tracing subsystem to allow full tracing of Zephyr applications in line with the capabilities of Tracealyzer.

Customize Your Status Page

Many customers have asked us how they can customize the look and feel of their public status dashboard. These pages have become a popular feature of StatusGator. In addition to notifications about the services you depend on, StatusGator provides you single page that you can publish to your users or team aggregating the status of all your services.

Operations Analytics - The Next Big Thing

Cloud Data Warehouses (CDW) were designed to support business intelligence use cases focused on historical data analysis, but less so on “what is happening now?” class of queries. We think operational analytics are the next big focus and we want to discuss the space and how enterprises will connect their operational data to these new tools to get results right now instead of next week.

Dashboard Fridays: Sample SolarWinds Orion Nodes Dashboard

This SolarWinds Orion dashboard gives an overview of nodes alongside a summary of health, using the Web API and dashboard sharing capabilities of SquaredUp. Using the powerful Web API tile in SquaredUp, the SolarWinds dashboard connects to the SolarWinds Information Services REST API endpoint, using SQL queries in the request data, in real-time. Join Adam Kinniburgh and Customer Solutions Engineer Casey as they showcase how it was made, the challenges it solves, and their top tips for building it yourself.

Communicating to Users During Incidents

Imagine you're having a regular day at work, opening up your browser, double checking something for a client in that web app your team built for them, when suddenly, you see this screen: You hit refresh a few times, just to be sure. Nope. Still down. What happens next depends on how well your team has planned for incidents like this (some folks call it unplanned downtime).

New Analyst Report: The State of Microsoft 365 Performance Management

Martello recently commissioned EMA to conduct an entirely independent exploration of the state of Microsoft 365 and how enterprises are managing its performance and user experience in today’s modern workplace. With special attention given to Microsoft Teams, EMA set out to gauge the criticality, impact, performance, management, challenges, and best practices of Microsoft 365.

All about the Grafana Labs Hackathon 2.0

After the success of our first company-wide hackathon last June, we committed to hosting more hackathons each year. So in December, Grafana Labs invited the company to once again press pause on the daily grind and commit five days to our second hackathon. And the Grafanistas showed up: 148 staffers (almost 20% more than the last round) signed on for the week-long event that involved virtual brainstorming, collaborative coding, and creative presentations.

Delivering Seamless Customer Experiences in the Financial Services Industry

A good customer experience is one of the most important metrics of success for financial services, whether it's in person over the phone or on a device. And to deliver information transactions and interactions quickly and efficiently to your customers, you need to rely on a vast collection of interconnected technologies that work seamlessly together.

Network AF, Episode 8: Staying curious with Ron Winward

In the first podcast episode of 2022, Avi welcomes Ron Winward to Network AF! Ron is the vice president of network services at INAP, global provider of secure, performance-oriented, hybrid infrastructure. Like Avi, Ron also grew up in Pennsylvania and is a member of the East Coast Access of Infrastructure.

Volvo Uses InfluxDB to Evolve Its DevOps Monitoring to Enable Data-Driven Decisions

Production delays or stoppages are the bane of any manufacturer. When you’re a global automaker like Volvo, even the smallest delays can have significant ripple effects. But not even global leaders are immune to IT issues. This was the situation Volvo faced several years ago. It had a legacy DevOps monitoring solution in place for the previous 15–20 years, but that system no longer met the company’s needs. On the surface, it seems like a robust system.

In the need for speed, filmstrip and other powerful tools offer better transparency into website speed metrics

Talk about performance — and about how to make website content load faster — has increasingly been centered around optimizing web experiences leading to higher end-user engagement. More specifically, around metrics leading to website speed, greater conversions and ROI. Seeing what slows down your website and taking action will lead to better results overall.

How We Implemented a Zero-Error Policy Using Coralogix

With dozens of microservices running on multiple production regions, getting to a point where any error log can be immediately identified and resolved feels like a distant dream. As an observability company, we at Coralogix are pedantic when it comes to any issue in one of our environments. That’s why we are using an internal Coralogix account to monitor our development and production environments.

How Can Enterprise Organizations Reduce DevOps Tool Sprawl?

The world revolves around DevOps tools. DevOps engineers go insane when they have too many tools. The first statement is correct. Also, the second one. Tooling that helps in the automation of software development and infrastructure provisioning workflows and pipelines is critical for both the engineers who create the automations and the developers who use the automated workflows on a daily basis.

Learning from the AWS Outage: Internal Monitoring Alone Isn't Enough

If you have set up your own monitoring services with Amazon CloudWatch, Azure Monitor or another internal tool, we suggest you consider looking beyond the horizon. These services often provide internal web monitoring only. Perhaps they validate HTTP availability from locations outside their networks, but HTTP checks won’t give you a 360º view into the state of your services.

Virtual offsite ideas that work: How the Grafana Cloud team brings together 150 people online

It was a Wednesday in November, and we had just wrapped Grafana Labs' third virtual Grafana Cloud offsite of 2021. Outside my window, it was a dark and cold (8 degrees Celsius) night in Cologne (Köln), Germany. In Austin, Texas, it was early afternoon and headed for 80 degrees Fahrenheit. In Cape Town, South Africa, it was a windy and cool spring evening. And in Melbourne, Australia, our final speaker — who was up very early at 5 a.m. — was heading into a cool spring day.

How to Monitor Any SaaS Application Without Scripting Requirements

Exoprise lets IT administrators easily monitor any SaaS application (internal or external) without scripting. Collect performance data metrics (login time, upload and download time, network path performance, connect time, proxy connect time, DNS lookup time, TTFB, etc.) using our Web Login sensor. The sensor auto detects credential forms, supplies login credentials, and is able to record the server uptime and availability statistics in real-time. All this without the need for any complex scripting. It's simple to deploy the sensor. Start web application monitoring right away and improve the end-user digital experience.

What is Network Performance Management and how is it evolving in the cloud era

Join Michael Patterson, Kentik network technologist, to find out what NPM has become and why legacy solutions met their demise. Get in-depth details about these three must-have monitoring techniques: Watch this Kentik webinar replay to learn: Why these technologies bring deeper visibility into how your company’s internet connections and applications are being impacted and by whom. How to identify the third parties to reach out to when problems occur and see where to focus your optimization efforts.

Best Practices for Maximizing the Value of Situation Alarms

Today, IT operations teams have to process large volumes of events or alarms in near real-time in order to protect service levels, stay competitive, and deliver a great experience to customers. If it takes too long for teams to spot and repair issues, an organization runs the risk of significant business service downtime, SLA penalties, and brand reputation damages. As IT landscapes continue to grow in scale and complexity, guarding against these risks becomes increasingly difficult.

Logit.io Launch New & Improved Alerting Features

We are pleased to announce that we’ve recently launched new and improved alerting features which have been rolled out to users across all of Logit.io’s operating regions. As part of these improvements, we have sought to improve platform usability and have now included a new menu from which users can readily configure a number of popular alert types straight from our pre-configured templates.

What is an Access Control List (ACL)?

A commonly used tool at the Cisco command line is the access control list (ACL). At their simplest, access control lists are collections of IP addresses that are used by a router, switch, or a firewall to identify network traffic that must be handled in a special way. Cisco and other network vendors use ACLs for many different purposes. This article focuses on IOS access control lists as used on Cisco routers, although much of this discussion applies to Cisco switches as well.

HighByte and InfluxDB Provide Critical OEE Data for Manufacturing Companies

HighByte is an industrial software company based in Portland, Maine building Industry 4.0 based solutions that address the data architecture and integration challenges inherent in manufacturing. The company developed the first DataOps solution purpose-built to meet the unique requirements of industrial assets, products, processes, and systems at the Edge.

Transforming Education and Government IT and Cloud Environments

State, local government, and educational institutions have unique IT requirements and regulations that traditional commercial organizations don’t face. From specific IT budgeting processes to strict compliance, public sector organizations are constantly forced to maximize every IT resource in their complex IT environments.

Run Synthetic tests in CI/CD with the new Datadog GitHub Action

Testing early and often throughout the software development process (shift-left testing) helps teams stay agile and reduce the time it takes to validate and release new updates. Datadog Synthetic CI/CD Testing enables you to implement shift-left testing throughout your CI/CD pipeline so that your team can prevent faulty code deployments from degrading your end user experience.

SNMP monitoring solution: Significant monitoring insights in a click

Simple Network Management Protocol (SNMP) is a networking protocol that aids in the transfer of data among devices, thereby managing and monitoring devices present in the internet protocol network. Networks have an array of devices connected to them, and new devices get introduced to them as trends in technology evolve. These new devices are often used to simplify complex processes but end up complicating simple networks.

How did Site24x7 monitor your infrastructure in 2021?

2021 was not much different from 2020 concerning the pandemic. However, it did give many the opportunity to focus on the way forward with offices around the globe opening up, at least partially. At Site24x7, we also started to focus on monitoring other IT components along with remote work infrastructure. Grab a look at our infrastructure monitoring releases if you missed our announcements and “what's new” updates.

Top tips to extend SCOMs monitoring

In today’s monitoring world most SCOM admins are pulling data from a range of tools and applications, as well as cloud platforms. It can be difficult to sift through all this data, let alone distill it down into a useful format. SCOM is a great resource for making this happen, and if it’s done well can be a superb monitoring hub for your other applications. But how do you get SCOM to consume data from other sources?

A successful Monitoring as a Service Case: Refinery and Base Oil Industry

Saving time, money, and resources while keeping your IT infrastructure better monitored than ever – that’s what one of our customers in the refinery and base oil industry did. By implementing our Monitoring as a Service, they managed to turn unstructured monitoring into high quality, consistent such. Their biggest pain point was the lack of time allocated for monitoring activities.

How to deploy the Google Cloud Ops Agent with Ansible

Site Reliability Engineering (SRE) and Operations teams responsible for operating virtual machines (VMs) are always looking for ways to provide a more reliable, more scalable environment for their development partners. Part of providing that stable experience is having telemetry data (metrics, logs and traces) from systems and applications so you can monitor and troubleshoot effectively. Many Google Cloud services, including Google Compute Engine, provide basic system metrics out of the box.

Configuring Grafana Tempo and Linkerd for distributed tracing

Anders Østhus is a DevOps Engineer on the Digital Tools team at Proactima AS, a consulting firm based in Norway that offers services and expertise in risk management, cybersecurity, healthcare, environmental solutions, and more. It can be difficult to orient yourself in the distributed tracing space, and getting all the parts of a tracing setup to play well with each other can be a bit tricky. But the benefits of tracing are undeniable.

Collecting Metrics from Windows Kubernetes Nodes in AKS

Windows applications constitute a large portion of the services and applications that run in many organizations. When moving to a Kubernetes-based architecture, there is a need to support these as well. Up until April 2020, the lack of container support within the Windows operating system left Linux container images as the only viable option for Kubernetes container deployment.

What is End User Experience Monitoring? How Does It Help?

End user experience monitoring is a mindset and a philosophy. It’s the acknowledgement that IT is not the outcome, but rather a means to an end. Think of it this way: IT is here to support business operations. It does so by delivering technology to the tech-dependent workforce so employees can do their jobs seamlessly. Therefore, the most important thing to monitor in the IT ecosystem has shifted. It’s not the network, the device, or the cloud – these are only delivery mechanisms.

Payment Optimization with AI-Based Analytics

The fintech market grows larger and more diverse each day. The financial news website Market Screener says the global fintech market will be worth $26.5 trillion by 2022, with an average annual growth rate of 6%. In Europe alone, the use of financial technology increased by 72% during 2020. Competition in this market segment is also on the rise.

The Importance of Network Insights in Achieving Full End-To-End Observability

When we talk about observability, we tend to focus first and foremost on the metrics, logs, and traces that you can collect from applications – such as request rates, error rates, and request duration. Infrastructure-level metrics, like CPU and memory utilization, might factor into the discussion as well. Here’s a third category of critical observability insights that teams tend to overlook: the network.

Refined User Experience, New Executive Visibility, and Enhanced Cloud Monitoring with Splunk Enterprise Security 7.0

Just like that, another year has gone by full of remote work, virtual conferences, and lengthy Zoom calls. And, although we were not able to see our fellow Splunkers in person at.conf21 that didn’t stop us from previewing the latest enhancements to Splunk Enterprise Security. And now, it gives us great pleasure to announce that Enterprise Security 7.0 is available!

Netreo Full-Stack Monitoring and Observability Suite Achieves Veracode Verified Standard Recognition

Netreo, the award-winning provider of IT infrastructure monitoring and observability solutions and one of Inc. 5000’s fastest growing companies, today announced that the Netreo full-stack IT infrastructure monitoring and Retrace by Netreo full lifecycle APM solutions have both earned Veracode Verified Standard recognition for proven security practices in application development.

Top Dashboards For Remote Digital Employee Experience Monitoring

Your business is already operating in a hybrid model, isn’t it? Perhaps, you are deciding to become an entirely remote workforce. Whatever be the case, IT must support employees working from multiple locations and track remote digital employee experiences using real-time dashboards. In this article, we are going to cover three Digital Experience Monitoring dashboards to help your team identity and troubleshoot problems that your employees might face in their home environment.

How Automated Application Monitoring Saves Time and Money

With the increase in demand for technology and quality assurance, companies are looking to improve their products day by day. For many, time and workforce are limited, so companies are interested in solutions that automate their day-to-day work. Whether developing software or testing it, almost every part of application lifecycle management is reliable on the human workforce. But some parts like monitoring, tracking, and alerting can be automated using some latest software.

Top 5 user-requested synthetic monitoring alerts in Grafana Cloud

We often hear from Grafana Cloud users who are asking for guidelines on how to write better alerts on synthetic monitoring metrics and get notified when synthetic monitoring detects a problem. We already ship a predefined alert in Grafana Cloud synthetic monitoring. A predefined alert that we ship is alerting on the probe_all_success_sum metric and makes use of the alert sensitivity config to create multiple Grafana Cloud alerting rules. Check out synthetic monitoring alerting docs for details.

Comparing REST and GraphQL Monitoring Techniques

Maintaining an endpoint, especially a customer-facing one, requires constant monitoring, whether using REST or GraphQL. As the industry has looked for solutions to build a more adaptive endpoint technology, it is also a must to monitor these endpoints. GraphQL and REST are two different technologies that allow user-facing clients to link to databases and platform logic. Both GraphQL and REST include monitoring techniques.

Data Federation and the Modern Enterprise

In our increasingly hyper-connected, data-dependent world, it can be difficult to keep track of where resources are, how to access them, and how to put data assets to work to run a more efficient and reliable enterprise. Traditional approaches to IT operations analytics are becoming outmoded as the sources and types of data grow more mobile, ephemeral, diverse and distributed.

How to measure the performance of a website

If you’re a person who works from home, you almost certainly have to deal with occasional internet connection issues. More often than complete outages, you’re likely dealing with occasional slowness. And you know from experience that any one of dozens of devices and services along the path can cause latency.

Why cloud native requires a holistic approach to security and observability

Like any great technology, the interest in and adoption of Kubernetes (an excellent way to orchestrate your workloads, by the way) took off as cloud native and containerization grew in popularity. With that came a lot of confusion. Everyone was using Kubernetes to move their workloads, but as they went through their journey to deployment, they weren’t thinking about security until they got to production.

Configure Cribl LogStream to Avoid Data Loss With Persistent Queuing

Preventing data loss for data in motion is a challenge that LogStream Persistent Queues (PQ) can help prevent when the downstream Destination is unreachable. In this blog post, we’ll talk about how to configure and calculate PQ sizing to avoid disruption while the Destination is unreachable for few minutes or a few hours. The example follows a real-world architecture, in which we have.

What Is IT Infrastructure Monitoring? Here's Everything You Need to Know

IT infrastructure is the backbone of any IT organization. An IT infrastructure comprises multiple components such as hardware assets, networks, applications, etc. And for an IT organization to function smoothly, it is important for all these components to function properly—autonomously as well as with each other. But how do we know if our IT infrastructure and its components are functioning properly? Through IT infrastructure monitoring.

What is an AWS Lambda Function?

In this article, we will cover the basics of a Lambda function and its functionality in our every day digital lives. AWS Lambda, as we already know, is a compute service that allows you to run code without managing servers. AWS Lambda runs the code when it is needed, and it is automatically scaled. The code you execute on AWS Lambda is called Lambda function, and it can be considered, for better understanding, as a formula in a spreadsheet.

Splunk Enterprise Logs Now Available in Splunk Observability for Simplified Troubleshooting

We are excited to announce that Splunk Log Observer Connect for Splunk Enterprise, previewed at.conf21, is now generally available! Log Observer Connect is a new feature that lets observability users explore the data already being sent to existing Splunk instances with Splunk Log Observer’s intuitive no-code interface for faster troubleshooting and root-cause analysis.

Beyond Zero Trust: What is Continuous Security Validation?

Continuous security (or control) validation helps me explain network security with one of my favorite analogies. Network security is like jiu-jitsu. You have no idea how strong your defenses are if you’re not rolling (sparring) regularly. Let’s take a closer look at continuous security validation, and explain why, just like jiu-jitsu, you need to keep your system in practice to keep it sharp.

Sustainable competitive advantage: 10 Ways a Small MSP Can Punch Above Its Weight

Sometimes we associate “big” with power and success, but being small doesn’t mean you can’t compete—you just need to be smarter at creating a sustainable competitive advantage. Here are 10 things you can do as a small managed service provider (MSP) to punch above your weight.

Network Basics: Configuring Interfaces on Ethernet Switches

Administrators are tasked with configuring interfaces on network devices more than any other single thing. This makes sense—interfaces are the main points of connectivity, sending and receiving traffic throughout an organization. Configuring interfaces can be deceptively easy. Switchports might require a VLAN assignment, and often function with no other configuration at all. Router interfaces can be up and running with only an IP address.

New API Tokens UI

InfluxDB Cloud allows users to create API tokens that are used for authentication and authorization to sets of resources when interacting with our API. We recently made changes to the user interface so that after generating a token, you will need to immediately store it in a secure vault of your choosing for safekeeping. We made this change to conform to industry best practices around both token generation and retrieval.

Graylog Insights -- How 2021 Will Shape 2022

People may not reminisce over 2021, but as Winston Churchill once said, “Those that fail to learn from history are doomed to repeat it.” 2021 swooped in on the coattails of a major supply chain data breach, and a lot of the challenges we experienced during this past year seemed to follow suit. To celebrate the best and hopefully move away from the worst that 2021 had to offer, this look back at 2021 trends can inspire us all to learn, and most of all, show us how to move forward.

Debug source code in real time with Rookout's Datadog App

Earlier this year we launched Datadog Apps, which seamlessly integrate functionality from third-party tools into Datadog’s centralized monitoring platform. This project has enabled us to collaborate with some of our partners, such as PagerDuty and LaunchDarkly, to extend the Datadog UI and provide our customers with new solutions for incident management, feature flag optimization, and more.

Key metrics for monitoring Azure SQL databases

Microsoft Azure SQL Database is a platform-as-a-service (PaaS) database offering for modern cloud applications. It’s a fully managed service that runs on the latest version of the SQL Server database engine, enabling you to create highly available and performant database instances without needing to maintain hardware upgrades, patches, or backups.

Tools for collecting Azure SQL Database data

In Part 1 of this series, we discussed key metrics for monitoring Microsoft Azure SQL databases. We also looked at how your database resource and audit logs complement metrics to provide more insight into database performance, activity, and security. In this post, we’ll show you how to collect metrics and logs from your database instances and monitor them with Azure’s monitoring and reporting tools.

Monitor Azure SQL databases with Datadog

In Part 2 of this series, we showed you how to monitor Azure SQL Database metrics and logs using the Azure platform. In this post, we will look at how you can use Datadog to monitor your Azure SQL databases alongside other technologies in your infrastructure. Datadog provides turn-key integrations for Azure along with more than 500 other technologies, enabling you to track long-term performance trends across all systems in your infrastructure, not just your SQL databases.

2022: Let's do this!

Happy new year! 🎉 It's 2022, and even though not much has changed in the past year, I'm happy to know that Monitive is running smoothly and brings value to our customers. For the first quarter of 2022 our main focus is fixing issues, either bugs we know of or small tweaks that make everyone's lives better. There are some feature requests in our backlog that we're jumping on starting February, and also a few surprise updates that are in the works and will be launched when they're ready.

Site24x7 end-user experience monitoring: 2021 in a nutshell

From coping with life during a pandemic, to getting updated for the new virus variants, to witnessing a major outage of the social media stalwart, to rebuilding hope with the vaccines, 2021 has been an eventful year. We, at Zoho, managed to run our services smoothly in a hybrid mode with employees working from home as well as from our offices. Our journey in 2021 was fueled by the support from our dedicated customers and that helped us meet our goals for the year.

Why Mixing Microsoft Teams, PBXs and PSTNs is a Challenge

The work landscape has changed dramatically over the past two years and it has been a challenge for an organizations IT teams to keep the workforce connected. For many businesses, Microsoft Teams has become a core part of their efforts to keep connected but supporting the Microsoft Teams infrastructure has been something many IT departments have had to learn on the fly. Pre-pandemic, businesses might have had thousands of employees working in a handful of offices.

Reducing MTTR and tracking SLAs with Grafana Cloud

Attracting and retaining top developer talent is a No. 1 priority for a lot of companies these days, including location technology company TomTom. As both the builder of the world’s largest developer community and an employer of thousands of developers, TomTom is always looking for developer-friendly tools to help their employees feel productive, efficient, and inspired.

Docker Logging: How Do Logs Work With Docker Containers?

Docker containers are a great way to create lightweight, portable, and self-contained application environments. Logging is critical for every application since it gives valuable information for troubleshooting, evaluating performance issues, and drawing an overall picture of the behavior of your architecture. This article presents a thorough tutorial covering all you need to know to start with Docker logging. It also provides some recommended practices for optimizing the logs of your containerized apps.

The Importance of Observability for the SRE

The term Site Reliability Engineer (SRE) first appeared in Google in the early 2000s. In Google’s 2016 SRE Book, Benjamin Treynor Sloss wrote that, generally speaking, “an SRE team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s).” This means that the SRE teams at Google decide how a system should run in production as well as how to make it run that way.

5 Misconceptions About Cloud Cost Optimization Tools

Unless you are one of the 1% of enterprises that have zero workloads running in the publice cloud, you need a cloud cost optimization tool. Yes, you do. And if you have workloads running in multiple public clouds—which somewhere in the neighborhood of 85% of enterprises do—you really need a cloud cost optimization tool. If I’m preaching to the choir, feel free to skip to the end of this article where you’ll find a link to try Virtana Optimize for free.

Why Are IT Pros Hesitant to Deploy Windows 11?

A key aspect of my job is to speak to End-User Computing (EUC) professionals on a regular basis—analysts, customers, partners, etc.—to better understand their challenges and objectives. Unsurprisingly, one topic has been coming up a lot lately: Windows 11. Surprisingly, nobody wants to actually deploy it. Or, at least, not right now. Why? After countless discussions, 4 reasons were made clear for their reluctance with Windows 11: These are all valid points.

Crash Course in Crash Grouping

Supporting large applications with enormous crash volumes can be a real pain in the hindquarters. It is extraordinarily difficult for organizations to optimally dispatch engineering resources without excellent data and proper tooling. At BugSplat, we recently upgraded the tooling we provide to developers so that they can group related crashes and better target their support efforts, deliver more stable applications, and deliver more value to their customers.

Lindesberg Municipality Improves Network Reliability & Quality for its 5,500 Users w/ WhatsUp Gold

Lindesberg Municipality’s 5,500 simultaneous users were constantly losing connection with its network and IP Telephony systems, which led to a volume of support calls that was straining its IT department. It implemented WhatsUp Gold to monitor its connections and servers and study its access points. Lindesberg has moved from a network-centric to a service-centric installation, with 99.9% uptime, happier customers, and less support tickets.

Top Cloud Security Trends 2022

Enterprise cloud use cases are changing and expanding, and companies are now realizing new security challenges that need to be resolved. Cloud security solutions can include everything from new security tools to more advanced training to investing in new team strategies. See below to learn about the tops trends cloud security experts are seeing in the market.

Observability - Software & Tools.

A developer's viewpoint is distinct. It can be difficult to keep track of operations and detect the fault that is causing the software to malfunction when handling numerous sectors. What if you could detect the issue ahead of time and fix it as soon as possible? The tactics that we concentrate on and put into action are those that assist us in properly managing our tasks. Knowing about observability makes this possible. Let's take a closer look at it in this blog.

Applications Manager: Looking back on the features and improvements from 2021

2021 has been an incredible year for us at ManageEngine. Throughout the year, we added several new capabilities and improvements to Applications Manager, our application performance monitoring software. Here’s a look back at the highlights.

20+ Best Log Management Tools for Monitoring, Analytics & More: Pros & Cons Comparison [2022]

Whether you capture them for application security and compliance, production monitoring, performance monitoring, or troubleshooting, logs contain valuable information about the health of your apps. But it all comes down to what and how you log, which is where log management tools come into play.

Top 15 Website Speed Testing Tools [2022 Comparison]

There are a lot of reasons why people choose to shop at one online store over another or pick one streaming service over another from the type of service they are getting to pricing, quality and, you’ve guessed it from the title, speed. The speed to which I’m referring is the speed at which the website loads and reacts to user input. In one of my previous articles about netwok latency, I’ve talked about how big of a difference even a two-second extra delay makes.

Building an effective remote-first team during the pandemic

I’m an engineering manager at Grafana Labs serving on the Grafana Enterprise Operations team. I joined Grafana Labs in December 2020 and I just celebrated my first year at the company. The last 12+ months have been filled with the most exciting and rewarding experiences in my career, full of new opportunities and learnings. More importantly, I am lucky enough to meet and work with the wonderful people at Grafana Labs.

Interview with CTO Kathleen Moriarty

For the newest instalment in our series of interviews asking leading technology specialists about their achievements in their field, we’ve welcomed Kathleen Moriarty, Chief Technology Officer at the Center for Internet Security. During her tenure in the Dell EMC Office of the CTO, Kathleen had the honour of being appointed and serving two terms as the Internet Engineering Task Force (IETF) Security Area Director and as a member of the Internet Engineering Steering Group from March 2014-2018.

Recharts and InfluxDB Tutorial - Visualize IoT Sensor Data with ReactJS

In this tutorial, you will learn how to create a custom data visualization with ReactJS using the Recharts charting library to display time series data stored with InfluxDB. To do this you will store some real-time data being recorded by some IoT sensors which record the temperature, humidity, and carbon monoxide levels of a room.

Vicky User Community 2021: Thank You for the Contributions!

2021 was a great year for VictoriaMetrics! We delivered a lot of new features, our team doubled in size, and so did the list of public case studies written by VictoriaMetrics users as well as the community contributions to the product. See our 2021 Momentum blog post for details on all our achievements last year. All this wouldn’t be possible without our supportive community, their help, patience and creativity.

What's new in VictoriaMetrics 2021?

The 2021 year is finished, so it’s time to look at changes VictoriaMetrics has gained during the past year. The first release in 2021 was v1.52.0. The last release in 2021 was v1.71.0. More than 20 new releases of VictoriaMetrics were published during the 2021. The full changelog is available at this page. Let’s look at the most interesting changes.

5 essential metrics to monitor in your VMware environment

VMware enables businesses of all kinds to set up and employ virtual machines (VMs) and servers. These virtual instances provide an abundance of benefits, such as faster speeds, reduced costs, and reduced downtime. However, they can also be quite difficult to manage. Without proper VMware monitoring software, these virtual instances can suffer severe performance loss.

Azure vs. AWS vs. GCP: A Direct Comparison

Ever since the pandemic hit the world in 2020, many businesses and institutions have begun to adopt cloud computing into their operations. With more employees working from home, companies have moved from on-site data centers and chosen cloud computing services. Although cloud computing is not new, its relevance has increased during the last year and has revolutionized how businesses and institutions are operating now.

Mapping Statistics - What You Need to Know

When your Elasticsearch cluster ingests data, it needs to understand how the data is constructed. To do this, your Elasticsearch cluster undergoes a process called mapping. Mapping involves defining the type for each field in a given document. For example, a number or a string of text. But how do you know the health of the mapping process? And why do you need to monitor it? This is where mapping statistics come in. Mapping statistics give you an overall view of the mapping process.

Log Management: Your Obvious Choice for Capacity Planning and Optimization

Recently, I wrote an article titled Life Cycle Monitoring: Why an Ounce of Prevention Is Worth a Pound of a Cure. The great Benjamin Franklin coined the term. In the article, I highlighted the value, efficiency, and logic of putting more time into a proper capacity planning and optimization process for all types of IT environments. Most IT professionals would tell you the first thing that comes to their mind when asked how they use log management tools is troubleshooting.

Momma Said Grok You Out: Use LogStream to Streamline Searches, Aid in Reformatting Data and Parsing

It is commonly believed that once data is collected and ingested into a system of analysis, the most difficult part of obtaining the data is complete. However, in many cases, this is just the first step for the infrastructure and security operations teams expected to derive insights.

Ask Miss O11y: How Can I Add o11y to Databases?

Oh goody, I’m so tickled to get this one. *rubs hands gleefully* Funny story, back in 2016–2017 we thought we were building Honeycomb primarily for DB use cases. The use cases are that killer. I’ve never seen another tool do the kinds of things you can do on the fly with Honeycomb and databases.

OpsRamp Recognized Across Three Categories of the 451 Research Market Map for Application and Infrastructure Performance Monitoring

OpsRamp was one of only two vendors to be recognized in 451 Research’s Market Map for Application and Infrastructure Performance Monitoring (AIPM) in the categories of Infrastructure Monitoring, Event Correlation and Alerting. 451’s AIPM Market Map offers a holistic perspective on key emerging categories in the IT monitoring and observability space.

Don't Settle for Observability. Strive for Actionability

You’ve heard of observability, which has fast become one of the IT industry’s buzzwords du jour. But what about actionability, or the ability to translate observability into meaningful action? The latter term may not be a trending buzzword (not yet) – indeed, “actionability” perhaps sounds almost boring – but it’s just as essential as observability in managing complex, cloud-native environments.

Gaining Situational Awareness at Every Point of Sale with Broadcom's DX APM App Experience Analytics

The National Retail Federation forecasted historic holidays sales this 2021 season, as retailers grappled with high volumes of in-store and digital traffic, along with a need for full visibility into the user experience. They turned to monitoring their Point-of-Sale (POS) systems for key analytics that revealed unique, real-time details about what customers were experiencing.

Network Automation: What It Is, Where It's Heading, and How to Start Planning

Automation. It’s a common term. Some may call it a buzzword, even. When it comes to building a network, it’s usually not your first consideration. After all, what’s most important is getting users and devices online—and keeping them there. Isn’t automating the network an advanced step? Are we even capable of making network monitoring and management automatic?

Accelerating software delivery through observability at two very different organizations

Delivering value to customers quickly and efficiently is critical to the success of modern businesses. Understanding the process and timeframe during which an organization generates new ideas and then designs, develops and deploys them is vital to success. At the recent Illuminate User Conference, Drew Horn, Director of Business Development, held a discussion with Clara Ko, Director of Engineering at Sauce Labs, and Bryan Veselka, Director and Product Owner for Cloud and Automation at Vizient.

5 Dashboard Design Best Practices

In an increasingly data-driven world, the ability to summarize and display data while making it easy to understand and actionable is more important than ever. Dashboards appear in all types of software with various approaches behind their design. Despite how they differ in appearance and the information they display, at a conceptual level all dashboards have the same goal and purpose.

Sponsored Post

Speedscale Launches CLI: Free API Observability Tool

We are excited to announce the launch of Speedscale CLI, a free observability tool that inspects, detects and maps API calls on local applications or containers. The offering underscores the importance of continued and proactive API testing to quickly detect and debug defects within a shifting array of upstream and downstream interdependencies.

Next Level Ruby on Rails Application Monitoring with AppSignal

In the first of this two-part series, we covered how to set up AppSignal in a Ruby on Rails application for many great insights out of the box. AppSignal can automatically track errors, monitor performance, and report metrics about some dependencies. But, in many cases, each of our applications behaves in different ways, so we'll want more than just generic monitoring. In this post, we will run through adding custom instrumentation and monitoring to a Ruby on Rails application.

Learning the tricks of Grafana Loki for distributed logging at scale in a Kubernetes environment

Logging can provide immense detail when used well, or it can become a firehose and take hours to trawl through. The team supporting the Kubernetes platform at Civo needed a solution that was simple and performant and could be queried in ways to help and not hinder them In this talk, Civo SRE Anaïs Urlichs and Principal Engineer Alex Jones will illustrate how Loki was chosen and brought into the organization to empower engineers. Integrating with Prometheus and Grafana dashboards, Loki has allowed engineers to filter for precise information that helps them debug quicker.

How to Find WordPress Performance Bottlenecks

Monitoring is a critical part of managing a WordPress site since you need to know what's going on with it, such as how many visitors it has, how quickly it loads, and whether it's constantly online. Data on these areas will aid you in making critical decisions, resulting in improved performance, happier visitors, and, if applicable, a higher bottom line. Many factors can cause WordPress to slow down, but you don't need to be a techie to address them.

How the new k6 Cloud app plugin makes it easy to correlate QA data and system metrics in Grafana

One of the common challenges when doing performance testing is the difficulty of correlating the metrics of your application with your testing results. Having available QA, infrastructure, and application metrics together allows engineering teams to better understand the behavior of their systems during the testing, helping to detect and prevent potential issues in their applications.

Detecting Log4J/Log4Shell exploits with LogStream

Shortly before the December holidays, a vulnerability in the ubiquitous Log4J library arrived like the Grinch, Scrooge, and Krampus rolled into one monstrous bundle of Christmas misery. Log4J maintainers went to work patching the exploit, and security teams scrambled to protect millions of exposed applications before they got owned. At Cribl, we put together multiple resources to help security teams detect and prevent the Log4J vulnerability using LogStream.

Detecting and Preventing Log4J Attacks with Cribl LogStream

Shortly before the December holidays, a vulnerability in the ubiquitous Log4J library arrived like the Grinch, Scrooge, and Krampus rolled into one monstrous bundle of Christmas misery. Log4J maintainers went to work patching the exploit, and security teams scrambled to protect millions of exposed applications before they got owned. At Cribl, we put together multiple resources to help security teams detect and prevent the Log4J vulnerability using LogStream.

Easy Lambda Function Monitoring with the AWS Lambda InfluxDB Template

AWS Lambda is a serverless compute service that allows you to run code without having to manage servers. Lambda provides autoscaling and bills only on compute time, so you aren’t paying for unused resources. Some common use cases are file processing, stream processing, and acting as a backend for web and mobile applications. AWS Lambda functions can be invoked with external HTTP requests as well as by events triggered by over 200 different AWS services.

How frequently should you monitor your website?

Hopefully, you are consistently getting fresh leads and customers through your website. However, like an automobile that needs to be checked regularly for problems, you need to maintain your website for optimal performance. That's important for your search rankings, general user experience, and conversions. This guide will discuss how often you should monitor your website. We'll also look at some of the essential tasks you should take to ensure things progress nicely.

Sponsored Post

Build more resilient mobile apps with Flutter Crash Reporting

Powered by the Dart language, Flutter is one of the fastest-growing cross-platform programming frameworks in the world. Since its release in 2017 it has been empowering developers to build mobile apps that work seamlessly across iOS and Android with a single code-base. If you're a Flutter developer, you'll already know the importance of building better quality software, faster - after all, it was made for that very reason. That's why today, Raygun is proud to bring you Flutter support Crash Reporting. This highly requested release gives you complete visibility into the health of your Flutter applications, with rich diagnostics that take you to the root cause of errors and crashes.

Ensuring a Reliable Microsoft Teams User Experience: Modern Workplace Use Cases

Rob Doucette, VP, Product Management reviews real-life examples from businesses in various industries and see how IT can gain complete end-to-end visibility of the Microsoft Teams user experience, to rapidly detect and resolve problems before they impact the user experience.

Five tricks for logging at scale in a Kubernetes environment with Grafana Loki

Legacy logging solutions simply couldn’t keep up with the complex, hyperconverged regional infrastructure at Civo, a Kubernetes service provider that enables users to launch k8s clusters within 90 seconds. “With our infrastructure and application deployment getting more complex and more distributed, we needed our logging solution and our entire observability stack to scale up with our needs,” said Anaïs Urlichs, Site Reliability Engineer at Civo.

The Case for Frontend Performance Monitoring

Application Performance Monitoring has been a popular concept among developers and companies alike. APM data has helped product teams increase their growth and revenue manifold. Whether it is an issue affecting the availability of a service or a trend that suggests an incoming increase in user engagement, monitoring has always helped organizations get the best out of their product strategies.

LogicMonitor's Verified HashiCorp Terraform Integration Allows You To Do More With Less

Here at LogicMonitor, we’re really big on extensibility and automation. We’re constantly adding to our catalog of monitoring coverage and ensuring that setup is as simple as possible. We also monitor almost any data you can expose on a network. People have done way more with LogicMonitor than we would have ever imagined, and I’m extremely excited to announce our next step in that commitment to extensibility and automation.

CI/CD - What You Need to Know

Continuous integration (CI) and continuous delivery or deployment (CD) cover the process of automatically merging, building, and testing code changes ready for release, and – in the case of continuous deployment – releasing those changes to users. If you’re developing software for others to use, you’ll need to go through some form of build and test process before you make your latest changes available.

Why is SAP security monitoring important?

SAP applications drive the most business-critical processes in companies around the globe. It will not surprise anyone that cybersecurity is of utmost importance to prevent SAP customers from vulnerabilities. A joint threat-intelligence report from SAP and Onapsis, released on 6 April 2021, warns that cyber attackers are actively exploiting known SAP security vulnerabilities to steal information and compromise mission-critical SAP landscapes.

Paths to networking with Ron Winward | Network AF Episode 8

In today's episode of Network AF, Avi interviews Ron Winward, VP of Network Services at INAP. With 20 years of network services experience under his belt, we want to know more about how he got into networking and how an expert like him learns. Today's discussion will also involve the community and what Ron looks for in those who want to get into networking. Listen now!

Dr. Changelove: Or How I Learned to Stop Going Vendor-Specific and Love the LogStream

Here at Cribl, we have a cloud offering of our LogStream product. In building and supporting our cloud product, we have a service-based architecture. And we want to be able to gather metrics from our services, in order to monitor those services and make sure we meet our SLAs.

6 Surprising things you can learn from our APM survey

In the summer of 2021, eG Innovations joined forces with the DevOps Institute to run an APM survey to find out how the industry would look in the new normal. The 2021 APM survey was conducted over three months between July and September. Over 900 people from DevOps, SREs, and ITOps backgrounds participated and we got a broad spectrum of responses.

Gain the upper hand over adversaries with Osquery and Elastic

With the Elastic 7.16 release, Osquery Manager is now generally available for Elastic Agent, making it easier than ever to deploy and run Osquery across your environments. By collecting Osquery data and combining it with the power of the Elastic Stack, you can greatly expand your endpoint telemetry, enabling enhanced detection and investigation, and improved hunting for vulnerabilities and anomalous activities.

How search enables role-based data classification and sharing across the government

Government data strategies lay a promising groundwork for how data will be used to drive more informed decision making internally and more streamlined public services externally. A commonality between these strategies is the need for improved role-based data sharing and data re-use. The sticking point, however, is in the way to implement data sharing when there are known silos across and within various departments.

Why "AIOps vs. Observability" Is a False Dilemma

What comes first – observability or AIOps? Can you achieve observability without AIOps? Do you need AIOps if you already have an observability solution in place? These are all questions that any team considering AIOps will want to answer in order to determine the real-world value that AIOps tools stand to offer.

A New Way to Look Like Splunk

During.conf21, we announced the public release of the Splunk UI Toolkit, a collection of packages and libraries that provides some of the same underlying tools powering our product line to you, the Splunk developer. Now, any Splunk developer can incorporate Splunk UI components into their own custom applications and tools. This includes everything from buttons and inputs from our @splunk/react-ui package, or our new parallel coordinates visualization from our @splunk/visualizations package.

Getting Started with Ruby and InfluxDB

Scroll down for the author’s photo and bio. Time series databases like InfluxDB index data by time. They are efficient at recording constant data streams like server metrics, application monitoring, sensor reports, or any other data containing a timestamp. The structure makes analyzing change over time a breeze. This tutorial will show you how to set up InfluxDB with a sample Ruby application.

DevOps State of Mind Podcast Episode 6: The Future of DevSecOps with EMA

Chris Steffen is a research director for information security at Enterprise Management Associates. EMA is a leading analyst and consulting firm that prides itself on going beyond the surface to provide deep insights about the IT industry. I'm Liesse from LogDNA. Before we dive in, I just wanted to take a moment to thank all of you for tuning in to season one of DevOps State of Mind.

Sponsored Post

Observability, AIOps, APM, and i2M: The Partner Ecosystem for IBM MQ Enterprises

Complex enterprises have an integration infrastructure (i2) layer that connects technologies and applications across cloud, data center, virtualized systems, mainframe, edge computing, etc. The i2 layer includes a core middleware application (such as IBM MQ) along with many other "integration" technologies, such as MFT (managed file transfer), IoT, REST APIs, DataPower Gateway, and other messaging technologies (i.e., Kafka, TIBCO EMS, IBM ACE, IBM Integration Bus (IIB) and more).

Sponsored Post

Digital Experience Monitoring Growth For the Business Win

Application Performance Management (APM) measures how a SaaS or Web application performs on the backend (for Devops). End-User Experience Management (EUEM) focuses on user behavior within those applications. Network Performance Monitoring and Diagnostics (NPMD) collects network telemetry to facilitate performance degradation. DEM combines all these tools to holistically look at the entire digital journey and see how each dependency drives successful experiences for customers and employees.

Exoprise 2021 Year in Review

Happy New Year 2022! In 2021, Exoprise’s critical focus was on improving its product for monitoring digital experiences and mobilizing internal teams to improve customer adoption and SaaS/network experiences everywhere. As Covid continues to dominate the world, IT and business teams are increasingly looking for solutions like Exoprise Digital Experience Monitoring (DEM) to ensure end-users are productive with a seamless work-from-home experience.

What Is AIOps? A Complete Beginner's Guide

Gartner predicted, by 2020 90% of Artificial Intelligence (AI) and Machine Learning (ML) would have been deployed in enterprises through “AIOps” – a combination of machine learning and operations. An AIOps approach has the potential to reduce costs and risks by automating routine IT Operations tasks while returning more control over decisions to the organization.

ELK vs Graylog: Log Management Comparison

As organisations face outages and various security threats, monitoring an entire application platform is critical in order to determine the source of the threat or the location of the outage, as well as to verify events, logs, and traces in order to understand system behaviour at the time and take proactive and corrective actions.

Introducing Grafana University: our virtual hands-on education platform that's free and easy to use

Grafana Labs has had a long commitment to educating our customers and community about all of our open source technologies and products, with our community Slack, webinars, conferences, documentation, and of course, this blog. In 2021, we decided that it was time to create a formal education program to provide more structured, repeatable, and scalable learning experiences – all while providing the same compelling and quality content our community is accustomed to.

Deploy SigNoz using Helm charts, 500+ members on our slack community - SigNal 08

Welcome to SigNal 08, and the last SigNal issue of 2021! 🥳 This month, we made numerous PRs improving our product experience, added new awesome contributors, and launched a new initiative to discover better UX for our users. We also crossed 500+ members on our Slack community! 🥳 Wrapping up 2021, let’s see what Humans at SigNoz were up to in the month of December!

6 AIOps Myths You Should Be Wary Of

AIOps myths and how to avoid them Gartner coined the term AIOps in 2016 to refer to the combining of “big data and machine learning to automate IT operations processes, including event correlation, anomaly detection and causality determination.” In the five years since, AIOps has grown leaps and bounds — last year, AIOps was at the peak of the Gartner hype cycle.

Why Intuitive Troubleshooting Has Stopped Working for You

It’s harder to understand and operate production systems in 2021 than it was in 2001. Why is that? Shouldn’t we have gotten better at this in the past two decades? There are valid reasons why it’s harder: The architecture of our systems has gotten a lot more sophisticated and complex over the past 20 years. We’re not running monoliths on a few beefy servers these days.

Hybrid Cloud Infrastructure: A Complete Migration, Cost Management, and Optimization Checklist

The success of your enterprise’s digital transformation relies in no small part on your hybrid cloud infrastructure, which SearchCloud Computing defines as “a cloud computing environment that uses a mix of on-premises, private cloud and third-party, public cloud services with orchestration between these platforms.” Because this infrastructure is not a homogeneous environment, migration, management, and optimization can be an ongoing challenge.

The comprehensive guide to HANA migration

Are you ready to migrate to HANA? When HANA platform is implemented correctly, it has proven itself to deliver greater results in analytic intelligence, performance, data processing, and better ROI with faster time-to-value. Planning and executing a successful migration to SAP HANA depends upon a comprehensive understanding of the overall process and careful monitoring during the five main stages of a migration.

Five Cloud Networking Deployment Mistakes That Will Cost You

Useful tips to better plan, monitor, and troubleshoot your public cloud networking Cloud networking across a myriad of applications, clouds, and data centers can get complex. And with the rapid pace of transition to the cloud, you'll need to prepare for new concepts like VPCs, cloud interconnects, and multiple availability zones and regions. Kentik has put together a list of the top five cloud networking deployment mistakes to avoid so you can take full advantage of the cost savings and flexibility of the cloud.