Operations | Monitoring | ITSM | DevOps | Cloud

September 2023

Monitoring Machine Learning

I used to think my job as a developer was done once I trained and deployed the machine learning model. Little did I know that deployment is only the first step! Making sure my tech baby is doing fine in the real world is equally important. Fortunately, this can be done with machine learning monitoring. In this article, we’ll discuss what can go wrong with our machine-learning model after deployment and how to keep it in check.

Install MLflow in less than 5 minutes

Install MLflow quickly on Ubuntu using our distribution, Charmed MLFlow. You can integrate it with different tools, so you can run it on your workstation with Jupyter Notebook or at scale with Charmed Kubeflow. Charmed MLFlow is a fully open source distribution of the upstream project, that benefits from security patching, tool integration and automated lifecycle management.

Machine Learning for Fast and Accurate Root Cause Analysis

Machine Learning (ML) for Root Cause Analysis (RCA) is the state-of-the-art application of algorithms and statistical models to identify the underlying reasons for issues within a system or process. Rather than relying solely on human intervention or time-consuming manual investigations, ML automates and enhances the process of identifying the root cause.

Our first ML based anomaly alert

Over the last few years we have slowly and methodically been building out the ML based capabilities of the Netdata agent, dogfooding and iterating as we go. To date, these features have mostly been somewhat reactive and tools to aid once you are already troubleshooting. Now we feel we are ready to take a first gentle step into some more proactive use cases, starting with a simple node level anomaly rate alert. note You can read a bit more about our ML journey in our ML related blog posts.

Introduction to MLFlow

MLFlow is an open source platform used for managing machine learning workflows. It is a crucial component of the open source MLOps ecosystem, having passed 10 million monthly downloads at the end of 2022. It has four main components that ensure experiment tracking, model registry, model deployment and code packaging. Join our webinar to learn more about MLFlow During this webinar, Andreea Munteanu will discuss MLFlow and Charmed MLFlow, Canonical’s distribution of the open source platform.