September 2023

Build a Data Streaming Pipeline with Kafka and InfluxDB

InfluxDB and Kafka aren’t competitors – they’re complementary. Streaming data, and more specifically time series data, travels in high volumes and at high velocities. Adding InfluxDB to your Kafka cluster provides specialized handling for your time series data. This specialized handling includes real-time queries and analytics, and integration with cutting-edge machine learning and artificial intelligence technologies. Companies like Hulu have paired their InfluxDB instances with Kafka.

Mage.ai for Tasks with InfluxDB

Any existing InfluxDB user will notice that InfluxDB underwent a transformation with the release of InfluxDB 3.0. InfluxDB v3 provides 45x better write throughput and 5-25x faster queries compared to previous versions of InfluxDB (see this post for more performance benchmarks). We also deprioritized several features that existed in 2.x to focus on interoperability with existing tools. One of those deprioritized features is the task engine.

The Plan for InfluxDB 3.0 Open Source

The commercial version of InfluxDB 3.0 is a distributed, scalable time series database built for real-time analytic workloads. It supports infinite cardinality, SQL and InfluxQL as native query languages, and manages data efficiently in object storage as Apache Parquet files. It delivers significant gains in ingest efficiency, scalability, data compression, storage costs, and query performance on higher cardinality data.

Time Series Is out of This World: Data in the Space Sector

Time series data is critical for space industries, but managing that data is not always straightforward. While humans have yet to develop light-speed travel, teleportation or lots of the other cool things we see in movies or read in books, that doesn’t mean we aren’t making progress. Advances in technology are starting, ever so slowly, to blur the lines between science fiction and reality when it comes to outer space.

Introduction to Apache Arrow

A look at what Arrow is, its advantages and how some companies and projects use it. Over the past few decades, using big data sets required businesses to perform increasingly complex analyses. Advancements in query performance, analytics and data storage are largely a result of greater access to memory. Demand, manufacturing process improvements and technological advances all contributed to cheaper memory.

How InfluxData and Dremio Leverage the Apache Ecosystem

InfluxData and Dremio have always been at the forefront of embracing open source solutions to enhance their product offerings. This post discusses how both companies currently leverage the Apache Ecosystem and describes the downstream impact these powerful technologies have on their offerings. InfluxData created and maintains InfluxDB, a time series platform.

LBBC Technologies Creates a Custom Predictive Maintenance Program with InfluxDB, AWS, and MQTT

LBBC Technologies is almost 150 years old and dedicates time and resources to pushing the boundaries of pressure vessel and autoclave design through precision engineering, advanced technologies, and electronic intelligence. They prioritize investments in research and development to advance their vision for the future.

Announcing InfluxDB Clustered: InfluxDB 3.0 for Self-Managed Environments

Today, we’re excited to announce InfluxDB Clustered, our latest product developed on the InfluxDB 3.0 product suite. InfluxDB Clustered is the evolution of InfluxDB Enterprise, our popular self-managed product for large-scale time series workloads. For enterprises, the move from InfluxDB Enterprise to InfluxDB Clustered brings an orders-of-magnitude performance leap, with significant improvements across analytics, storage, and costs.

InfluxData Announces InfluxDB Clustered to Deliver Time Series Analytics for On-Premises and Private Cloud Deployments

SAN FRANCISCO – September 6, 2023 – InfluxData, creator of the leading time series platform InfluxDB, today announced InfluxDB Clustered, its self-managed time series database for on-premises or private cloud deployments. With the release of InfluxDB Clustered, InfluxData completes its commercial product line developed on InfluxDB 3.0, its rebuilt database engine optimized for real-time analytics with higher performance, unlimited cardinality, and SQL support.

Predictive Analytics Using a Time Series Database

Predictive analytics harnesses the power of big data, statistical algorithms and machine learning techniques to anticipate future outcomes based on historical data. Various industries use predictive analytics, from finance and healthcare to retail and marketing. Among its many uses, predictive maintenance and anomaly detection are two significant applications.
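As a rough illustration of the anomaly detection use case, here is a minimal Python sketch that flags points in a time series that deviate sharply from a trailing window. The function name `flag_anomalies`, the window size, the threshold, and the sample readings are all hypothetical choices made for this example. It is a simple z-score check, not the method used in the article or a feature of any particular database.

```python
from statistics import mean, stdev


def flag_anomalies(values, window=5, threshold=3.0):
    """Return indices of points that sit more than `threshold` sample
    standard deviations away from the mean of the preceding `window` points."""
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        sigma = stdev(history)
        if sigma == 0:
            # A perfectly flat window gives no scale to compare against; skip it.
            continue
        z_score = abs(values[i] - mean(history)) / sigma
        if z_score > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical sensor readings with one obvious spike at index 6.
readings = [20.1, 20.3, 19.9, 20.0, 20.2, 20.1, 35.0, 20.0, 20.2]
print(flag_anomalies(readings))
```

In practice the readings would come from a query against stored historical data, and the window and threshold would need tuning for the signal in question.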

Can Companies Really Self-Host at Scale?

Self-hosting is effective for many companies. But when is it time to let go and try the easier way? There’s no such thing as a free lunch, or in this case, free software. It’s a myth. Paul Vixie, vice president of security at Amazon Web Services, creator of the original Domain Name System (DNS), gave a compelling presentation at Open Source Summit Europe 2022 about this topic.