Effective Management of High Volume Numeric Data with Histograms with Fred Moyer

Effective Management of High Volume Numeric Data with Histograms with Fred Moyer

May 8, 2018

A pre-show recording of Fred Moyer's talk at DataEngConf 2018.

In this talk, we take a look at both log-linear and cumulative histograms and how they provide advantages over storing data as quantiles, averages, and other histogram implementations, such as linear and fixed-bucket. We explain an open source histogram software library, and show some sample statistical operations using it. You'll come away with an understanding of use histograms to make your data engineering life easier.