the-explorer

A tool that uses the queue package to process an entire directory of CSV files in parallel, printing summary stats and a histogram for each column. It is step 2 in the analysis path, after my other tool, 'The Splitter'.
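As a rough illustration of the queue-based pattern described above, here is a minimal sketch, not the tool's actual code. It assumes worker threads pulling file paths from a `queue.Queue`, and it swaps in the standard-library `csv` and `statistics` modules for Pandas so that it runs without dependencies. The names `summarize_column`, `explore_file`, and `explore_directory`, the bin count, and the bar width are all hypothetical.

```python
import csv
import queue
import statistics
import threading
from pathlib import Path


def summarize_column(values, bins=5, width=20):
    """Return summary stats and a text histogram for one numeric column."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid dividing by zero for constant columns
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum value lands in the last bin.
        idx = min(int((v - lo) / span * bins), bins - 1)
        counts[idx] += 1
    peak = max(counts)
    return {
        "count": len(values),
        "mean": statistics.fmean(values),
        "min": lo,
        "max": hi,
        "hist_counts": counts,
        "hist_bars": ["#" * round(c / peak * width) for c in counts],
    }


def explore_file(path):
    """Summarize every fully numeric column of one CSV file."""
    with open(path, newline="") as handle:
        rows = list(csv.DictReader(handle))
    summary = {}
    if not rows:
        return summary
    for name in rows[0]:
        try:
            values = [float(row[name]) for row in rows]
        except (KeyError, TypeError, ValueError):
            continue  # assumption: non-numeric columns are skipped
        summary[name] = summarize_column(values)
    return summary


def explore_directory(directory, workers=4):
    """Process all *.csv files in a directory via a shared work queue."""
    tasks = queue.Queue()
    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            path = tasks.get()
            if path is None:  # sentinel: no more work for this worker
                return
            try:
                summary = explore_file(path)
            except Exception as exc:  # keep the worker alive on bad files
                summary = {"error": str(exc)}
            with lock:
                results[path.name] = summary

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for path in sorted(Path(directory).glob("*.csv")):
        tasks.put(path)
    for _ in threads:
        tasks.put(None)  # one sentinel per worker
    for t in threads:
        t.join()
    return results
```

Calling `explore_directory("some_dir")` returns a dict keyed by file name. Each column's `hist_bars` list can be printed line by line to get a text histogram. With Pandas available, the per-file step could presumably be replaced by reading the file into a DataFrame and summarizing it there.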

With these two tools combined, you can split massive datasets and then begin to explore them using Pandas.