3.3.5. Visual EDA Tools
3.3.5.1. Ydata-profiling
(Partly) Automated Visual EDA via Ydata-profiling
From their website:
ydata-profiling is a leading package for data profiling, that automates and standardizes the generation of detailed reports, complete with statistics and visualizations. The significance of the package lies in how it streamlines the process of understanding and preparing data for analysis in a single line of code! If you’re ready to get started see the quickstart!
First, look at the quickstart overview to get an idea of how it works and what it can do.
Then check out some examples of the reports it produces on their site.
If you like what you see there, try it!
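To give a sense of what "a single line of code" looks like in practice, here is a minimal sketch. The dataset is made up for illustration, and the helper names `make_demo_frame` and `build_profile_report` and the output filename are assumptions, not part of the package. The only ydata-profiling calls used are `ProfileReport(...)` and `.to_file(...)`.

```python
import pandas as pd


def make_demo_frame() -> pd.DataFrame:
    """Small made-up dataset so the sketch runs without any external file."""
    return pd.DataFrame(
        {
            "firm": ["A", "B", "C", "D", "E", "F"],
            "sales": [10.5, 20.1, None, 7.3, 15.0, 12.2],  # one missing value
            "employees": [100, 250, 80, 60, 190, 140],
            "industry": ["tech", "retail", "tech", "energy", "retail", "tech"],
        }
    )


def build_profile_report(df: pd.DataFrame, path: str = "eda_report.html") -> str:
    """Write an HTML profiling report for df and return the output path."""
    # Imported inside the function so the rest of this file still runs
    # if the package is not installed (pip install ydata-profiling).
    from ydata_profiling import ProfileReport

    profile = ProfileReport(df, title="Demo EDA report")
    profile.to_file(path)  # writes a standalone HTML file
    return path


df = make_demo_frame()
print(df.shape)

# With ydata-profiling installed, this one call produces the report:
# build_profile_report(df)
```

Open the resulting HTML file in a browser, or swap `make_demo_frame()` for your own dataframe.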
3.3.5.2. Lux
(Partly) Automated Visual EDA via Lux
Lux “makes data science easier by automating certain aspects of the data exploration process. Lux is designed to facilitate faster experimentation with data, even when the user does not have a clear idea of what they are looking for. Lux is integrated with an interactive Jupyter widget that allows users to quickly browse through large collections of data directly within their Jupyter notebooks.”
First, look at the quickstart overview to get an idea of how it works and what it can do.
If you like what you see there, try it!
Load a dataframe
Look at Lux’s recommended visualizations, point and click, and guide it towards what you want
When you have something you like, you can easily output the code that built that figure
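The three steps above can be sketched roughly as follows. The widget interaction itself happens by pointing and clicking in Jupyter, so those lines are shown as comments. This is based on my reading of the Lux docs (`df.intent`, `df.exported`, and `vis.to_altair()`), so treat it as a starting point and check the docs for your installed version. The dataset and the helper names `make_demo_frame` and `exported_chart_code` are made up for illustration.

```python
import pandas as pd


def make_demo_frame() -> pd.DataFrame:
    """Small made-up dataset so the sketch runs without any external file."""
    return pd.DataFrame(
        {
            "sales": [10.5, 20.1, 9.8, 7.3, 15.0, 12.2],
            "employees": [100, 250, 80, 60, 190, 140],
            "industry": ["tech", "retail", "tech", "energy", "retail", "tech"],
        }
    )


def exported_chart_code(df) -> list:
    """Return Altair code strings for charts exported from the Lux widget."""
    # `exported` is only present on Lux-enabled dataframes; on a plain pandas
    # dataframe we fall back to an empty list.
    exported = getattr(df, "exported", [])
    return [vis.to_altair() for vis in exported]


# In a Jupyter notebook with Lux installed (pip install lux-api):
#
#   import lux                 # import before creating the dataframe
#   df = make_demo_frame()     # 1. load a dataframe
#   df.intent = ["sales"]      # 2. optional: steer recommendations to a column
#   df                         #    display it and toggle to the Lux view
#   # ...select charts in the widget and click the export button...
#   for code in exported_chart_code(df):
#       print(code)            # 3. code that rebuilds each exported figure
```

Paste the printed code into a new cell to tweak the figure by hand.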
Feedback wanted: Please let me know if you use it and how it goes! If it works well, I'll integrate it more into the class next year.
For more, see the Lux repo and the Lux docs.