Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

package for visual data comparison using small multiples #16

Open
ataustin opened this issue Mar 3, 2020 · 2 comments
Open

package for visual data comparison using small multiples #16

ataustin opened this issue Mar 3, 2020 · 2 comments

Comments

@ataustin
Copy link

ataustin commented Mar 3, 2020

After refactoring data prep code or refreshing a data set, you may want to check the latest data against the original data to determine the extent to which your numeric and categorical features changed. This can be done with summary statistics, but that can be tedious especially for visualizing data drift from multiple data sets through time.

I propose a package to make the visual comparison of data sets simple and automated. A user can supply multiple data sets, and the package would provide functions to assist in the preparation of visual aids for studying data changes. Numeric data distributions can be compared using histograms and categorical data can be compared using bar plots.

The package's main dependencies would include:

@ataustin
Copy link
Author

it exists now! Here's the repo: https://github.com/ataustin/flyover

@emilyriederer
Copy link
Collaborator

Very neat, @ataustin -- this looks great!! Can't wait to take it for a spin 🤩

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants