Clean Up Your Digital Life: How I Found 1929 Fully Identical Images, Dark, Bright and Blurry Shots in Minutes, For Free.
A companion repo for the blog post Clean Up Your Digital Life: Simplify Your Photo Organization and Say Goodbye to Photo Clutter.
-
fastdup_report/
-- Folder to store fastdup files. -
images/
-- Images folder. Use your own or download from Kaggle. -
fastdup_analyze.ipynb
-- A Jupyter notebook to run fastdup.
First, let’s install fastdup with -
pip install fastdup==0.903
Run fastdup -
import fastdup
work_dir = "./fastdup_report"
images_dir = "./images"
fd = fastdup.create(work_dir, images_dir)
fd.run()
Get a list of broken images found by fastdup:
fd.invalid_instances()
View duplicate images -
fd.vis.duplicates_gallery()
View dark shots.
fd.vis.stats_gallery(metric='dark')
View bright shots.
fd.vis.stats_gallery(metric='bright')
View blurry shots.
fd.vis.stats_gallery(metric='blur')
View clusters -
fd.vis.component_gallery()
If you have any questions or feedback, please don't hesitate to reach out to me. I'm active on the following platforms.
I am thrilled to share my work with you and I hope you find it useful.
If you do, please consider supporting my efforts by making a donation and/or sharing this repository on your social media.
Your support will help me to continue developing and maintaining this project, as well as create new ones.