Skip to content

Latest commit

 

History

History
17 lines (14 loc) · 825 Bytes

README.md

File metadata and controls

17 lines (14 loc) · 825 Bytes

olx-image-scraper

This tool allows you to scrap images of a specific category from the OLX website

You can find it useful for datascience and machine learning projects as a way to gather more real-life images for your dataset. This tool gather no personal data of the users, only raw images publicly available on the internet.

Run the script:

  • Set up a Python environment (min. Python 3.7 is required) and download everything from requirements.txt
  • Download Chrome driver first (it should correspond to your current Chrome version) https://chromedriver.chromium.org/downloads
  • Allow this script to control your PC

Additional suggestion: sort all saved images by size after running the script and remove duplicates (OLX repeats some promoted ads)

Main Python libraries used

  • Selenium
  • Pillow
  • PyAutoGUI