Simple scraper for collecting data about the companies specified in the `input.csv` file from the https://www.zoominfo.com/ website.

The scraper collects the following data about companies:
- Headquarters
- Phone
- Revenue
- Number of employees
- Website

As a result, it generates a separate CSV file for every company from the input list. These files are located in the `output` folder. Each filename is created from the name of the company - `{company_name}.csv`. For example - `Amazon.csv`, `Google.csv`, etc.
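For illustration, `input.csv` is expected to hold one company name per line (the single-column layout is an assumption based on the description above):

```
Amazon
Google
```

A generated file such as `Amazon.csv` would then carry the fields listed above; the exact header names here are an assumption:

```
Headquarters,Phone,Revenue,Number of employees,Website
<headquarters>,<phone>,<revenue>,<employees>,<website>
```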
Requirements:
- Scrapy 2.3.0
- scrapy-rotating-proxies 0.6.2
- rotating-free-proxies 0.1.2
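These pins are installed from `requirements.txt` in the steps below; if you ever need to recreate that file, one matching the versions above would look like this (package names as published on PyPI):

```
Scrapy==2.3.0
scrapy-rotating-proxies==0.6.2
rotating-free-proxies==0.1.2
```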
Installation and usage:
- Clone the repo: `git clone https://github.com/dfesenko/zoominfo_scraper.git`.
- Go inside the `zoominfo_scraper` folder: `cd zoominfo_scraper`.
- Create a virtual environment: `python -m venv venv`.
- Activate the virtual environment: `source venv/bin/activate`.
- Install dependencies into the virtual environment: `pip install -r requirements.txt`.
- Change directory: `cd zoominfo`.
- Create or change the `input.csv` file. Put the names of the companies you want to scrape in it.
- Issue the following command: `scrapy crawl zoominfo`.
- The script should now be running. The `output` directory should appear in the current directory, and the script populates it with files (one CSV file per company) - see the example listing after these steps.
- If you want to change some scraper parameters, you can explore the `/zoominfo/zoominfo/settings.py` file.
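Assuming `input.csv` lists Amazon and Google (as in the example above), a successful run leaves the `output` folder looking roughly like this:

```
output/
├── Amazon.csv
└── Google.csv
```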
The scraper uses the scrapy-rotating-proxies and rotating-free-proxies packages to get the list of available free proxies and rotate them automatically. You can turn off this feature by commenting out the `DOWNLOADER_MIDDLEWARES` variable in the `settings.py` file. Also, while they work, these libraries create a `proxies.txt` file in the root of the project, where the list of free proxies is stored.