Bachelor_degree

Codes of scrapers from my Bachelor thesis

Files:

First_Steps_In_BS4.py, Selenium.py and Scrapy.py are only to present basics of those methods.

Main part

Main part of my BA thesis is PremierLeagueDataScraper which scrapes data from premierleague.com/stats about appearances of forwards in English Premier League in current season (In my case it was 2019/20). I created this scraper in order to collect data to subsequent analysis to find the most universal attacker in the league. It's an example of using web scraping in football scouting process. To open web scraper You have to run Initialize_PremierLeagueDataScraper.py.

DataCleaner.py

Merging lots of csv files containing scraped data to one, cleaning scraped data etc.

Addition (files: sofifaScraper.py & sofifaCleaner.py):

I also wanted to have age of players, so I had to build another scraper (from sofifa.com) and another data cleaner to get and clean it. I took a list of players names (in another csv) from my first scraped data and I used search field to find all of them on Sofifa page and scraped their age.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bachelor_degree

Files:

Main part

DataCleaner.py

Addition (files: sofifaScraper.py & sofifaCleaner.py):

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
DataCleaner.py		DataCleaner.py
First_steps_in_BS4.py		First_steps_in_BS4.py
Initialize_PremierLeagueDataScraper.py		Initialize_PremierLeagueDataScraper.py
PremierLeagueDataScraper.py		PremierLeagueDataScraper.py
README.md		README.md
Scrapy.py		Scrapy.py
Selenium.py		Selenium.py
sofifaCleaner.pu		sofifaCleaner.pu
sofifaScraper.py		sofifaScraper.py

yngalxx/Bachelor_degree

Folders and files

Latest commit

History

Repository files navigation

Bachelor_degree

Files:

Main part

DataCleaner.py

Addition (files: sofifaScraper.py & sofifaCleaner.py):

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages