Skip to content

Latest commit

 

History

History
10 lines (7 loc) · 971 Bytes

README.md

File metadata and controls

10 lines (7 loc) · 971 Bytes

Web Scraping: Billboard Hot 100 in R

GitHub GitHub top language GitHub last commit

R project to create a tool that scrapes the web for data.

For this project I first web scraped wikipedia to get a list of the top 100 songs on the Billboard Hot 100 over a number of years. I then used this song information in order to web scrap the Lyrics for these songs from Genius.

I used the lyrics to plot the frequency of the words used in the songs as well as try and detect differences in the wordchoice throughout the decades.