Skip to content

Latest commit

 

History

History
6 lines (5 loc) · 659 Bytes

README.md

File metadata and controls

6 lines (5 loc) · 659 Bytes

Project NASS - News Archives Scrapper & Searcher

Web Scrapper | Search Engine for Scrapped Results | Python | BeautifulSoup

This is a passion project which showcases web scraping abilities with python knowledge, where I have scrapped data from the web archives of a national newspaper called "The Hindu" and created a search engine to look for names inside the scraped data. What it does: Get all the articles from the archives of The Hindu from the entire year of 2010 and creates a 'database' (text file) within which we can search for articles with matching names of famous people. BeautifulSoup libraries were used to get the clean data from the site.