Project NASS - News Archives Scrapper & Searcher

Web Scrapper | Search Engine for Scrapped Results | Python | BeautifulSoup

This is a passion project which showcases web scraping abilities with python knowledge, where I have scrapped data from the web archives of a national newspaper called "The Hindu" and created a search engine to look for names inside the scraped data. What it does: Get all the articles from the archives of The Hindu from the entire year of 2010 and creates a 'database' (text file) within which we can search for articles with matching names of famous people. BeautifulSoup libraries were used to get the clean data from the site.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Project NASS - News Archives Scrapper & Searcher

Files

README.md

Latest commit

History

README.md

File metadata and controls

Project NASS - News Archives Scrapper & Searcher