Skip to content

Web Scrapper | Search Engine for Scrapped Results | Python | BeautifulSoup

Notifications You must be signed in to change notification settings

mishalalex/ProjectWebScrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project NASS - News Archives Scrapper & Searcher

Web Scrapper | Search Engine for Scrapped Results | Python | BeautifulSoup

This is a passion project which showcases web scraping abilities with python knowledge, where I have scrapped data from the web archives of a national newspaper called "The Hindu" and created a search engine to look for names inside the scraped data. What it does: Get all the articles from the archives of The Hindu from the entire year of 2010 and creates a 'database' (text file) within which we can search for articles with matching names of famous people. BeautifulSoup libraries were used to get the clean data from the site.

About

Web Scrapper | Search Engine for Scrapped Results | Python | BeautifulSoup

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages