Skip to content

gurtaransingh/scraping

Repository files navigation

Web Scraping and Text Extraction

  1. PPT : Link
  2. Beautiful Soup and Flipkart Mini Project : Link
  3. Text Extraction Code and Text from Multiple Images Detection Mini Project : Link
  4. Sample File : Open Sample Files Folder above and download required file type
  5. Self Learning Web Scraping Book Resource : Link

Star and fork this repository for future help

For any queries : Link

Projects/opportunities for future

  1. Share Predictor: Live share/stock market graph data scraping, implimenting basic machine learning models and then making a web application telling me probability to buy a particular share or not, or suggeting me the right time to buy or sell.
  2. Handwriting Recogniser: Train your own OCR model to read handwritings in different languages like English, Hindi, Punjabi etc. Try cursive text, doctor's handwriting, any bill which has something handwritten on it.
  3. Your Own Google Lens: Merge many ocr models to extract text, qr codes, images. Compare the scraped data together. Search the most repeated words or images on backend and show most relavent searches.
  4. Kaggle Dataset Master: Scrape different websites, get relavent filtered information, upload datasets on kaggle like platform. Ask your friends to upvote you, add comments on dataset. If dataset is good and big upto 10k entries ask RANA SIR to host a kaggle competition with your dataset.
  5. Cold Emailing: Write a code - just put a url, and scrape all email IDs. And automatically cold email them. Use filter like hr@company.com

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published