This project scrapes news articles from a public news website, tags each article with the section classification provided by the website, and uses a text classification model to predict article sections.
- Python 3.9.13
- Install the required libraries (run this from the project directory, after cloning the repository):
pip install -r requirements.txt
- Clone the repository:
git clone https://github.com/Abhipawar02/news-data-scrapping-and-section-classification.git
- Navigate to the project directory:
cd news-data-scrapping-and-section-classification
- Run the script:
streamlit run main.py
- The raw data was collected from a public news website.
- For the detailed scraping steps and code, refer to the News Website-Scrapper notebook.
- For the preprocessing and training steps, refer to the Scrapped News Data Preprocessing And Training notebook.
- Both notebooks can be found in the /notebook directory.
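
The notebooks are the authoritative reference for how the model is built. As a rough illustration of what predicting an article's section from its text involves, here is a minimal sketch using only the Python standard library. It is not the project's code: the choice of a multinomial Naive Bayes model, the `NaiveBayesSectionClassifier` class, the `tokenize` helper, and the sample headlines are all assumptions made for this example.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesSectionClassifier:
    """Hypothetical multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, sections):
        # Count documents per section and word occurrences per section.
        self.section_counts = Counter(sections)
        self.word_counts = defaultdict(Counter)
        self.vocabulary = set()
        for text, section in zip(texts, sections):
            tokens = tokenize(text)
            self.word_counts[section].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def predict(self, text):
        tokens = tokenize(text)
        total_docs = sum(self.section_counts.values())
        vocab_size = len(self.vocabulary)
        best_section, best_score = None, -math.inf
        for section, doc_count in self.section_counts.items():
            # Start from the log prior of the section.
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[section].values())
            for token in tokens:
                # Smoothed log likelihood of the token given the section.
                count = self.word_counts[section][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_section, best_score = section, score
        return best_section


# Made-up headlines standing in for scraped articles (not real project data).
train_texts = [
    "Team wins championship after dramatic final match",
    "Striker scores twice as club reaches cup final",
    "Central bank raises interest rates to curb inflation",
    "Stock markets fall as investors fear recession",
]
train_sections = ["sports", "sports", "business", "business"]

model = NaiveBayesSectionClassifier().fit(train_texts, train_sections)
predicted = model.predict("Club signs new striker before the final")
print(predicted)
```

In this sketch the section labels play the role of the website-provided tags. With real scraped data, the training texts and labels would come from the scraper's output instead of the hard-coded lists.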