news-data-scrapping-and-section-classification

This project scrapes news articles from a public news website, labels each article with the section assigned by the website, and trains a text classification model to predict an article's section.

Getting Started

Prerequisites

Python 3.9.13
Install the required libraries: pip install -r requirement.txt

Running the Script

Clone the repository: git clone https://github.com/Abhipawar02/news-data-scrapping-and-section-classification.git
Navigate to the project directory: cd news-data-scrapping-and-section-classification
Run the script: streamlit run main.py

Data Collection

The raw data was collected from a public news website.
For detailed steps and code, refer to the News Website-Scrapper notebook.
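
As a rough illustration of the collection step, the sketch below pulls (section, headline) pairs out of a listing page using only the Python standard library. It is not the notebook's code: the markup (an `<article data-section="...">` wrapping an `<h2>` headline), the names `ArticleParser` and `parse_articles`, and the sample HTML are all assumptions made for the example. The real site's structure will differ.

```python
from html.parser import HTMLParser


class ArticleParser(HTMLParser):
    """Collect (section, headline) pairs from a hypothetical listing page.

    Assumed markup (not taken from the real site):
        <article data-section="sports"><h2>Headline</h2></article>
    """

    def __init__(self):
        super().__init__()
        self.articles = []         # finished {"section", "headline"} records
        self._section = None       # section of the <article> being read
        self._in_headline = False  # True while inside that article's <h2>
        self._text = []            # text fragments of the current headline

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self._section = dict(attrs).get("data-section")
        elif tag == "h2" and self._section is not None:
            self._in_headline = True
            self._text = []

    def handle_data(self, data):
        if self._in_headline:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "h2" and self._in_headline:
            # Join fragments and collapse runs of whitespace.
            headline = " ".join("".join(self._text).split())
            self.articles.append({"section": self._section, "headline": headline})
            self._in_headline = False
        elif tag == "article":
            self._section = None


def parse_articles(html):
    """Return a list of {"section", "headline"} dicts found in the HTML."""
    parser = ArticleParser()
    parser.feed(html)
    parser.close()
    return parser.articles


# Made-up sample page used only to exercise the parser.
SAMPLE_HTML = """
<article data-section="sports"><h2>Local team wins  the final</h2></article>
<article data-section="business"><h2>Markets close <em>higher</em></h2></article>
"""

if __name__ == "__main__":
    for row in parse_articles(SAMPLE_HTML):
        print(row)
```

In practice the HTML would come from an HTTP request to the news site rather than from a string, and the parsed rows would be saved to the dataset directory.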

Data Preprocessing and Model Training

For the preprocessing and training steps, refer to the Scrapped News Data Preprocessing And Training notebook.
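
To give a feel for what section classification involves, here is a minimal sketch of a multinomial Naive Bayes classifier written in plain Python. This is a stand-in technique chosen for illustration. The model actually trained in the notebook may be different, and the class name `NaiveBayesSectionClassifier`, the tokenizer, and the toy headlines are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesSectionClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, sections):
        self.doc_counts = Counter(sections)      # documents per section
        self.word_counts = defaultdict(Counter)  # section -> word -> count
        for text, section in zip(texts, sections):
            self.word_counts[section].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        self.total_docs = len(sections)
        return self

    def _log_score(self, tokens, section):
        counts = self.word_counts[section]
        total_words = sum(counts.values())
        # Start from the log prior of the section.
        score = math.log(self.doc_counts[section] / self.total_docs)
        for tok in tokens:
            if tok in self.vocab:  # skip words never seen in training
                score += math.log((counts[tok] + 1) / (total_words + len(self.vocab)))
        return score

    def predict(self, text):
        """Return the section with the highest log score for the text."""
        tokens = tokenize(text)
        return max(self.doc_counts, key=lambda s: self._log_score(tokens, s))


# Toy training data, invented for this example.
TRAIN_TEXTS = [
    "team wins the match in final over",
    "striker scores late goal for the team",
    "stock markets rally as shares rise",
    "bank raises interest rates again",
]
TRAIN_SECTIONS = ["sports", "sports", "business", "business"]

if __name__ == "__main__":
    model = NaiveBayesSectionClassifier().fit(TRAIN_TEXTS, TRAIN_SECTIONS)
    print(model.predict("the team scored a goal"))
    print(model.predict("shares and interest rates"))
```

A real run would train on the scraped articles and their website-assigned sections, and would usually apply further preprocessing such as stop-word removal before counting words.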

Model Evaluation
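
A minimal sketch of how predicted sections could be scored against the website's labels, assuming the true and predicted sections are available as two equal-length lists. The function name `evaluate` and the example labels are hypothetical, and the numbers it produces are not results from this project.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return overall accuracy and per-section precision/recall."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = Counter(zip(y_true, y_pred))  # (true, predicted) -> count
    labels = sorted(set(y_true) | set(y_pred))
    correct = sum(pairs[(lab, lab)] for lab in labels)
    accuracy = correct / len(y_true) if y_true else 0.0
    per_section = {}
    for lab in labels:
        tp = pairs[(lab, lab)]                            # correctly predicted as lab
        predicted = sum(pairs[(t, lab)] for t in labels)  # everything predicted as lab
        actual = sum(pairs[(lab, p)] for p in labels)     # everything truly lab
        per_section[lab] = {
            "precision": tp / predicted if predicted else 0.0,
            "recall": tp / actual if actual else 0.0,
        }
    return accuracy, per_section


if __name__ == "__main__":
    # Made-up labels, used only to show the call.
    truth = ["sports", "sports", "business", "business"]
    preds = ["sports", "business", "business", "business"]
    print(evaluate(truth, preds))
```

Per-section precision and recall are worth reporting alongside accuracy because news sections are often imbalanced, so a high overall accuracy can hide a poorly predicted minority section.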

Note

Both notebooks can be found in the /notebook directory.

Results

Screenshots of the results are in the /results-screenshots directory:
Result-1
Result-2
Result-3
