Classifying jobs in a dataset (from Kaggle) as fake or real using machine learning.
Updated Apr 20, 2022 - Jupyter Notebook
A machine learning model built with PySpark's MLlib library to automatically flag suspicious job postings on Indeed.com. The dataset includes 18,000 job descriptions, of which about 800 are fake.
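The figures above imply a heavily imbalanced dataset: only about 4.4% of postings are fake. The following is a minimal sketch of how that imbalance could be quantified and offset with per-class weights. It assumes labels encoded as 1 for fake and 0 for real, and uses the common "balanced" heuristic of n_samples / (n_classes * class_count). The repository does not say whether it reweights classes, and `balanced_class_weights` is a hypothetical helper, not part of the project or of MLlib.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Approximate figures from the description: 18,000 postings, about 800 fake.
# Assumption: 1 marks a fake posting, 0 a real one.
labels = [1] * 800 + [0] * 17200

fake_share = labels.count(1) / len(labels)
weights = balanced_class_weights(labels)

print(f"fake share: {fake_share:.1%}")
print(f"class weights: {weights}")
```

Weights like these could be attached to each row as an extra column and passed to a classifier that accepts per-sample weights, so that errors on the rare fake class count for more during training.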
The model predicts fake job posts, using natural language processing to analyze the sentiment and patterns in each job posting.