For each part, start the Spark cluster and run the corresponding test.sh
Accuracy for each category:
- toxic: 0.96
- severe_toxic: 0.99
- obscene: 0.97
- threat: 0.99
- insult: 0.97
- identity_hate: 0.99
Accuracy: 0.70
AUC ROC score: 0.70
Accuracy: 0.88
Random forest accuracy: 0.89
Decision tree accuracy: 0.68