Classification of questions into 5 classes. The five classes are as follows :
HUM
for questions about humansENTY
for questions about entitiesDESC
for questions asking you for a descriptionNUM
for questions where the answer is numericalLOC
for questions where the answer is a locationABBR
for questions asking about abbreviations
Concepts
- RNN
- LSTM
- Word Embeddings
- Multilayer and Bi-Directional RNNs
RESULTS
Trained for 5 epochs on Kaggle.
Test Accuracy of ~91%
Model is confused between Entity questions and questions about humans.
- PyTorch
- Torchtext (Used
torchtext.legacy
) - Spacy
- You can pretrain the model
- You can use the predict_classes function to try your own questions.
Thanks to the author of this for this amazing repo and its explanation and help.