My project is called Inverse Text Normalization. This project is based on Neural Models of Text Normalization for Speech Applications. However, I applied it to the Inverse Text Normalization downstream task. Furthermore, I improved the results by utilizing the Transformer mechanism. Specifically, I used the bert2BERT architecture.
To get started, you should have prior knowledge on Python and Pytorch at first. A few resources to get you started if this is your first Python or Tensorflow project:
-
Clone the repo
git clone https://github.com/phkhanhtrinh23/inverse_text_normalization.git
-
Use any code editor to open the folder inverse_text_normalization.
-
The dataset is: spoken_norm_assignment from a VietAI's course that I studied.
-
Read and run
main.py
to train the bert2BERT model. -
Read and run
infer.py
to predict the results. My checkpoint is saved here.
Contributions are what make GitHub such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the project
- Create your Contribute branch:
git checkout -b contribute/Contribute
- Commit your changes:
git commit -m 'add your messages'
- Push to the branch:
git push origin contribute/Contribute
- Open a pull request
Email: phkhanhtrinh23@gmail.com