topics_and_summary is a library that identifies topics in a collection of text documents and classifies the documents inside that topics. It also generates summaries of text documents. This is done using NLP techniques.
- Identify the topics present in the collection of documents.
- Identify the relation of each document in the collection with each topic.
- Classify each document in the collection inside a topic.
- Identify the relation of a given text document with each topic.
- Classify a given text document inside a topic.
- Obtain the most representative documents of each topic.
- Obtain the documents of the collection more related with a given text document.
- Create an extractive summary of a given text document.
All this information and more is present in the documentation. To generate it, execute:
cd topics_and_summary/docs
./generate-api-doc.sh
The documentation will be generated in HTML format in the folder topics_and_summary/docs/build/html
. The index.html file is the main page of the documentation.