Skip to content

Releases: ddangelov/Top2Vec

gensim version fix

01 Apr 15:43
Compare
Choose a tag to compare

1.0.23

12 Feb 16:06
Compare
Choose a tag to compare

Added numpy>=1.20.0 dependency.

1.0.22

12 Feb 01:23
Compare
Choose a tag to compare

Numpy related bug fix and document id validation performance upgrade.

added umap/hdbscan custom args

05 Feb 00:35
Compare
Choose a tag to compare

Addressed #90, #125, #126

Added custom umap and hdbscan arg option. Fixed issue with loading model with custom tokenizer.

added use_embedding_model_tokenizer option

09 Jan 00:13
Compare
Choose a tag to compare

Added use_embedding_model_tokenizer parameter. If set to True and if using an embedding_model other than doc2vec, use the model's tokenizer for document embedding.

Fixed dependency issue with joblib.

Fixed issues with wordclouds caused by negative similarity scores.

fix saving bug

10 Dec 22:25
Compare
Choose a tag to compare

Fixed bug #91

word indexing

10 Dec 01:32
Compare
Choose a tag to compare

Added option for indexing word vectors, this will speed up search for models with large vocabularies. Specifically search_words_by_vector and similar_words.

Added new method search_words_by_vector.

document indexing

07 Dec 21:00
Compare
Choose a tag to compare

Added option for indexing document vectors, this will speed up search for models with large number of documents. Specifically search_documents_by_vector, search_documents_by_keywords, and search_documents_by_documents.

Added new method search_documents_by_vector.

Added code to prevent hierarchical topic reduction error #79.

Separate dependencies

10 Nov 16:21
Compare
Choose a tag to compare

Dependencies for universal sentence encoder and BERT sentence transformer options are now optional.
With pip install top2vec[sentence-encoders] and pip install top2vec[sentence_transformers]

Faster cosine similarity.

logging bug fix and default change

16 Oct 20:40
Compare
Choose a tag to compare

The verbose parameter will be set to True by default.

Fixed a bug that stopped showing logging updates after downloading pre-trained models.