Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

ddangelov / Top2Vec Public

Notifications You must be signed in to change notification settings
Fork 374
Star 3k

Code
Issues 61
Pull requests 16
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: ddangelov/Top2Vec

Releases Tags

Releases · ddangelov/Top2Vec

gensim version fix

01 Apr 15:43

ddangelov

1.0.24

e133bb1

Compare

Choose a tag to compare

View all tags

gensim version fix

Fixes #152

Assets 2

All reactions

1.0.23

12 Feb 16:06

ddangelov

1.0.23

95b1930

Compare

Choose a tag to compare

View all tags

1.0.23

Added numpy>=1.20.0 dependency.

Assets 2

All reactions

1.0.22

12 Feb 01:23

ddangelov

1.0.22

c67c866

Compare

Choose a tag to compare

View all tags

1.0.22

Numpy related bug fix and document id validation performance upgrade.

Assets 2

All reactions

added umap/hdbscan custom args

05 Feb 00:35

ddangelov

1.0.21

a2348c1

Compare

Choose a tag to compare

View all tags

added umap/hdbscan custom args

Addressed #90, #125, #126

Added custom umap and hdbscan arg option. Fixed issue with loading model with custom tokenizer.

Assets 2

All reactions

added use_embedding_model_tokenizer option

09 Jan 00:13

ddangelov

1.0.20

9686e44

Compare

Choose a tag to compare

View all tags

added use_embedding_model_tokenizer option

Added use_embedding_model_tokenizer parameter. If set to True and if using an embedding_model other than doc2vec, use the model's tokenizer for document embedding.

Fixed dependency issue with joblib.

Fixed issues with wordclouds caused by negative similarity scores.

Assets 2

All reactions

fix saving bug

10 Dec 22:25

ddangelov

1.0.19

c23e9a0

Compare

Choose a tag to compare

View all tags

fix saving bug

Fixed bug #91

Assets 2

All reactions

word indexing

10 Dec 01:32

ddangelov

1.0.18

c899af8

Compare

Choose a tag to compare

View all tags

word indexing

Added option for indexing word vectors, this will speed up search for models with large vocabularies. Specifically search_words_by_vector and similar_words.

Added new method search_words_by_vector.

Assets 2

All reactions

document indexing

07 Dec 21:00

ddangelov

1.0.17

a8acdf5

Compare

Choose a tag to compare

View all tags

document indexing

Added option for indexing document vectors, this will speed up search for models with large number of documents. Specifically search_documents_by_vector, search_documents_by_keywords, and search_documents_by_documents.

Added new method search_documents_by_vector.

Added code to prevent hierarchical topic reduction error #79.

Assets 2

All reactions

Separate dependencies

10 Nov 16:21

ddangelov

1.0.16

9271bdd

Compare

Choose a tag to compare

View all tags

Separate dependencies

Dependencies for universal sentence encoder and BERT sentence transformer options are now optional.
With pip install top2vec[sentence-encoders] and pip install top2vec[sentence_transformers]

Faster cosine similarity.

Assets 2