
WIP: Update model at runtime #21

Closed
wants to merge 3 commits into from

Conversation

bakwc
Owner

@bakwc bakwc commented Apr 13, 2018

Ability to add text fragments to a model at runtime; relates to #18

@bakwc bakwc added enhancement WIP Work In Progress labels Apr 13, 2018
@iosadchiy

A very useful feature. Is there any chance to see it merged?

Also, I don't quite understand how it updates the perfect hash: if the text fragment to be added contains an n-gram that is not present in the hash, how do you add it? Don't you have to re-calculate the perfect hash from scratch?

@bakwc
Owner Author

bakwc commented Jun 11, 2018

A very useful feature. Is there any chance to see it merged?

I'll try to continue working on this, but I'm not sure when.

Also, I don't quite understand how it updates the perfect hash: if the text fragment to be added contains an n-gram that is not present in the hash, how do you add it? Don't you have to re-calculate the perfect hash from scratch?

I planned to add a separate hash table for additional words. It would only be suitable for adding a small number of new words / sentences at runtime, for example in text editors where you want to add some exceptions / ignore some errors.
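The separate-table idea described above can be sketched roughly as follows (a minimal illustration with hypothetical names, not the PR's actual code; JamSpell itself is C++, and a plain dict stands in for the perfect-hash table built at training time):

```python
class OverlayModel:
    """Immutable base table plus a small mutable overlay for runtime additions."""

    def __init__(self, base_freqs):
        # base_freqs stands in for the perfect-hash table built at training
        # time; it is never modified after construction.
        self._base = dict(base_freqs)
        self._overlay = {}  # runtime additions live here

    def add_fragment(self, text):
        # Count units from the new fragment into the overlay only
        # (single words here, for simplicity; real models use n-grams).
        for word in text.lower().split():
            self._overlay[word] = self._overlay.get(word, 0) + 1

    def frequency(self, word):
        # Overlay counts are added on top of the base frequency, so words
        # unknown to the perfect hash still get a nonzero count.
        return self._base.get(word, 0) + self._overlay.get(word, 0)
```

The perfect hash stays untouched, which is why this only works for a small volume of runtime additions: the overlay is an ordinary mutable table that grows with every added fragment.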

@iosadchiy

Actually, what I'm trying to do is to first create the model based on n-gram frequencies (#31) and then update it with some domain-specific texts. What could be the best approach here?

It seems like the functionality from this PR is not the best choice (due to the use of an additional hash table).

The other approach is to store all the n-grams in the model (without loading them into memory). This should allow re-training the model on additional data.

Maybe you'll suggest something better?

@bakwc
Owner Author

bakwc commented Jun 16, 2018

The other approach is to store all the ngrams in the model (without loading them into memory). This should allow to re-train the model on additional data.

It could lead to performance issues if we go to disk every time we need to get a frequency.

@Jbiloki

Jbiloki commented Apr 10, 2019

Is there a main function to invoke these commands?

@rprilepskiy

@bakwc, do you (by any chance) have plans to solve the conflicts?

@bakwc
Owner Author

bakwc commented Jan 27, 2020

Sorry, I currently have no time :(

@bakwc
Owner Author

bakwc commented Oct 1, 2020

Done in Pro version.

JamSpellPro is available at jamspell.com

@bakwc bakwc closed this Oct 1, 2020
4 participants