- Fix
failure: length > 1 in coercion to logical
issue inn_gram_merge()
, in commit 4336ee6.
- Package is now linking to the
stringdist
C API, and calling C functions in place of usingstringdist::stringdistmatrix()
. This change results in speed improvements in functionn_gram_merge()
, and requires thatstringdist
v0.9.5.1 or greater be installed.
- In function
n_gram_merge()
, renamed argedit_dist_weights
toweight
. The only purpose of this arg is to be passed along to functionstringdistmatrix
from the stringdist package (which uses the nameweight
, so this change is simply to match that).
-
Fixed issue in which input strings that contained accent marks were not being properly handled/clustered (#9). The fix involved adding stringi to
Imports
and usingstringi::stri_trans_general()
. -
Fixed issue in
n_gram_merge()
in which incorrect values were being return when input argignore_strings
was notNULL
, and argbus_suffix = FALSE
(#7). -
Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values (#6).
-
Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster (#5).
- Rewrote some of the cpp functions to incorporate
std::unordered_map()
, resulting in a substantial speed improvement when passing large character vectors (length 100,000+) to either of the exported functions (#8).
- released on CRAN