- Breaking Change - refactoring
limpiar_emojis
intolimpiar_recode_emojis
andlimpiar_remove_emojis
-limpiar_emojis
no longer exists so code using it would need to be updated. - Introduced
limpiar_non_ascii
andlimpiar_alphanumeric
to make removing special characters quicker and easier - Introduced
limpiar_wrap_string
for reading outputs in {DT}'s Data Tables and {plotly} hovers. - Introduced a vignette -
near_duplicates.Rmd
for using thelimpiar_spam_grams
function to remove near duplicates. - Updated the
limpiar_intro.Rmd
vignette. - behind doors improvements to documentation, error handling, and tests
- Import & download models with
limpiar_pos_import
- Multilingual
- Caching support
- Annotate models with
limpiar_pos_annotate
- Removal of the
limpiar_df
function (be mindful about individual cleaning steps!)