Changelog
Source:NEWS.md
readabilityes 0.1.0 (2025-08-11)
What’s new
-
Sentence segmentation
-
segment_sentences()
: protects abbreviations (e.g., Dr., etc.), initials (J. R. R.), decimals, and ellipses; supportsextra_abbr
,keep_delim
,normalize_spacing
. -
count_sentences()
: counts sentences per element, withna_as_zero
anddrop_empty
.
-
-
Tokenization and counts
-
tokenize_words()
: optionslowercase
,keep_accents
,strip_punct
,keep_hyphens
,remove_numbers
,strip_symbols
,flatten
. -
count_words()
: token count per element (defensive flag normalization withisTRUE()
).
-
-
Syllabification (ES)
-
syll_split()
,syll_count()
,syll_hyphenate()
: deterministic core with diphthongs/triphthongs and C+l/r clusters; optional cache viasyll_cache_info()
/syll_cache_clear()
.
-
-
Average metrics
-
Readability indexes
Changes and fixes
-
Segmentation regex: replaced variable-length lookbehind with a PCRE-compatible pattern to avoid
lookbehind assertion is not fixed length
. -
count_words()
and derived functions: defensive coercion of logical flags (isTRUE()
). - Documentation updated: new
strip_symbols
andflatten
arguments intokenize_words()
. - Expanded testthat suite and README examples in ES/EN.