4 lines
211 B
Plaintext
4 lines
211 B
Plaintext
The segments package provides Unicode Standard tokenization routines and
|
|
orthography segmentation, implementing the linear algorithm described in
|
|
the orthography profile specification from The Unicode Cookbook.
|