Files
ports/textproc/py-segments/pkg-descr

4 lines
211 B
Plaintext

The segments package provides Unicode Standard tokenization routines and
orthography segmentation, implementing the linear algorithm described in
the orthography profile specification from The Unicode Cookbook.