mirror of https://github.com/explosion/spaCy.git synced 2025-01-16 12:36:23 +03:00
spaCy/licenses
Adriane Boyd 1d59fdbd39
Update Vietnamese tokenizer ()
* Adapt tokenization methods from `pyvi` to preserve text encoding and
whitespace
* Add serialization support similar to Chinese and Japanese

Note: as for Chinese and Japanese, some settings are duplicated in
`config.cfg` and `tokenizer/cfg`.
2021-05-17 18:16:20 +10:00
3rd_party_licenses.txt Update Vietnamese tokenizer () 2021-05-17 18:16:20 +10:00