Mirror of https://github.com/explosion/spaCy.git, synced 2025-10-27 14:11:04 +03:00
* Exceptions for single letter words ending sentence

  Sentences ending in "i." (as in "... peka i."), "m." (as in "...än 2000 m."), should be tokenized as two separate tokens.

* Add test

| Name |
|---|
| .. |
| __init__.py |
| test_tokenizer.py |
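
The behaviour described in the commit message above can be illustrated with a small, self-contained sketch. This is not spaCy's tokenizer or its exception format: `SPLIT_EXCEPTIONS` and `tokenize` are hypothetical names, and the sketch assumes a naive whitespace tokenizer that otherwise leaves chunks such as "t.ex." untouched. It only shows the idea of an exception table that forces "i." and "m." to become two tokens.

```python
# Toy illustration only; this is NOT spaCy's implementation.
# SPLIT_EXCEPTIONS and tokenize() are hypothetical names for this sketch.
SPLIT_EXCEPTIONS = {
    "i.": ["i", "."],  # as in "peka i."
    "m.": ["m", "."],  # as in "än 2000 m."
}


def tokenize(text):
    """Split on whitespace, expanding any chunk listed in SPLIT_EXCEPTIONS."""
    tokens = []
    for chunk in text.split():
        # An exception entry overrides the default of keeping the chunk whole.
        tokens.extend(SPLIT_EXCEPTIONS.get(chunk, [chunk]))
    return tokens


print(tokenize("än 2000 m."))  # ['än', '2000', 'm', '.']
```

A lookup table keeps the special cases explicit: chunks without an entry, such as an abbreviation containing periods, pass through unchanged, so only the listed single-letter words are split from their final period.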