Mirror of https://github.com/explosion/spaCy.git (synced 2024-12-26 01:46:28 +03:00)
e1f777b151
* don't split on a colon. Colon is used to attach suffixes for abbreviations
* tokenize on any of LIST_HYPHENS (except a single hyphen), not just on --
* simplify infix rules by merging similar rules

| Name |
|---|
| .. |
| `__init__.py` |
| `test_text.py` |
| `test_tokenizer.py` |
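
The infix behaviour described in the commit message can be illustrated with a small standalone sketch. It uses only Python's `re` module and is not spaCy's actual code: the `HYPHENS` list, `INFIX_RE`, and `split_infixes` are hypothetical names chosen for illustration, and the hyphen characters are an assumed stand-in for whatever `LIST_HYPHENS` contains.

```python
import re

# Hypothetical stand-in for a hyphen list; NOT copied from spaCy's LIST_HYPHENS.
HYPHENS = ["-", "\u2013", "\u2014", "--", "---", "~"]

# Drop the single hyphen so that words like "well-known" stay whole,
# and sort longest first so "---" is tried before "--".
INFIX_HYPHENS = sorted((h for h in HYPHENS if h != "-"), key=len, reverse=True)

# One merged rule: split on any remaining hyphen variant between two letters.
# The colon is deliberately absent, so "abbr:suffix" forms are not split.
INFIX_RE = re.compile(
    r"(?<=[^\W\d_])("
    + "|".join(re.escape(h) for h in INFIX_HYPHENS)
    + r")(?=[^\W\d_])"
)


def split_infixes(token):
    """Split a token on the infix pattern, keeping the separators as pieces."""
    return [piece for piece in INFIX_RE.split(token) if piece]


if __name__ == "__main__":
    for text in ["foo--bar", "well-known", "tv:n", "a---b"]:
        print(text, split_infixes(text))
```

Under these assumptions, a double hyphen between letters yields three pieces, while a single hyphen or a colon leaves the token intact. Tests in `test_tokenizer.py` would presumably check the real tokenizer in a similar spirit, though their contents are not shown here.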