Mirror of https://github.com/explosion/spaCy.git (synced 2025-11-12 13:55:48 +03:00)
* Don't split on a colon. A colon is used to attach suffixes to abbreviations.
* Tokenize on any of LIST_HYPHENS (except a single hyphen), not just on `--`.
* Simplify infix rules by merging similar rules.

| Name |
|---|
| .. |
| __init__.py |
| test_text.py |
| test_tokenizer.py |
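
The infix rule changes listed above can be illustrated with a minimal, standalone sketch. It does not reproduce the actual rules in the repository: `HYPHENS`, `INFIX_RE`, and `split_infixes` are hypothetical names, and the hyphen list is an assumed stand-in for `LIST_HYPHENS`, whose real contents are not shown here. The sketch only demonstrates the stated behaviour: a colon and a single hyphen never split a chunk, while any other hyphen variant between two letters does.

```python
import re

# Assumed stand-in for a LIST_HYPHENS-style list (not the library's real values).
HYPHENS = ["-", "\u2013", "\u2014", "--", "---", "~"]

# Infix candidates: every hyphen variant except the single ASCII hyphen.
INFIX_HYPHENS = [h for h in HYPHENS if h != "-"]

# Put longer alternatives first so "---" is not consumed as "--".
_alternation = "|".join(
    re.escape(h) for h in sorted(INFIX_HYPHENS, key=len, reverse=True)
)

# Split only when the hyphen variant sits between two letters.
# Note that ":" is deliberately absent, so "EU:n" stays in one piece.
INFIX_RE = re.compile(r"(?<=[^\W\d_])(" + _alternation + r")(?=[^\W\d_])")


def split_infixes(chunk):
    """Split a whitespace-free chunk on infix hyphens, keeping the hyphens."""
    return [piece for piece in INFIX_RE.split(chunk) if piece]


if __name__ == "__main__":
    for text in ["well--known", "well-known", "EU:n", "a---b"]:
        print(text, split_infixes(text))
```

Merging all hyphen variants into one alternation mirrors the "merge similar rules" idea: one pattern covers every variant, so adding a new hyphen character only means extending the list.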