mirror of
https://github.com/explosion/spaCy.git
synced 2026-01-07 09:11:12 +03:00
* user_dict fields: adding inflections, reading_forms, sub_tokens deleting: unidic_tags improve code readability around the token alignment procedure * add test cases, replace fugashi with sudachipy in conftest * move bunsetu.py to spaCy Universe as a pipeline component BunsetuRecognizer * tag is space -> both surface and tag are spaces * consider len(text)==0 |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| test_lemmatization.py | ||
| test_serialize.py | ||
| test_tokenizer.py | ||