mirror of
https://github.com/explosion/spaCy.git
synced 2024-11-11 04:08:09 +03:00
150a39ccca
* user_dict fields: adding inflections, reading_forms, sub_tokens deleting: unidic_tags improve code readability around the token alignment procedure * add test cases, replace fugashi with sudachipy in conftest * move bunsetu.py to spaCy Universe as a pipeline component BunsetuRecognizer * tag is space -> both surface and tag are spaces * consider len(text)==0 |
||
---|---|---|
.. | ||
__init__.py | ||
test_lemmatization.py | ||
test_serialize.py | ||
test_tokenizer.py |