Mirror of https://github.com/explosion/spaCy.git, synced 2025-12-10 11:44:35 +03:00
* user_dict fields: adding inflections, reading_forms, sub_tokens
  deleting: unidic_tags
  improve code readability around the token alignment procedure
* add test cases, replace fugashi with sudachipy in conftest
* move bunsetu.py to spaCy Universe as a pipeline component BunsetuRecognizer
* tag is space -> both surface and tag are spaces
* consider len(text)==0

| File | | |
|---|---|---|
| __init__.py | ||
| examples.py | ||
| stop_words.py | ||
| syntax_iterators.py | ||
| tag_bigram_map.py | ||
| tag_map.py | ||
| tag_orth_map.py | ||