spaCy/spacy/tests/lang
Hiroshi Matsuda 150a39ccca
Japanese model: add user_dict entries and small refactor (#5573)
* user_dict fields: adding inflections, reading_forms, sub_tokens
  deleting: unidic_tags
  improve code readability around the token alignment procedure

* add test cases, replace fugashi with sudachipy in conftest

* move bunsetu.py to spaCy Universe as a pipeline component BunsetuRecognizer

* tag is space -> both surface and tag are spaces

* consider len(text)==0
2020-06-22 14:32:25 +02:00
ar
bn
ca
da
de Tidy up and auto-format 2020-05-21 14:14:01 +02:00
el Tidy up and auto-format 2020-05-21 14:14:01 +02:00
en Add missing pronoums/determiners (#5569) 2020-06-10 18:47:04 +02:00
es Tidy up and auto-format 2020-05-21 14:14:01 +02:00
eu
fa
fi
fr Tidy up and auto-format 2020-05-21 14:14:01 +02:00
ga
gu Tidy up and auto-format 2020-05-21 14:14:01 +02:00
he
hu
hy Add missing declaration 2020-05-21 17:30:05 +02:00
id Tidy up and auto-format 2020-05-21 14:14:01 +02:00
it
ja Japanese model: add user_dict entries and small refactor (#5573) 2020-06-22 14:32:25 +02:00
ko
lb
lt
ml Tidy up and auto-format 2020-05-21 14:14:01 +02:00
nb Tidy up and auto-format 2020-05-21 14:14:01 +02:00
ne Add Nepali Language (#5622) 2020-06-22 10:25:46 +02:00
nl
pl
pt
ro
ru
sr
sv Tidy up and auto-format 2020-05-21 14:14:01 +02:00
th Revert #4334 2019-09-29 17:32:12 +02:00
tr
tt
uk
ur
yo
zh Tidy up and auto-format 2020-05-21 14:14:01 +02:00
__init__.py
test_attrs.py
test_initialize.py