spaCy/spacy/lang
Adriane Boyd 30030176ee
Update Korean defaults for Tokenizer (#10322)
Update Korean defaults for `Tokenizer` for tokenization following UD
Korean Kaist.
2022-02-21 10:26:19 +01:00
..
af
am
ar
az
bg
bn
ca
cs
da
de
el
en
es
et
eu
fa
fi
fr Auto-format code with black (#10333) 2022-02-21 09:15:42 +01:00
ga
grc
gu
he
hi
hr
hu Ignore prefix in suffix matches (#9155) 2021-10-27 13:02:25 +02:00
hy
id
is
it
ja
kn
ko Update Korean defaults for Tokenizer (#10322) 2022-02-21 10:26:19 +01:00
ky
lb
lij
lt
lv
mk
ml
mr
nb
ne
nl
pl
pt
ro
ru
sa
si
sk
sl
sq
sr
sv
ta
te
th
ti
tl
tn
tr
tt
uk
ur
vi
xx
yo
zh
__init__.py
char_classes.py
lex_attrs.py
norm_exceptions.py
punctuation.py
tokenizer_exceptions.py