spaCy/spacy/lang
adrianeboyd 2164e71ea8
Improved Romanian tokenization for UD RRT (#5036)
Modifications to Romanian tokenization to improve tokenization for
UD_Romanian-RRT.
2020-02-19 16:15:59 +01:00
af
ar
bg
bn
ca
cs
da
de Fix/Improve german stop words (#5024) 2020-02-17 18:59:22 +01:00
el
en
es
et
fa
fi
fr
ga
he
hi
hr
hu
id
is
it
ja
kn
ko
lb
lt Move lookup tables out of the core library (#4346) 2019-10-01 00:01:27 +02:00
lv
mr
nb
nl
pl
pt
ro Improved Romanian tokenization for UD RRT (#5036) 2020-02-19 16:15:59 +01:00
ru
si
sk
sl
sq
sr
sv
ta
te
th
tl
tr
tt
uk
ur
vi
xx
yo
zh
__init__.py
char_classes.py
lex_attrs.py
norm_exceptions.py
punctuation.py
tag_map.py
tokenizer_exceptions.py