| .. |
|
af
|
|
|
|
ar
|
|
|
|
bg
|
|
|
|
bn
|
|
|
|
ca
|
|
|
|
cs
|
|
|
|
da
|
Tidy up and auto-format
|
2020-05-21 14:14:01 +02:00 |
|
de
|
Rename argument: doc_or_span/obj -> doclike (#5463)
|
2020-05-21 15:17:39 +02:00 |
|
el
|
span / noun chunk has +1 because end is exclusive
|
2020-05-21 19:56:56 +02:00 |
|
en
|
Move lemmatizer is_base_form to language settings (#5663)
|
2020-06-29 14:16:57 +02:00 |
|
es
|
Spanish tokenizer exception and examples improvement (#5531)
|
2020-06-01 18:18:34 +02:00 |
|
et
|
|
|
|
eu
|
|
|
|
fa
|
span / noun chunk has +1 because end is exclusive
|
2020-05-21 19:56:56 +02:00 |
|
fi
|
|
|
|
fr
|
corrected issue #5524 changed <U+009C> 'STRING TERMINATOR' for <U+0153> LATIN SMALL LIGATURE OE' (#5526)
|
2020-05-31 22:08:12 +02:00 |
|
ga
|
|
|
|
gu
|
Tidy up and auto-format
|
2020-05-21 14:14:01 +02:00 |
|
he
|
|
|
|
hi
|
|
|
|
hr
|
|
|
|
hu
|
|
|
|
hy
|
Some changes for Armenian (#5616)
|
2020-06-22 08:50:34 +02:00 |
|
id
|
span / noun chunk has +1 because end is exclusive
|
2020-05-21 19:56:56 +02:00 |
|
is
|
|
|
|
it
|
Tidy up and auto-format
|
2020-03-25 12:28:12 +01:00 |
|
ja
|
Revert "Convert custom user_data to token extension format for Japanese tokenizer (#5652)" (#5665)
|
2020-06-29 14:34:15 +02:00 |
|
kn
|
|
|
|
ko
|
|
|
|
lb
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
lij
|
|
|
|
lt
|
Tidy up and auto-format
|
2020-03-25 12:28:12 +01:00 |
|
lv
|
|
|
|
ml
|
Tidy up and auto-format
|
2020-05-21 14:14:01 +02:00 |
|
mr
|
|
|
|
nb
|
span / noun chunk has +1 because end is exclusive
|
2020-05-21 19:56:56 +02:00 |
|
ne
|
Add Nepali Language (#5622)
|
2020-06-22 10:25:46 +02:00 |
|
nl
|
|
|
|
pl
|
Fix Polish lemmatizer for deserialized models
|
2020-05-26 09:56:12 +02:00 |
|
pt
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
ro
|
|
|
|
ru
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
si
|
|
|
|
sk
|
|
|
|
sl
|
|
|
|
sq
|
Update languages and examples (see #1107)
|
2019-06-26 16:19:17 +02:00 |
|
sr
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
sv
|
span / noun chunk has +1 because end is exclusive
|
2020-05-21 19:56:56 +02:00 |
|
ta
|
Added Tamil Example Sentences (#5583)
|
2020-06-13 15:56:26 +02:00 |
|
te
|
|
|
|
th
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
tl
|
Move lookup tables out of the core library (#4346)
|
2019-10-01 00:01:27 +02:00 |
|
tr
|
|
|
|
tt
|
|
|
|
uk
|
|
|
|
ur
|
Tidy up and auto-format
|
2020-05-21 14:14:01 +02:00 |
|
vi
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
xx
|
|
|
|
yo
|
|
|
|
zh
|
Map NR to PROPN (#5512)
|
2020-05-26 22:30:53 +02:00 |
|
__init__.py
|
|
|
|
char_classes.py
|
|
|
|
lex_attrs.py
|
Reduce stored lexemes data, move feats to lookups (#5238)
|
2020-05-19 15:59:14 +02:00 |
|
norm_exceptions.py
|
|
|
|
punctuation.py
|
|
|
|
tag_map.py
|
|
|
|
tokenizer_exceptions.py
|
Rename to url_match
|
2020-05-22 12:41:03 +02:00 |