spaCy/spacy/lang
Alex Combessie 9cc880014c
Remove questionable French stopwords (#6310)
* Remove questionable French stopwords

* Create alexcombessie.md
2021-01-08 11:36:22 +11:00
..
af
am Add Amharic አማርኛ Language support (#6583) 2020-12-22 16:50:34 +01:00
ar
bg
bn Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
ca
cs Add tag map to cs language (#6284) 2020-11-05 10:13:11 +01:00
da Add (noun chunks) syntax iterators for Danish (#6246) 2021-01-07 16:33:00 +11:00
de Fix overlapping German noun chunks (#6112) 2020-09-22 21:52:42 +02:00
el
en remove cause without apostrophe from norm exceptions (#6636) 2021-01-06 12:30:30 +08:00
es Fix span boundary handling in Spanish noun_chunks (#5860) 2020-08-03 13:53:15 +02:00
et
eu Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
fa
fi
fr Remove questionable French stopwords (#6310) 2021-01-08 11:36:22 +11:00
ga
gu
he Hebrew like num (#5952) 2020-08-24 14:30:05 +02:00
hi Hindi: Adds tests for lexical attributes (norm and like_num) (#5829) 2020-10-07 10:23:32 +02:00
hr Added Multext-East V5 tagset for Croatian language (#6248) 2020-11-05 12:19:22 +01:00
hu
hy Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
id Update Indonesian Example Phrases (#6124) 2020-09-23 14:02:26 +02:00
is
it
ja fix ja leading spaces (#5969) 2020-08-25 14:16:24 +02:00
kn
ko fix bug in Korean language, resulting in 100x speedup by reducing overhead of mecab (#5701) 2020-07-06 17:03:33 +02:00
lb
lij
lt
lv
mk Include Macedonian language (#6230) 2020-10-15 15:55:01 +02:00
ml
mr
nb
ne Add Nepali Language (#5622) 2020-06-22 10:25:46 +02:00
nl
pl Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
pt Update stop_words.py in Portuguese (a,o,e) (#6345) 2021-01-08 11:35:38 +11:00
ro add new Romanian stopwords (#6621) 2021-01-08 11:34:47 +11:00
ru Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
sa Added support for Sanskrit language (#5956) 2020-08-25 10:56:29 +02:00
si
sk
sl
sq
sr
sv Update morph_rules.py (#6102) 2020-10-06 15:14:47 +02:00
ta Update spacy/lang/ta/examples.py 2020-10-13 11:03:35 +02:00
te
th Add Thai tag map (LST20 Corpus) (#6163) 2020-10-07 11:12:01 +02:00
ti Add Amharic አማርኛ Language support (#6583) 2020-12-22 16:50:34 +01:00
tl
tr Turkish tokenization improvements (#6268) 2020-10-29 09:43:17 +01:00
tt
uk
ur
vi
xx
yo
zh Update pkuseg version (#5774) 2020-07-19 11:09:49 +02:00
__init__.py
char_classes.py Add Amharic አማርኛ Language support (#6583) 2020-12-22 16:50:34 +01:00
lex_attrs.py Hebrew like num (#5952) 2020-08-24 14:30:05 +02:00
norm_exceptions.py
punctuation.py
tag_map.py
tokenizer_exceptions.py Fix raw strings in URL pattern (#5972) 2020-08-26 04:00:49 +02:00