Pavle Vidanović
|
d03401f532
|
Lemmatizer lookup dictionary for Serbian and basic tag set adde… (#4251)
* Serbian stopwords added (Cyrillic alphabet).
* spaCy Contribution agreement included.
* Test initialization updated.
* Serbian language code updated (bugfix).
* Tokenizer exceptions added. Init file updated.
* Norm exceptions and lexical attributes added.
* Examples added.
* Tests added.
* sr_lang examples update.
* Tokenizer exceptions updated. (Serbian)
* Lemmatizer created. Licence included.
* Test updated.
* Tag map basic added.
* tag_map.py file removed since it uses default spaCy tags.
|
2019-09-08 14:19:15 +02:00 |
|