1
1
mirror of https://github.com/explosion/spaCy.git synced 2025-03-12 23:35:47 +03:00
spaCy/spacy/tests/lang/ru
Adriane Boyd 30d31fd335
Update Russian and Ukrainian lemmatizers ()
* pymorph2 issues , , :
- : pymorphy2_lookup
- : handle multiple forms pointing to the same normal form + handling empty POS tag
- : matching DET that are labelled as PRON by pymorhp2

* Move lemmatizer algorithm changes back into RussianLemmatizer

* Fix uk pymorphy3_lookup mode init

* Move and update tests for ru/uk lookup lemmatizer modes

* Fix typo

* Remove traces of previous behavior for uninflected POS

* Refactor to private generic-looking pymorphy methods

* Remove xfailed uk lemmatizer cases

* Update spacy/lang/ru/lemmatizer.py

Co-authored-by: Richard Hudson <richard@explosion.ai>

Co-authored-by: Dmytro S Lituiev <d.lituiev@gmail.com>
Co-authored-by: Richard Hudson <richard@explosion.ai>
2022-11-25 11:12:46 +01:00
..
__init__.py Revert 2019-09-29 17:32:12 +02:00
test_exceptions.py Drop Python 2.7 and 3.5 () 2019-12-22 01:53:56 +01:00
test_lemmatizer.py Update Russian and Ukrainian lemmatizers () 2022-11-25 11:12:46 +01:00
test_text.py Drop Python 2.7 and 3.5 () 2019-12-22 01:53:56 +01:00
test_tokenizer.py Handle Cyrillic combining diacritics () 2022-06-28 15:35:32 +02:00