spaCy/spacy/lang/ru
Paul O'Leary McCann fd759a881b
Fix inconsistent lemmas (#9405)
* Add util function to deduplicate lists while preserving order

* Use unique function instead of list(set())

list(set()) does not produce a consistent order between runs of the
Python interpreter, so the order of the results can vary.

list(set()) calls were left in a few places where they were behind calls
to sorted(). I think in this case the calls to list() can be removed,
but this commit doesn't do that.

* Use the existing pattern for this
2021-10-11 11:38:45 +02:00
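
The ordering problem described in the commit message can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper name `unique_preserving_order` and the sample word list are hypothetical. It relies on dict keys keeping insertion order, which is guaranteed since Python 3.7, whereas the iteration order of a set of strings depends on hash values that are randomized per interpreter process unless PYTHONHASHSEED is fixed.

```python
def unique_preserving_order(items):
    """Return the items with duplicates removed, keeping first-seen order.

    Hypothetical helper for illustration only; not necessarily the
    function that the commit adds.
    """
    # dict.fromkeys keeps the first occurrence of each key, in insertion order.
    return list(dict.fromkeys(items))


# Made-up candidate list with a repeated entry.
candidates = ["стать", "стали", "стать", "сталь"]

# Deterministic: the same order on every run.
print(unique_preserving_order(candidates))

# Not deterministic across interpreter runs: set order follows string hashes.
print(list(set(candidates)))
```

Sorting the result, as in the sorted() call sites the commit message mentions, also gives a stable order, but it discards the original ranking of the items.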
| File | Last commit | Date |
| --- | --- | --- |
| __init__.py | Implement overwrite param for all custom lemmatizers (#6794) | 2021-01-26 14:53:43 +11:00 |
| examples.py | Tidy up and auto-format | 2020-02-18 15:38:18 +01:00 |
| lemmatizer.py | Fix inconsistent lemmas (#9405) | 2021-10-11 11:38:45 +02:00 |
| lex_attrs.py | Drop Python 2.7 and 3.5 (#4828) | 2019-12-22 01:53:56 +01:00 |
| stop_words.py | Drop Python 2.7 and 3.5 (#4828) | 2019-12-22 01:53:56 +01:00 |
| tokenizer_exceptions.py | Remove POS, TAG and LEMMA from tokenizer exceptions | 2020-07-22 23:09:01 +02:00 |