spaCy/spacy/lang/ru
Richard Hudson a9559e7435
Handle Cyrillic combining diacritics (#10837)
* Handle Russian, Ukrainian and Bulgarian

* Corrections

* Correction

* Correction to comment

* Changes based on review

* Correction

* Reverted irrelevant change in punctuation.py

* Remove unnecessary group

* Reverted accidental change
2022-06-28 15:35:32 +02:00
__init__.py              Handle Cyrillic combining diacritics (#10837)          2022-06-28 15:35:32 +02:00
examples.py              Tidy up and auto-format                                2020-02-18 15:38:18 +01:00
lemmatizer.py            Format (#9630)                                         2021-11-05 09:56:26 +01:00
lex_attrs.py             Extend list of _num_words (#10468)                     2022-03-16 18:25:42 +01:00
stop_words.py            Extend list of stopwords for ru language (#10313)      2022-02-17 15:51:15 +01:00
tokenizer_exceptions.py  Extend list of abbreviations for ru language (#10282)  2022-02-17 15:48:50 +01:00