spaCy/spacy/lang/sr
Adriane Boyd 6f314f99c4
Use Latin normalization for Serbian attrs (#12608)
* Use Latin normalization for Serbian attrs

Use Latin normalization for Serbian `NORM`, `PREFIX`, and `SUFFIX`.

* Update NORMs in tokenizer exceptions and related tests

* Add tests for all custom lex attrs

* Remove unused imports
2023-05-08 12:33:56 +02:00
__init__.py Update Serbian tokenization for UD Serbian SET (#12442) 2023-03-24 16:26:40 +01:00
examples.py Tidy up and auto-format 2020-02-18 15:38:18 +01:00
lemma_lookup_licence.txt Lemmatizer lookup dictionary for Serbian and basic tag set adde… (#4251) 2019-09-08 14:19:15 +02:00
lex_attrs.py Use Latin normalization for Serbian attrs (#12608) 2023-05-08 12:33:56 +02:00
punctuation.py Update Serbian tokenization for UD Serbian SET (#12442) 2023-03-24 16:26:40 +01:00
stop_words.py Tidy up and auto-format 2020-02-18 15:38:18 +01:00
tokenizer_exceptions.py Use Latin normalization for Serbian attrs (#12608) 2023-05-08 12:33:56 +02:00
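
The latest commit above says the Serbian `NORM`, `PREFIX`, and `SUFFIX` attributes now use Latin normalization. As a rough illustration of what that could mean, here is a minimal, standalone sketch that transliterates Serbian Cyrillic to Latin before computing the three values. It is not the code in `lex_attrs.py`: the names `CYR_TO_LAT`, `to_latin`, `norm`, `prefix`, and `suffix` are hypothetical, and lowercasing the norm, taking the first character as the prefix, and taking the last three characters as the suffix are assumptions made for this example.

```python
# Hypothetical sketch only; NOT the implementation in spacy/lang/sr/lex_attrs.py.
# Maps the 30 letters of the Serbian Cyrillic alphabet to Serbian Latin.
CYR_TO_LAT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ", "е": "e",
    "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k", "л": "l", "љ": "lj",
    "м": "m", "н": "n", "њ": "nj", "о": "o", "п": "p", "р": "r", "с": "s",
    "т": "t", "ћ": "ć", "у": "u", "ф": "f", "х": "h", "ц": "c", "ч": "č",
    "џ": "dž", "ш": "š",
}


def to_latin(text: str) -> str:
    """Transliterate Serbian Cyrillic to Latin; other characters pass through."""
    out = []
    for ch in text:
        lower = ch.lower()
        if lower in CYR_TO_LAT:
            lat = CYR_TO_LAT[lower]
            # Uppercase Cyrillic letters become title-cased Latin (Љ -> Lj).
            out.append(lat.capitalize() if ch != lower else lat)
        else:
            out.append(ch)
    return "".join(out)


def norm(text: str) -> str:
    # Assumption: the norm is the lowercased Latin form.
    return to_latin(text).lower()


def prefix(text: str) -> str:
    # Assumption: the prefix is the first character of the Latin form.
    return to_latin(text)[:1]


def suffix(text: str) -> str:
    # Assumption: the suffix is the last three characters of the Latin form.
    return to_latin(text)[-3:]
```

With this sketch, a Cyrillic token and its Latin spelling produce the same values, for example `norm("Србија")` and `norm("Srbija")`, which is the practical point of normalizing to one script.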