Mirror of https://github.com/explosion/spaCy.git
Synced 2024-12-24 00:46:28 +03:00
b892b446cc
* Added the same punctuation rules as the Danish language * Added abbreviations, with the option of capitalized forms for some of them, and a few specific cases * Added a test for long texts in Swedish * Added morph rules, infixes and suffixes to __init__.py for Swedish * Added some tests for prefixes, infixes and suffixes * Added tests for lemmas * Renamed files to follow convention * [sv] Removed ambiguous abbreviations * Added more tests for tokenizer exceptions * Added a test for the punctuation problem in issue #2578 * Contributor agreement * Removed faulty lemmatization of 'jag' ('I'), which was lemmatized to 'jaga' ('hunt') |
||
---|---|---|
ar | ||
bn | ||
ca | ||
da | ||
de | ||
el | ||
en | ||
es | ||
fi | ||
fr | ||
ga | ||
he | ||
hu | ||
id | ||
ja | ||
nb | ||
ro | ||
ru | ||
sv | ||
th | ||
tr | ||
tt | ||
ur | ||
__init__.py | ||
test_attrs.py |