spaCy/spacy/tests/lang
Emil Stenström 1914c488d3 Swedish: Exceptions for single letter words ending sentence (#2615)
* Exceptions for single letter words ending sentence

Sentences ending in "i." (as in "... peka i."), "m." (as in "...än 2000 m."), should be tokenized as two separate tokens.

* Add test
2018-08-05 14:14:30 +02:00
..
ar Add Arabic language (#2314) 2018-05-15 00:27:19 +02:00
bn Move language-specific tests to tests/lang 2017-05-09 00:02:37 +02:00
da Add Danish lemmatizer (#2184) 2018-04-07 19:07:28 +02:00
de German lemmatizer additions (#2529) 2018-07-09 11:10:15 +02:00
el Add support for Greek language (#2535) 2018-07-10 13:48:38 +02:00
en Drop six and related hacks as a dependency 2018-03-28 10:45:25 +02:00
es Move language-specific tests to tests/lang 2017-05-09 00:02:37 +02:00
fi Move language-specific tests to tests/lang 2017-05-09 00:02:37 +02:00
fr Try to fix memory error by moving fr_tokenizer to module scope 2018-07-24 20:09:06 +02:00
ga merge 2017-10-31 22:55:59 +00:00
he Move language-specific tests to tests/lang 2017-05-09 00:02:37 +02:00
hu Update tests 2017-06-05 02:09:27 +02:00
id added {pre,suf,in}fix tests 2017-08-20 13:43:00 +07:00
ja Add Japanese lemmas (#2543) 2018-07-13 10:55:14 +02:00
nb Move language-specific tests to tests/lang 2017-05-09 00:02:37 +02:00
ro Updates to Romanian support (#2354) 2018-05-24 11:40:00 +02:00
ru Added tag map, fixed tests fails, added more exceptions 2017-11-26 20:54:48 +03:00
sv Swedish: Exceptions for single letter words ending sentence (#2615) 2018-08-05 14:14:30 +02:00
th add thai in spacy2 2017-09-26 21:36:27 +07:00
tr Adds Turkish Lemmatization 2017-12-01 17:04:32 +03:00
tt Add Tatar Language Support (#2444) 2018-06-19 10:17:53 +02:00
ur Add Urdu Language Support (#2430) 2018-06-22 11:14:03 +02:00
__init__.py Remove imports in /lang/__init__.py 2017-05-08 23:58:07 +02:00
test_attrs.py added lex test for is_currency 2018-02-11 18:50:50 +01:00