Emil Stenström
3834f4146d
Add abbreviations from UD_Swedish-Talbanken (#2613)
* Add abbreviations from UD_Swedish-Talbanken
* Add contributor agreement.
2018-08-07 13:53:17 +02:00
Emil Stenström
1914c488d3
Swedish: Exceptions for single letter words ending sentence (#2615)
* Exceptions for single letter words ending sentence
A sentence-final "i." (as in "... peka i.") or "m." (as in "... än 2000 m.") should be tokenized as two separate tokens.
* Add test
2018-08-05 14:14:30 +02:00
ines
acb9bdb852
Fix PRON_LEMMA imports
2017-11-06 17:41:53 +01:00
ines
819e30a26e
Tidy up tokenizer exceptions
2017-11-01 23:02:45 +01:00
ines
7e424a1804
Don't copy exception dicts if not necessary and tidy up
2017-10-31 21:05:29 +01:00
ines
73b577cb01
Fix relative imports
2017-05-08 22:29:04 +02:00
ines
f46ffe3e89
Move language data to /lang module
2017-05-08 20:00:40 +02:00