1
1
mirror of https://github.com/explosion/spaCy.git synced 2025-01-18 21:44:12 +03:00
Commit Graph

1 Commits

Author SHA1 Message Date
Jani Monoses
0e08e49e87 Lemmatizer ro ()
* Add Romanian lemmatizer lookup table.

Adapted from http://www.lexiconista.com/datasets/lemmatization/
by replacing cedillas with commas (ș and ț).

The original dataset is licensed under the Open Database License.

* Fix one blatant issue in the Romanian lemmatizer

* Romanian examples file

* Add ro_tokenizer in conftest

* Add Romanian lemmatizer test
2018-05-12 15:20:04 +02:00