spaCy/spacy
Jani Monoses 0e08e49e87 Lemmatizer ro (#2319)
* Add Romanian lemmatizer lookup table.

Adapted from http://www.lexiconista.com/datasets/lemmatization/
by replacing the cedilla letters (ş and ţ) with their comma-below forms (ș and ț).

The original dataset is licensed under the Open Database License.

* Fix one blatant issue in the Romanian lemmatizer

* Romanian examples file

* Add ro_tokenizer in conftest

* Add Romanian lemmatizer test
2018-05-12 15:20:04 +02:00
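
The cedilla-to-comma adaptation described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the script used for the commit: the names `CEDILLA_TO_COMMA`, `normalize_romanian`, and `normalize_lookup` are hypothetical, and the lookup table is assumed to be a plain dict mapping inflected forms to lemmas.

```python
# Hypothetical helpers; not taken from the spaCy repository.
# Map the cedilla letters to the comma-below letters used in
# standard Romanian orthography.
CEDILLA_TO_COMMA = str.maketrans({
    "\u015f": "\u0219",  # ş -> ș
    "\u015e": "\u0218",  # Ş -> Ș
    "\u0163": "\u021b",  # ţ -> ț
    "\u0162": "\u021a",  # Ţ -> Ț
})


def normalize_romanian(text):
    """Replace cedilla letters in text with their comma-below forms."""
    return text.translate(CEDILLA_TO_COMMA)


def normalize_lookup(lookup):
    """Normalize both keys and values of a form -> lemma dict (assumed shape)."""
    return {
        normalize_romanian(form): normalize_romanian(lemma)
        for form, lemma in lookup.items()
    }
```

Applying `normalize_lookup` to a table whose entries still use the cedilla letters would yield a table keyed on the comma-below spellings, so lookups on text written with ș and ț can match.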
cli Fix formatting and consistency 2018-05-07 23:02:11 +02:00
data Make spacy/data a package 2017-03-18 20:04:22 +01:00
displacy Add collapse_phrases option to displacy (closes #2266) 2018-04-28 23:06:50 +02:00
lang Lemmatizer ro (#2319) 2018-05-12 15:20:04 +02:00
syntax Fix loading of models when custom vectors are added 2018-04-10 22:19:20 +02:00
tests Lemmatizer ro (#2319) 2018-05-12 15:20:04 +02:00
tokens Test and fix for Issue #2219 (#2272) 2018-05-03 18:40:46 +02:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py Revert "Check if spaCy has compiled correctly and show error message" 2018-04-06 15:49:44 +02:00
__main__.py
_ml.py 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
about.py Increment version to v2.0.12.dev0 2018-04-10 22:20:16 +02:00
attrs.pxd Fix LANG symbol 2018-02-17 18:10:50 +01:00
attrs.pyx code for is_currency 2018-02-11 18:51:32 +01:00
compat.py Fix urllib for Python 3 2018-03-29 00:19:33 +02:00
errors.py Improve error message when reading vectors 2018-04-10 21:26:50 +02:00
glossary.py Fix typo in glossary (resolves #1964) 2018-02-10 11:58:41 +01:00
gold.pxd
gold.pyx rename SP to _SP (#2289) 2018-05-03 18:33:49 +02:00
language.py Fix vector-name loading fix 2018-04-04 01:31:25 +02:00
lemmatizer.py If no rules are set, lemmatize by lookup 2017-12-06 12:12:11 +01:00
lexeme.pxd
lexeme.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
matcher.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
morphology.pxd fix typo/missing here too 2018-02-18 14:38:27 +00:00
morphology.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
parts_of_speech.pxd Add support for Universal Dependencies v2.0 2017-03-03 13:17:34 +01:00
parts_of_speech.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
pipeline.pxd
pipeline.pyx Fix loading of models when custom vectors are added 2018-04-10 22:19:20 +02:00
scorer.py 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
strings.pxd
strings.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
structs.pxd Make TokenC.sent_start an int, to allow ternary value 2017-10-08 19:58:54 +02:00
symbols.pxd Fix LANG symbol 2018-02-17 18:10:50 +01:00
symbols.pyx Add missing symbol for LANG attr. Fixes inconsistent numeric ID 2018-02-17 17:37:02 +01:00
tokenizer.pxd
tokenizer.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
typedefs.pxd Work on changing StringStore to return hashes. 2017-05-28 12:36:27 +02:00
typedefs.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
util.py 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
vectors.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
vocab.pxd
vocab.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00