spaCy/spacy/lang/es
Adriane Boyd d17afb4826
Add Spanish rule-based lemmatizer (#6833)
* Initial Spanish lemmatizer

* Handle merged verb+pron(s) multi-word tokens

* Use VERB for AUX rule lookup

* Add morph to lemma cache key

* Fix aux lookups, minor refactoring

* Improve verb+pron handling

  * Move verb+pron handling into its own method
  * Check for exceptions (primarily for se)
  * Collect pronouns in the same (not reversed) order

* Only add modified possible lemmas
2021-01-27 19:21:35 +08:00
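
Two of the changes listed in the commit message ("Use VERB for AUX rule lookup" and "Add morph to lemma cache key") can be illustrated with a small self-contained sketch. This is not the code in lemmatizer.py: the class `ToyLemmatizer`, the `TOY_RULES` table, and the suffix rules in it are all hypothetical and exist only to show the idea.

```python
from typing import Dict, List, Tuple

# Hypothetical cache key: surface form, coarse POS tag, morphological features.
CacheKey = Tuple[str, str, str]

# Hypothetical suffix rules (old suffix -> new suffix), keyed by lowercase POS.
TOY_RULES: Dict[str, List[Tuple[str, str]]] = {
    "verb": [("ando", "ar"), ("iendo", "er")],
    "noun": [("s", "")],
}


class ToyLemmatizer:
    """Illustrative only; not spaCy's Spanish lemmatizer."""

    def __init__(self, rules: Dict[str, List[Tuple[str, str]]]) -> None:
        self.rules = rules
        self.cache: Dict[CacheKey, List[str]] = {}

    def lemmatize(self, text: str, pos: str, morph: str = "") -> List[str]:
        # Including morph in the key keeps analyses of the same form
        # with different features from sharing a cache entry.
        key = (text, pos, morph)
        if key in self.cache:
            return self.cache[key]

        # AUX tokens reuse the VERB rule table.
        rule_pos = "verb" if pos == "AUX" else pos.lower()

        form = text.lower()
        lemmas: List[str] = []
        for old, new in self.rules.get(rule_pos, []):
            if form.endswith(old):
                candidate = form[: len(form) - len(old)] + new
                if candidate not in lemmas:
                    lemmas.append(candidate)

        # Fall back to the lowercased form when no rule applies.
        if not lemmas:
            lemmas = [form]

        self.cache[key] = lemmas
        return lemmas
```

In this sketch, `ToyLemmatizer(TOY_RULES).lemmatize("estando", "AUX", "VerbForm=Ger")` goes through the `"verb"` rules, and calling it again with a different `morph` string creates a separate cache entry.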
__init__.py              Add Spanish rule-based lemmatizer (#6833)            2021-01-27 19:21:35 +08:00
examples.py              Merge branch 'develop' into master-tmp               2020-06-20 15:52:00 +02:00
lemmatizer.py            Add Spanish rule-based lemmatizer (#6833)            2021-01-27 19:21:35 +08:00
lex_attrs.py             Merge branch 'develop' into master-tmp               2020-05-21 18:39:06 +02:00
punctuation.py           Remove unicode declarations and tidy up              2020-06-21 22:34:10 +02:00
stop_words.py            Tidy up and auto-format                              2020-02-18 15:38:18 +01:00
syntax_iterators.py      Tidy up and auto-format                              2021-01-15 11:57:36 +11:00
tokenizer_exceptions.py  Remove POS, TAG and LEMMA from tokenizer exceptions  2020-07-22 23:09:01 +02:00