spaCy

mirror of https://github.com/explosion/spaCy.git synced 2024-12-26 18:06:29 +03:00

History

Adriane Boyd d17afb4826 Add Spanish rule-based lemmatizer (#6833 ) * Initial Spanish lemmatizer * Handle merged verb+pron(s) multi-word tokens * Use VERB for AUX rule lookup * Add morph to lemma cache key * Fix aux lookups, minor refactoring * Improve verb+pron handling * Move verb+pron handling into its own method * Check for exceptions (primarily for se) * Collect pronouns in the same (not reversed) order * Only add modified possible lemmas		2021-01-27 19:21:35 +08:00
..
__init__.py	Add Spanish rule-based lemmatizer (#6833 )	2021-01-27 19:21:35 +08:00
examples.py	Merge branch 'develop' into master-tmp	2020-06-20 15:52:00 +02:00
lemmatizer.py	Add Spanish rule-based lemmatizer (#6833 )	2021-01-27 19:21:35 +08:00
lex_attrs.py	Merge branch 'develop' into master-tmp	2020-05-21 18:39:06 +02:00
punctuation.py	Remove unicode declarations and tidy up	2020-06-21 22:34:10 +02:00
stop_words.py	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
syntax_iterators.py	Tidy up and auto-format	2021-01-15 11:57:36 +11:00
tokenizer_exceptions.py	Remove POS, TAG and LEMMA from tokenizer exceptions	2020-07-22 23:09:01 +02:00