mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 01:46:28 +03:00
f9e603903b
NUM_WORDS and ORDINAL_WORDS are currently not used, but the hard-coded list should be removed from orth.pyx and replaced to use language-specific functions. This will later allow other languages to use their own functions to set those flags. (In English, this is easier because it only needs to be checked against a set – in German for example, this requires a more complex function, as most number words are one word.) |
||
---|---|---|
.. | ||
lemmatizer | ||
__init__.py | ||
download.py | ||
language_data.py | ||
morph_rules.py | ||
tag_map.py | ||
tokenizer_exceptions.py | ||
word_sets.py |