spaCy/spacy/en
ines f9e603903b Rename stop_words.py to word_sets.py and include more sets
NUM_WORDS and ORDINAL_WORDS are currently not used, but the hard-coded
list should be removed from orth.pyx and replaced to use
language-specific functions. This will later allow other languages to
use their own functions to set those flags. (In English, this is easier
because it only needs to be checked against a set – in German for
example, this requires a more complex function, as most number words
are one word.)
2017-03-12 13:58:22 +01:00
..
lemmatizer Add Python-formatted lemmatizer data and rules 2017-03-12 13:58:22 +01:00
__init__.py Add Python-formatted lemmatizer data and rules 2017-03-12 13:58:22 +01:00
download.py Let --data-path be specified when running download.py scripts 2016-11-20 15:48:04 +00:00
language_data.py Rename stop_words.py to word_sets.py and include more sets 2017-03-12 13:58:22 +01:00
morph_rules.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
tag_map.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
tokenizer_exceptions.py Move English time exceptions ("1a.m." etc.) and refactor 2017-03-12 13:58:22 +01:00
word_sets.py Rename stop_words.py to word_sets.py and include more sets 2017-03-12 13:58:22 +01:00