mirror of
https://github.com/explosion/spaCy.git
synced 2024-11-11 04:08:09 +03:00
26446aa728
Move exceptions loading behind a get_tokenizer_exceptions() function for French, instead of loading into the top-level namespace. This cuts import times from 0.6s to 0.2s, at the expense of making the French data a little different from the others (there's no top-level TOKENIZER_EXCEPTIONS variable.) The current solution feels somewhat unsatisfying. |
||
---|---|---|
.. | ||
__init__.py | ||
_tokenizer_exceptions_list.py | ||
language_data.py | ||
punctuation.py | ||
stop_words.py | ||
tokenizer_exceptions.py |