mirror of
https://github.com/explosion/spaCy.git
synced 2026-01-08 17:51:16 +03:00
Move exceptions loading behind a get_tokenizer_exceptions() function for French, instead of loading into the top-level namespace. This cuts import times from 0.6s to 0.2s, at the expense of making the French data a little different from the others (there's no top-level TOKENIZER_EXCEPTIONS variable.) The current solution feels somewhat unsatisfying. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| _tokenizer_exceptions_list.py | ||
| language_data.py | ||
| punctuation.py | ||
| stop_words.py | ||
| tokenizer_exceptions.py | ||