mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-25 17:36:30 +03:00
3711af74e5
* Add tokenizer option to allow Matcher handling for all rules
Add tokenizer option `with_faster_rules_heuristics` that determines
whether the special cases applied by the internal `Matcher` are filtered
by whether they contain affixes or space. If `True` (default), the rules
are filtered to prioritize speed over rare edge cases. If `False`, all
rules are included in the final `Matcher`-based pass over the doc.
* Reset all caches when reloading special cases
* Revert "Reset all caches when reloading special cases"
This reverts commit
|
||
---|---|---|
.. | ||
__init__.py | ||
test_resource_warning.py | ||
test_serialize_config.py | ||
test_serialize_doc.py | ||
test_serialize_docbin.py | ||
test_serialize_extension_attrs.py | ||
test_serialize_kb.py | ||
test_serialize_language.py | ||
test_serialize_pipeline.py | ||
test_serialize_tokenizer.py | ||
test_serialize_vocab_strings.py |