spaCy/spacy/tests/serialize
Adriane Boyd 3711af74e5
Add tokenizer option to allow Matcher handling for all rules (#10452)
* Add tokenizer option to allow Matcher handling for all rules

Add tokenizer option `with_faster_rules_heuristics` that determines
whether the special cases applied by the internal `Matcher` are filtered
by whether they contain affixes or space. If `True` (default), the rules
are filtered to prioritize speed over rare edge cases. If `False`, all
rules are included in the final `Matcher`-based pass over the doc.

* Reset all caches when reloading special cases

* Revert "Reset all caches when reloading special cases"

This reverts commit 4ef6bd171d.

* Initialize max_length properly

* Add new tag to API docs

* Rename to faster heuristics
2022-03-24 13:21:32 +01:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_resource_warning.py Tidy up tests 2020-10-15 10:20:21 +02:00
test_serialize_config.py Tagger: use unnormalized probabilities for inference (#10197) 2022-03-15 14:15:31 +01:00
test_serialize_doc.py Migrate regression tests into the main test suite (#9655) 2021-12-04 20:34:48 +01:00
test_serialize_docbin.py Migrate regression tests into the main test suite (#9655) 2021-12-04 20:34:48 +01:00
test_serialize_extension_attrs.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
test_serialize_kb.py Update flake8 version in reqs and CI 2021-06-28 11:29:36 +02:00
test_serialize_language.py Tagger: use unnormalized probabilities for inference (#10197) 2022-03-15 14:15:31 +01:00
test_serialize_pipeline.py Migrate regression tests into the main test suite (#9655) 2021-12-04 20:34:48 +01:00
test_serialize_tokenizer.py Add tokenizer option to allow Matcher handling for all rules (#10452) 2022-03-24 13:21:32 +01:00
test_serialize_vocab_strings.py Migrate regression tests into the main test suite (#9655) 2021-12-04 20:34:48 +01:00