spaCy/spacy/tests/pipeline
Paul O'Leary McCann d959603d51
Don't add duplicate patterns all the time in EntityRuler (fix #8216) (#8246)
* Don't add duplicate patterns (fix #8216)

* Refactor EntityRuler init

This simplifies the EntityRuler init code. This is helpful as prep for
allowing the EntityRuler to reset itself.

* Make EntityRuler.clear reset matchers

Includes a new test for this.

* Tidy PhraseMatcher instantiation

Since the attr can be None safely now, the guard if is no longer
required here.

Also renamed the `_validate` attr. Maybe it's not needed?

* Fix NER test

* Add test to make sure patterns aren't increasing

* Move test to regression tests
2021-06-03 09:05:26 +02:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_analysis.py Simplify pipe analysis 2020-08-01 13:40:06 +02:00
test_annotates_on_update.py Add training option to set annotations on update (#7767) 2021-04-26 16:53:53 +02:00
test_attributeruler.py Tidy up and auto-format 2021-01-05 13:41:53 +11:00
test_entity_linker.py consistently use registry as callable 2021-03-02 17:56:28 +01:00
test_entity_ruler.py Don't add duplicate patterns all the time in EntityRuler (fix #8216) (#8246) 2021-06-03 09:05:26 +02:00
test_functions.py Add token_splitter component (#6726) 2021-01-17 19:54:41 +08:00
test_initialize.py Test with default value 2020-09-29 17:00:40 +02:00
test_lemmatizer.py Use morph hash in lemmatizer cache key (#7690) 2021-04-08 13:22:38 +02:00
test_models.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00
test_morphologizer.py Handle unset token.morph in Morphologizer (#6704) 2021-01-15 17:20:10 +01:00
test_pipe_factories.py Fix scoring normalization (#7629) 2021-04-26 16:53:38 +02:00
test_pipe_methods.py Add training option to set annotations on update (#7767) 2021-04-26 16:53:53 +02:00
test_sentencizer.py Refactor Docs.is_ flags (#6044) 2020-09-17 00:14:01 +02:00
test_senter.py adding tests for trained models to ensure predict reproducibility 2020-10-13 21:07:13 +02:00
test_tagger.py Sync missing and misaligned values in Tagger loss (#6689) 2021-01-10 11:30:37 +11:00
test_textcat.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00
test_tok2vec.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00