spaCy/spacy/tests/pipeline
Paul O'Leary McCann ad026dc5fd Don't add duplicate patterns all the time in EntityRuler (fix #8216) (#8246)
* Don't add duplicate patterns (fix #8216)

* Refactor EntityRuler init

This simplifies the EntityRuler init code. This is helpful as prep for
allowing the EntityRuler to reset itself.

* Make EntityRuler.clear reset matchers

Includes a new test for this.

* Tidy PhraseMatcher instantiation

Since the attr can safely be None now, the `if` guard is no longer
required here.

Also renamed the `_validate` attr. Maybe it's not needed?

* Fix NER test

* Add test to make sure patterns aren't increasing

* Move test to regression tests
2021-07-16 15:47:55 +02:00
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_analysis.py Simplify pipe analysis 2020-08-01 13:40:06 +02:00
test_attributeruler.py Tidy up and auto-format 2021-01-05 13:41:53 +11:00
test_entity_linker.py KB & NEL to/from bytes (#8113) 2021-05-20 18:11:30 +10:00
test_entity_ruler.py Don't add duplicate patterns all the time in EntityRuler (fix #8216) (#8246) 2021-07-16 15:47:55 +02:00
test_functions.py Add token_splitter component (#6726) 2021-01-17 19:54:41 +08:00
test_initialize.py Test with default value 2020-09-29 17:00:40 +02:00
test_lemmatizer.py Use morph hash in lemmatizer cache key (#7690) 2021-04-08 13:22:38 +02:00
test_models.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00
test_morphologizer.py Handle unset token.morph in Morphologizer (#6704) 2021-01-15 17:20:10 +01:00
test_pipe_factories.py Fix scoring normalization (#7629) 2021-07-16 15:47:55 +02:00
test_pipe_methods.py fix NEL config and IO, and n_sents functionality (#7100) 2021-02-22 14:49:52 +11:00
test_sentencizer.py Refactor Docs.is_ flags (#6044) 2020-09-17 00:14:01 +02:00
test_senter.py adding tests for trained models to ensure predict reproducibility 2020-10-13 21:07:13 +02:00
test_tagger.py Sync missing and misaligned values in Tagger loss (#6689) 2021-01-10 11:30:37 +11:00
test_textcat.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00
test_tok2vec.py Ensemble textcat with listener (#8012) 2021-05-31 18:21:06 +10:00