spaCy/spacy/tests/pipeline
Adriane Boyd ca33e891e2 Extend AttributeRuler functionality
* Add option to initialize with a dict of AttributeRuler patterns

* Instead of silently discarding overlapping matches (the default
behavior for the retokenizer if only the attrs differ), split the
matches into disjoint sets and retokenize each set separately. This
allows, for instance, one pattern to set the POS and another pattern to
set the lemma. (If two matches modify the same attribute, it looks like
the attrs are applied in the order they were added, but it may not be
deterministic?)

* Improve types
2020-07-30 11:17:33 +02:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_analysis.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_attributeruler.py Extend AttributeRuler functionality 2020-07-30 11:17:33 +02:00
test_entity_linker.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_entity_ruler.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_functions.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_morphologizer.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_pipe_factories.py Remove scores list from config and document 2020-07-28 11:22:24 +02:00
test_pipe_methods.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_sentencizer.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_senter.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_simple_ner.py Tidy up and auto-format 2020-06-20 14:15:04 +02:00
test_tagger.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
test_textcat.py Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00