spaCy/spacy/pipeline
Adriane Boyd ca33e891e2 Extend AttributeRuler functionality
* Add option to initialize with a dict of AttributeRuler patterns

* Instead of silently discarding overlapping matches (the default
behavior for the retokenizer if only the attrs differ), split the
matches into disjoint sets and retokenize each set separately. This
allows, for instance, one pattern to set the POS and another pattern to
set the lemma. (If two matches modify the same attribute, it looks like
the attrs are applied in the order they were added, but it may not be
deterministic?)

* Improve types
2020-07-30 11:17:33 +02:00
..
__init__.py Add AttributeRuler for token attribute exceptions 2020-07-30 09:10:59 +02:00
attributeruler.py Extend AttributeRuler functionality 2020-07-30 11:17:33 +02:00
dep_parser.pyx Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00
entity_linker.py Update docstrings, docs and pipe consistency 2020-07-28 13:37:31 +02:00
entityruler.py Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00
functions.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
morphologizer.pyx Update docstrings, docs and pipe consistency 2020-07-28 13:37:31 +02:00
multitask.pyx Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
ner.pyx Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00
pipe.pyx Remove unused methods 2020-07-28 16:50:02 +02:00
sentencizer.pyx Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00
senter.pyx Merge branch 'develop' into feature/component-scores 2020-07-27 18:14:39 +02:00
simple_ner.py Add missing Scorer imports 2020-07-27 15:08:51 +02:00
tagger.pyx Update docstrings, docs and pipe consistency 2020-07-28 13:37:31 +02:00
textcat.py Update docstrings, docs and pipe consistency 2020-07-28 13:37:31 +02:00
tok2vec.py Update docstrings, docs and pipe consistency 2020-07-28 13:37:31 +02:00