mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 18:06:29 +03:00
ca33e891e2
* Add option to initialize with a dict of AttributeRuler patterns * Instead of silently discarding overlapping matches (the default behavior for the retokenizer if only the attrs differ), split the matches into disjoint sets and retokenize each set separately. This allows, for instance, one pattern to set the POS and another pattern to set the lemma. (If two matches modify the same attribute, it looks like the attrs are applied in the order they were added, but it may not be deterministic?) * Improve types |
||
---|---|---|
.. | ||
__init__.py | ||
attributeruler.py | ||
dep_parser.pyx | ||
entity_linker.py | ||
entityruler.py | ||
functions.py | ||
morphologizer.pyx | ||
multitask.pyx | ||
ner.pyx | ||
pipe.pyx | ||
sentencizer.pyx | ||
senter.pyx | ||
simple_ner.py | ||
tagger.pyx | ||
textcat.py | ||
tok2vec.py |