Mirror of https://github.com/explosion/spaCy.git (synced 2025-10-24 12:41:23 +03:00)
Add official support for the `DependencyMatcher`. Redesign the pattern
specification. Fix and extend operator implementations. Update API docs
and add usage docs.
Patterns
--------
Refactor pattern structure to:
```
{
"LEFT_ID": str,
"REL_OP": str,
"RIGHT_ID": str,
"RIGHT_ATTRS": dict,
}
```
The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all
subsequent nodes contain all four keys.
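The key layout described above can be sketched as a small structural check. This is only an illustration in plain Python: `check_pattern` is a hypothetical helper, not part of spaCy, and the rule that each `LEFT_ID` must name a node declared earlier is an assumption drawn from the left-to-right construction of patterns.

```python
# Hypothetical helper (not spaCy API): checks the key layout of a
# dependency pattern as described in this commit message.
FIRST_NODE_KEYS = {"RIGHT_ID", "RIGHT_ATTRS"}
OTHER_NODE_KEYS = {"LEFT_ID", "REL_OP", "RIGHT_ID", "RIGHT_ATTRS"}


def check_pattern(pattern):
    """Return a list of problems; an empty list means the layout looks valid."""
    problems = []
    declared_ids = set()
    for index, node in enumerate(pattern):
        expected = FIRST_NODE_KEYS if index == 0 else OTHER_NODE_KEYS
        if set(node) != expected:
            problems.append(f"node {index}: expected keys {sorted(expected)}")
        # Assumption: LEFT_ID must refer to a node declared earlier in the list.
        if index > 0 and node.get("LEFT_ID") not in declared_ids:
            problems.append(f"node {index}: unknown LEFT_ID {node.get('LEFT_ID')!r}")
        if "RIGHT_ID" in node:
            declared_ids.add(node["RIGHT_ID"])
    return problems


# Example pattern: an anchor verb and its nominal subject.
example_pattern = [
    {"RIGHT_ID": "verb", "RIGHT_ATTRS": {"ORTH": "founded"}},
    {
        "LEFT_ID": "verb",
        "REL_OP": ">",
        "RIGHT_ID": "subject",
        "RIGHT_ATTRS": {"DEP": "nsubj"},
    },
]

# A pattern whose first node wrongly carries all four keys.
bad_pattern = [
    {"LEFT_ID": "x", "REL_OP": ">", "RIGHT_ID": "verb", "RIGHT_ATTRS": {}},
]

print(check_pattern(example_pattern))
print(check_pattern(bad_pattern))
```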
New operators
-------------
Because of the way patterns are constructed from left to right, it's
helpful to have `follows` operators along with `precedes` operators. Add
operators for simple precedes / follows alongside immediate precedes /
follows.
* `.*`: precedes
* `;`: immediately follows
* `;*`: follows
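The linear precedence operators can be read as comparisons on token positions. The sketch below is a plain-Python illustration, not spaCy's implementation: `linear_op_holds` is a hypothetical function, `a_i` and `b_i` stand for the token indices of the `LEFT_ID` and `RIGHT_ID` nodes in `A REL_OP B`, and the same-parse restriction mentioned under the operator fixes is left out for brevity.

```python
# Hypothetical illustration (not spaCy's implementation) of the linear
# precedence operators, expressed as comparisons on token indices.
def linear_op_holds(op, a_i, b_i):
    """Return True if `A op B` holds for token indices a_i (A) and b_i (B)."""
    if op == ".":    # A immediately precedes B
        return a_i == b_i - 1
    if op == ".*":   # A precedes B
        return a_i < b_i
    if op == ";":    # A immediately follows B
        return a_i == b_i + 1
    if op == ";*":   # A follows B
        return a_i > b_i
    raise ValueError(f"unsupported operator: {op!r}")


# Token positions in a short sentence: 0 1 2 3 4
for op in (".", ".*", ";", ";*"):
    print(op, linear_op_holds(op, 1, 3), linear_op_holds(op, 3, 2))
```

Because a pattern node can only attach to a node that has already been declared, the `;` and `;*` operators let a later node sit to the left of its anchor without reordering the pattern.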
Operator fixes
--------------
* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`,
all sibling operators)
* Linear precedence operators do not match nodes outside the same parse
Additional fixes
----------------
* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling
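A short usage sketch of the items above, assuming spaCy v3 is installed. The sentence, the hand-written heads and dependency labels, and the pattern name `FOUNDED` are made up for illustration; a real pipeline would get the parse from a trained model.

```python
import pickle

from spacy.matcher import DependencyMatcher
from spacy.tokens import Doc
from spacy.vocab import Vocab

vocab = Vocab()
matcher = DependencyMatcher(vocab)

# Anchor node first (RIGHT_ID and RIGHT_ATTRS only), then a node with all four keys.
pattern = [
    {"RIGHT_ID": "verb", "RIGHT_ATTRS": {"ORTH": "founded"}},
    {
        "LEFT_ID": "verb",
        "REL_OP": ">",  # "verb" is the immediate head of "subject"
        "RIGHT_ID": "subject",
        "RIGHT_ATTRS": {"DEP": "nsubj"},
    },
]
matcher.add("FOUNDED", [pattern])

# Hand-built parse (illustrative): heads are absolute token indices.
doc = Doc(
    vocab,
    words=["Smith", "founded", "a", "company"],
    heads=[1, 1, 3, 1],
    deps=["nsubj", "ROOT", "det", "dobj"],
)

# Each match is (match_id, token_ids), with token_ids in pattern-node order.
matches = matcher(doc)
for match_id, token_ids in matches:
    print(vocab.strings[match_id], [doc[i].text for i in token_ids])

# `get` returns the callback and the stored patterns for a key.
on_match, stored_patterns = matcher.get("FOUNDED")

# Round-trip through pickle, then remove the key from the original matcher.
restored = pickle.loads(pickle.dumps(matcher))
matcher.remove("FOUNDED")
print("FOUNDED" in matcher, "FOUNDED" in restored)
```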
| Name | ||
|---|---|---|
| architecture.svg | ||
| cli_init_fill-config_diff.jpg | ||
| course.jpg | ||
| dep-match-diagram.svg | ||
| displacy_jupyter.jpg | ||
| displacy-compact.svg | ||
| displacy-custom-parser.svg | ||
| displacy-dep-founded.html | ||
| displacy-ent-custom.html | ||
| displacy-ent-snek.html | ||
| displacy-ent1.html | ||
| displacy-ent2.html | ||
| displacy-long.html | ||
| displacy-long2.html | ||
| displacy-model-rules.svg | ||
| displacy-model-rules2.svg | ||
| displacy-small.svg | ||
| displacy.svg | ||
| layers-architectures.svg | ||
| matcher-demo.jpg | ||
| pipeline_transformer.svg | ||
| pipeline.svg | ||
| prodigy.jpg | ||
| project_document.jpg | ||
| projects.svg | ||
| sense2vec.jpg | ||
| spacy-streamlit.png | ||
| thinc_mypy.jpg | ||
| tok2vec-listener.svg | ||
| tok2vec.svg | ||
| tokenization.svg | ||
| training.svg | ||
| vocab_stringstore.svg | ||