mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 18:06:29 +03:00
85778dfcf4
* Add edit tree lemmatizer Co-authored-by: Daniël de Kok <me@danieldk.eu> * Hide edit tree lemmatizer labels * Use relative imports * Switch to single quotes in error message * Type annotation fixes Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com> * Reformat edit_tree_lemmatizer with black * EditTreeLemmatizer.predict: take Iterable Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com> * Validate edit trees during deserialization This change also changes the serialized representation. Rather than mirroring the deep C structure, we use a simple flat union of the match and substitution node types. * Move edit_trees to _edit_tree_internals * Fix invalid edit tree format error message * edit_tree_lemmatizer: remove outdated TODO comment * Rename factory name to trainable_lemmatizer * Ignore type instead of casting truths to List[Union[Ints1d, Floats2d, List[int], List[str]]] for thinc v8.0.14 * Switch to Tagger.v2 * Add documentation for EditTreeLemmatizer * docs: Fix 3.2 -> 3.3 somewhere * trainable_lemmatizer documentation fixes * docs: EditTreeLemmatizer is in edit_tree_lemmatizer.py Co-authored-by: Daniël de Kok <me@danieldk.eu> Co-authored-by: Daniël de Kok <me@github.danieldk.eu> Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com> |
||
---|---|---|
.. | ||
architectures.md | ||
attributeruler.md | ||
cli.md | ||
corpus.md | ||
cython-classes.md | ||
cython-structs.md | ||
cython.md | ||
data-formats.md | ||
dependencymatcher.md | ||
dependencyparser.md | ||
doc.md | ||
docbin.md | ||
edittreelemmatizer.md | ||
entitylinker.md | ||
entityrecognizer.md | ||
entityruler.md | ||
example.md | ||
index.md | ||
kb.md | ||
language.md | ||
legacy.md | ||
lemmatizer.md | ||
lexeme.md | ||
lookups.md | ||
matcher.md | ||
morphologizer.md | ||
morphology.md | ||
phrasematcher.md | ||
pipe.md | ||
pipeline-functions.md | ||
scorer.md | ||
sentencerecognizer.md | ||
sentencizer.md | ||
span.md | ||
spancategorizer.md | ||
spangroup.md | ||
stringstore.md | ||
tagger.md | ||
textcategorizer.md | ||
tok2vec.md | ||
token.md | ||
tokenizer.md | ||
top-level.md | ||
transformer.md | ||
vectors.md | ||
vocab.md |