spaCy/website/docs/api
Matthew Honnibal f277bfdf0f
Add SpanGroup and Graph container types to represent arbitrary annotations (#6696)
* Draft out initial Spans data structure

* Initial span group commit

* Basic span group support on Doc

* Basic test for span group

* Compile span_group.pyx

* Draft addition of SpanGroup to DocBin

* Add deserialization for SpanGroup

* Add tests for serializing SpanGroup

* Fix serialization of SpanGroup

* Add EdgeC and GraphC structs

* Add draft Graph data structure

* Compile graph

* More work on Graph

* Update GraphC

* Upd graph

* Fix walk functions

* Let Graph take nodes and edges on construction

* Fix walking and getting

* Add graph tests

* Fix import

* Add module with the SpanGroups dict thingy

* Update test

* Rename 'span_groups' attribute

* Try to fix c++11 compilation

* Fix test

* Update DocBin

* Try to fix compilation

* Try to fix graph

* Improve SpanGroup docstrings

* Add doc.spans to documentation

* Fix serialization

* Tidy up and add docs

* Update docs [ci skip]

* Add SpanGroup.has_overlap

* WIP updated Graph API

* Start testing new Graph API

* Update Graph tests

* Update Graph

* Add docstring

Co-authored-by: Ines Montani <ines@ines.io>
2021-01-14 17:30:41 +11:00
architectures.md Fix types of Tok2Vec encoding architectures (#6442) 2021-01-07 16:39:27 +11:00
attributeruler.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
cli.md Merge pull request #6647 from svlandeg/feature/init_config_overwrite 2021-01-05 14:59:04 +11:00
corpus.md Integrate file readers 2020-10-02 01:36:06 +02:00
cython-classes.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython-structs.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython.md Update docs [ci skip] 2020-09-12 17:05:10 +02:00
data-formats.md multi-label textcat component (#6474) 2021-01-06 13:07:14 +11:00
dependencymatcher.md doc fixes 2020-09-12 17:38:54 +02:00
dependencyparser.md Update docs 2020-10-03 16:08:24 +02:00
doc.md Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
docbin.md small UX fix for DocBin (#6167) 2020-10-02 15:43:32 +02:00
entitylinker.md Update docs [ci skip] 2020-10-13 11:38:52 +02:00
entityrecognizer.md Update docs 2020-10-03 16:08:24 +02:00
entityruler.md Update docs [ci skip] 2020-10-06 10:31:48 +02:00
example.md Proofreading 2020-09-24 13:15:28 +02:00
index.md Update v3 docs 2020-07-03 16:48:21 +02:00
kb.md Define candidate generator in EL config (#5876) 2020-08-18 16:10:36 +02:00
language.md Format 2020-12-09 12:44:01 +01:00
lemmatizer.md Fix Lemmatizer.get_lookups_config 2020-10-03 17:16:10 +02:00
lexeme.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
lookups.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
matcher.md Update matcher.md 2020-12-09 11:09:45 +11:00
morphologizer.md remove labels from morphologizer constructor 2020-11-11 21:48:50 +01:00
morphology.md Proofreading 2020-09-24 13:15:28 +02:00
multilabel_textcategorizer.md multi-label textcat component (#6474) 2021-01-06 13:07:14 +11:00
phrasematcher.md doc fixes 2020-09-12 17:38:54 +02:00
pipe.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
pipeline-functions.md Proofreading 2020-09-28 16:50:15 +02:00
scorer.md Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
sentencerecognizer.md Merge branch 'develop' into feature/prepare 2020-09-29 20:53:05 +02:00
sentencizer.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
span.md Proofreading 2020-09-28 16:50:15 +02:00
spangroup.md Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
stringstore.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
tagger.md remove set_morphology from docs 2020-11-11 21:32:34 +01:00
textcategorizer.md multi-label textcat component (#6474) 2021-01-06 13:07:14 +11:00
tok2vec.md Merge branch 'develop' into feature/prepare 2020-09-29 20:53:05 +02:00
token.md Update docs [ci skip] 2020-10-02 13:24:33 +02:00
tokenizer.md Update docs [ci skip] 2020-10-02 13:24:33 +02:00
top-level.md require_cpu functionality (#6336) 2020-12-08 14:42:40 +08:00
transformer.md fix typo in transformer docs (#6404) 2020-11-19 14:11:38 +01:00
vectors.md Proofreading 2020-09-28 16:50:15 +02:00
vocab.md Proofreading 2020-09-28 16:50:15 +02:00