spaCy/spacy/tests/doc
Matthew Honnibal f277bfdf0f
Add SpanGroup and Graph container types to represent arbitrary annotations (#6696)
* Draft out initial Spans data structure

* Initial span group commit

* Basic span group support on Doc

* Basic test for span group

* Compile span_group.pyx

* Draft addition of SpanGroup to DocBin

* Add deserialization for SpanGroup

* Add tests for serializing SpanGroup

* Fix serialization of SpanGroup

* Add EdgeC and GraphC structs

* Add draft Graph data structure

* Compile graph

* More work on Graph

* Update GraphC

* Upd graph

* Fix walk functions

* Let Graph take nodes and edges on construction

* Fix walking and getting

* Add graph tests

* Fix import

* Add module with the SpanGroups dict thingy

* Update test

* Rename 'span_groups' attribute

* Try to fix c++11 compilation

* Fix test

* Update DocBin

* Try to fix compilation

* Try to fix graph

* Improve SpanGroup docstrings

* Add doc.spans to documentation

* Fix serialization

* Tidy up and add docs

* Update docs [ci skip]

* Add SpanGroup.has_overlap

* WIP updated Graph API

* Start testing new Graph API

* Update Graph tests

* Update Graph

* Add docstring

Co-authored-by: Ines Montani <ines@ines.io>
2021-01-14 17:30:41 +11:00
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_add_entities.py begin_training -> initialize 2020-09-28 21:35:09 +02:00
test_array.py Refactor Token morph setting (#6175) 2020-10-01 22:21:46 +02:00
test_creation.py Add Lemmatizer and simplify related components (#5848) 2020-08-07 15:27:13 +02:00
test_doc_api.py Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
test_graph.py Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
test_morphanalysis.py Tidy up and auto-format 2020-10-03 17:20:18 +02:00
test_pickle_doc.py Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
test_retokenize_merge.py Set default lemmas in retokenizer (#6667) 2021-01-06 12:29:44 +08:00
test_retokenize_split.py Set default lemmas in retokenizer (#6667) 2021-01-06 12:29:44 +08:00
test_span.py Merge branch 'develop' into master-tmp 2020-10-04 14:52:20 +02:00
test_to_json.py Fix morph in Doc.to_json 2020-10-08 14:44:35 +02:00
test_token_api.py Tidy up tests and docs 2020-09-21 20:43:54 +02:00
test_underscore.py Merge branch 'master' into tmp/sync 2020-03-26 13:38:14 +01:00