spaCy/spacy/tests/pipeline
Sofie Van Landeghem e48a09df4e Example class for training data (#4543)
* OrigAnnot class instead of gold.orig_annot list of zipped tuples

* from_orig to replace from_annot_tuples

* rename to RawAnnot

* some unit tests for GoldParse creation and internal format

* removing orig_annot and switching to lists instead of tuples

* rewriting tuples to use RawAnnot (+ debug statements, WIP)

* fix pop() changing the data

* small fixes

* pop-append fixes

* return RawAnnot for existing GoldParse to have uniform interface

* clean up imports

* fix merge_sents

* add unit test for 4402 with new structure (not working yet)

* introduce DocAnnot

* typo fixes

* add unit test for merge_sents

* rename from_orig to from_raw

* fixing unit tests

* fix nn parser

* read_annots to produce text, doc_annot pairs

* _make_golds fix

* rename golds_to_gold_annots

* small fixes

* fix encoding

* have golds_to_gold_annots use DocAnnot

* missed a spot

* merge_sents as function in DocAnnot

* allow specifying only part of the token-level annotations

* refactor with Example class + underlying dicts

* pipeline components to work with Example objects (wip)

* input checking

* fix yielding

* fix calls to update

* small fixes

* fix scorer unit test with new format

* fix kwargs order

* fixes for ud and conllu scripts

* fix reading data for conllu script

* add in proper errors (not fixed numbering yet to avoid merge conflicts)

* fixing a few more small bugs

* fix EL script
2019-11-11 17:35:27 +01:00
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_analysis.py Fix pipeline analysis on remove pipe (#4557) 2019-10-30 19:04:17 +01:00
test_entity_linker.py Explicitly catch warning in test 2019-10-24 16:16:27 +02:00
test_entity_ruler.py Revert #4334 2019-09-29 17:32:12 +02:00
test_factories.py Revert #4334 2019-09-29 17:32:12 +02:00
test_functions.py Auto-format [ci skip] 2019-10-30 19:27:18 +01:00
test_pipe_methods.py Also support passing list to Language.disable_pipes (#4521) 2019-10-25 16:19:08 +02:00
test_sentencizer.py Extend unicode character block for Sinhala (#4378) 2019-10-07 13:17:03 +02:00
test_textcat.py Example class for training data (#4543) 2019-11-11 17:35:27 +01:00