spaCy/examples/training
Sofie Van Landeghem e48a09df4e Example class for training data (#4543)
* OrigAnnot class instead of gold.orig_annot list of zipped tuples

* from_orig to replace from_annot_tuples

* rename to RawAnnot

* some unit tests for GoldParse creation and internal format

* removing orig_annot and switching to lists instead of tuple

* rewriting tuples to use RawAnnot (+ debug statements, WIP)

* fix pop() changing the data

* small fixes

* pop-append fixes

* return RawAnnot for existing GoldParse to have uniform interface

* clean up imports

* fix merge_sents

* add unit test for 4402 with new structure (not working yet)

* introduce DocAnnot

* typo fixes

* add unit test for merge_sents

* rename from_orig to from_raw

* fixing unit tests

* fix nn parser

* read_annots to produce text, doc_annot pairs

* _make_golds fix

* rename golds_to_gold_annots

* small fixes

* fix encoding

* have golds_to_gold_annots use DocAnnot

* missed a spot

* merge_sents as function in DocAnnot

* allow specifying only part of the token-level annotations

* refactor with Example class + underlying dicts

* pipeline components to work with Example objects (wip)

* input checking

* fix yielding

* fix calls to update

* small fixes

* fix scorer unit test with new format

* fix kwargs order

* fixes for ud and conllu scripts

* fix reading data for conllu script

* add in proper errors (not fixed numbering yet to avoid merge conflicts)

* fixing few more small bugs

* fix EL script
2019-11-11 17:35:27 +01:00
ner_example_data            Updates/bugfixes for NER/IOB converters (#4186)  2019-08-29 12:04:01 +02:00
textcat_example_data        Add textcat to train CLI (#4226)                 2019-09-15 22:31:31 +02:00
conllu.py                   Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
ner_multitask_objective.py  Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
pretrain_kb.py              Use consistent spelling                          2019-10-02 10:37:39 +02:00
pretrain_textcat.py         Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
rehearsal.py                Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_entity_linker.py      Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_intent_parser.py      Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_ner.py                Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_new_entity_type.py    Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_parser.py             Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_tagger.py             Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
train_textcat.py            Example class for training data (#4543)          2019-11-11 17:35:27 +01:00
training-data.json          Revert training example edit from #4327 (#4403)  2019-10-10 17:00:26 +02:00
vocab-data.jsonl            Use even smaller example size                    2017-10-30 19:46:45 +01:00