spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-03-07 21:31:30 +03:00

History

Sofie Van Landeghem e48a09df4e Example class for training data (#4543 ) * OrigAnnot class instead of gold.orig_annot list of zipped tuples * from_orig to replace from_annot_tuples * rename to RawAnnot * some unit tests for GoldParse creation and internal format * removing orig_annot and switching to lists instead of tuple * rewriting tuples to use RawAnnot (+ debug statements, WIP) * fix pop() changing the data * small fixes * pop-append fixes * return RawAnnot for existing GoldParse to have uniform interface * clean up imports * fix merge_sents * add unit test for 4402 with new structure (not working yet) * introduce DocAnnot * typo fixes * add unit test for merge_sents * rename from_orig to from_raw * fixing unit tests * fix nn parser * read_annots to produce text, doc_annot pairs * _make_golds fix * rename golds_to_gold_annots * small fixes * fix encoding * have golds_to_gold_annots use DocAnnot * missed a spot * merge_sents as function in DocAnnot * allow specifying only part of the token-level annotations * refactor with Example class + underlying dicts * pipeline components to work with Example objects (wip) * input checking * fix yielding * fix calls to update * small fixes * fix scorer unit test with new format * fix kwargs order * fixes for ud and conllu scripts * fix reading data for conllu script * add in proper errors (not fixed numbering yet to avoid merge conflicts) * fixing few more small bugs * fix EL script		2019-11-11 17:35:27 +01:00
..
ner_example_data	Updates/bugfixes for NER/IOB converters (#4186 )	2019-08-29 12:04:01 +02:00
textcat_example_data	Add textcat to train CLI (#4226 )	2019-09-15 22:31:31 +02:00
conllu.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
ner_multitask_objective.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
pretrain_kb.py	Use consistent spelling	2019-10-02 10:37:39 +02:00
pretrain_textcat.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
rehearsal.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_entity_linker.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_intent_parser.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_ner.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_new_entity_type.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_parser.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_tagger.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
train_textcat.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
training-data.json	Revert training example edit from #4327 (#4403 )	2019-10-10 17:00:26 +02:00
vocab-data.jsonl	Use even smaller examle size	2017-10-30 19:46:45 +01:00