spaCy/spacy/tests/serialize
Matthew Honnibal ecb3c4e8f4
Create corpus iterator and batcher from registry during training (#5865)
* Move batchers into their own module (and registry)

* Update CLI

* Update Corpus and batcher

* Update tests

* Update one config

* Merge 'evaluation' block back under [training]

* Import batchers in gold __init__

* Fix batchers

* Update config

* Update schema

* Update util

* Don't assume train and dev are actually paths

* Update onto-joint config

* Fix missing import

* Format

* Format

* Update spacy/gold/corpus.py

Co-authored-by: Ines Montani <ines@ines.io>

* Fix name

* Update default config

* Fix get_length option in batchers

* Update test

* Add comment

* Pass path into Corpus

* Update docstring

* Update schema and configs

* Update config

* Fix test

* Fix paths

* Fix print

* Fix create_train_batches

* [training.read_train] -> [training.train_corpus]

* Update onto-joint config

Co-authored-by: Ines Montani <ines@ines.io>
2020-08-04 15:09:37 +02:00
__init__.py                        Revert #4334                                                              2019-09-29 17:32:12 +02:00
test_serialize_config.py           Create corpus iterator and batcher from registry during training (#5865)  2020-08-04 15:09:37 +02:00
test_serialize_doc.py              Remove dead and/or deprecated code (#5710)                                2020-07-06 13:06:25 +02:00
test_serialize_extension_attrs.py  Merge branch 'master' into develop                                        2020-02-18 14:47:23 +01:00
test_serialize_kb.py               Default empty KB in EL component (#5872)                                  2020-08-04 14:34:09 +02:00
test_serialize_language.py         Remove dead and/or deprecated code (#5710)                                2020-07-06 13:06:25 +02:00
test_serialize_pipeline.py         Tidy up, autoformat, add types                                            2020-07-25 15:01:15 +02:00
test_serialize_tokenizer.py        Refactor pipeline components, config and language data (#5759)            2020-07-22 13:42:59 +02:00
test_serialize_vocab_strings.py    Test suite clean up (#5781)                                               2020-07-20 14:49:54 +02:00