spaCy/spacy/tests/training
Daniël de Kok 8a5814bf2c
Add distillation loop (#12542)
* Add distillation initialization and loop

* Fix up configuration keys

* Add docstring

* Type annotations

* init_nlp_distill -> init_nlp_student

* Do not resolve dot name distill corpus in initialization

(Since we don't use it.)

* student: do not request use of optimizer in student pipe

We apply finish up the updates once in the training loop instead.

Also add the necessary logic to `Language.distill` to mirror
`Language.update`.

* Correctly determine sort key in subdivide_batch

* Fix _distill_loop docstring wrt. stopping condition

* _distill_loop: fix distill_data docstring

Make similar changes in train_while_improving, since it also had
incorrect types and missing type annotations.

* Move `set_{gpu_allocator,seed}_from_config` to spacy.util

* Update Language.update docs for the sgd argument

* Type annotation

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>

---------

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2023-04-21 13:49:40 +02:00
..
__init__.py move tests to correct subdir 2020-09-15 21:40:38 +02:00
test_augmenters.py Preserve missing entity annotation in augmenters (#11540) 2022-09-27 10:16:51 +02:00
test_corpus.py Add spacy.PlainTextCorpusReader.v1 (#12122) 2023-01-26 11:33:22 +01:00
test_logger.py Add ConsoleLogger.v2 (#11214) 2022-08-29 10:23:05 +02:00
test_loop.py Add distillation loop (#12542) 2023-04-21 13:49:40 +02:00
test_new_example.py adding spans to doc_annotation in Example.to_dict (#11261) 2022-08-05 12:26:38 +02:00
test_pretraining.py Tagger: use unnormalized probabilities for inference (#10197) 2022-03-15 14:15:31 +01:00
test_readers.py Revert "disable failing test because Stanford servers are down (#11015)" (#11054) 2022-06-30 11:24:54 +02:00
test_rehearse.py bugfix parser labels (#10797) 2022-05-13 11:41:32 +02:00
test_training.py Fix batching regression (#12094) 2023-01-18 18:28:30 +01:00