spaCy/spacy/training
Adriane Boyd bdb485cc80
Add callback to copy vocab/tokenizer from model (#7750)
* Add callback to copy vocab/tokenizer from model

Add callback `spacy.copy_from_base_model.v1` to copy the tokenizer
settings and/or vocab (including vectors) from a base model.

* Move spacy.copy_from_base_model.v1 to spacy.training.callbacks

* Add documentation

* Modify to specify model as tokenizer and vocab params
2021-04-22 12:36:50 +02:00
..
converters Fix parser sourcing in NER converter (#7631) 2021-04-08 12:25:03 +02:00
__init__.pxd Renaming gold & annotation_setter (#6042) 2020-09-09 10:31:03 +02:00
__init__.py Add callback to copy vocab/tokenizer from model (#7750) 2021-04-22 12:36:50 +02:00
align.pyx Fix alignment for 1-to-1 tokens and lowercasing (#6476) 2020-12-08 14:25:16 +08:00
alignment.py Replace pytokenizations with internal alignment (#6293) 2020-11-03 16:24:38 +01:00
augment.py Fix lowercase augmentation (#7336) 2021-03-09 14:02:32 +11:00
batchers.py Renaming gold & annotation_setter (#6042) 2020-09-09 10:31:03 +02:00
callbacks.py Add callback to copy vocab/tokenizer from model (#7750) 2021-04-22 12:36:50 +02:00
corpus.py Support large/infinite training corpora (#7208) 2021-04-08 18:08:04 +10:00
example.pxd Make a pre-check to speed up alignment cache (#6139) 2020-09-24 18:13:39 +02:00
example.pyx Update sent_starts in Example.from_dict (#7847) 2021-04-22 11:32:45 +02:00
gold_io.pyx Fix is_sent_start when converting from JSON (fix #7635) (#7655) 2021-04-08 18:24:52 +10:00
initialize.py Support large/infinite training corpora (#7208) 2021-04-08 18:08:04 +10:00
iob_utils.py Merge pull request #6089 from adrianeboyd/feature/doc-ents-v3-2 2020-09-24 14:44:42 +02:00
loggers.py W&B integration: Optional support for dataset and model checkpoint logging and versioning (#7429) 2021-04-01 19:36:23 +02:00
loop.py Support large/infinite training corpora (#7208) 2021-04-08 18:08:04 +10:00
pretrain.py replace "is not" with != 2021-03-18 21:09:11 +01:00