Mirror of https://github.com/explosion/spaCy.git (synced 2025-10-24 12:41:23 +03:00)
* verbose and tag_map options
* adding init_tok2vec option and only changing the tok2vec that is specified
* adding omit_extra_lookups and verifying textcat config
* wip
* pretrain bugfix
* add replace and resume options
* train_textcat fix
* raw text functionality
* improve UX when KeyError or when input data can't be parsed
* avoid unnecessary access to goldparse in TextCat pipe
* save performance information in nlp.meta
* add noise_level to config
* move nn_parser's defaults to config file
* multitask in config - doesn't work yet
* scorer offering both F and AUC options, need to be specified in config
* add textcat verification code from old train script
* small fixes to config files
* clean up
* set default config for ner/parser to allow create_pipe to work as before
* two more test fixes
* small fixes
* cleanup
* fix NER pickling + additional unit test
* create_pipe as before

| Name | | |
|---|---|---|
| ner_example_data | ||
| textcat_example_data | ||
| conllu-config.json | ||
| conllu.py | ||
| create_kb.py | ||
| ner_multitask_objective.py | ||
| rehearsal.py | ||
| train_entity_linker.py | ||
| train_intent_parser.py | ||
| train_morphologizer.py | ||
| train_ner.py | ||
| train_new_entity_type.py | ||
| train_parser.py | ||
| train_tagger.py | ||
| train_textcat_config.cfg | ||
| train_textcat.py | ||
| training-data.json | ||
| vocab-data.jsonl | ||