spaCy/spacy/cli
Sofie 66016ac289 Batch UD evaluation script (#3174)
* running UD eval

* printing timing of tokenizer: tokens per second

* timing of default English model

* structured output and parameterization to compare different runs

* additional flag to allow evaluation without parsing info

* printing verbose log of errors for manual inspection

* printing over- and undersegmented cases (and combo's)

* add under and oversegmented numbers to Score and structured output

* print high-freq over/under segmented words and word shapes

* printing examples as part of the structured output

* print the results to file

* batch run of different models and treebanks per language

* cleaning up code

* commandline script to process all languages in spaCy & UD

* heuristic to remove blinded corpora and option to run one single best per language

* pathlib instead of os for file paths
2019-01-27 06:01:02 +01:00
..
converters Tidy up merge conflict leftovers 2018-12-18 13:58:30 +01:00
schemas 💫 Replace ujson, msgpack and dill/pickle/cloudpickle with srsly (#3003) 2018-12-03 01:28:22 +01:00
ud Batch UD evaluation script (#3174) 2019-01-27 06:01:02 +01:00
__init__.py 💫 New JSON helpers, training data internals & CLI rewrite (#2932) 2018-11-30 20:16:14 +01:00
convert.py cli.convert: fix typo in converter arguments (#3099) 2018-12-27 18:08:41 +01:00
debug_data.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00
download.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00
evaluate.py spacy.cli.evaluate: fix TypeError (#3101) 2018-12-28 11:14:28 +01:00
info.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00
init_model.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00
link.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00
package.py Set vectors.name when updating meta.json during training (#3100) 2018-12-27 19:55:40 +01:00
pretrain.py Remove unused code from spacy pretrain 2018-12-18 19:19:26 +01:00
profile.py Fix cytoolz import cytoolz 2018-12-06 16:04:12 +01:00
train.py Set vectors.name when updating meta.json during training (#3100) 2018-12-27 19:55:40 +01:00
validate.py Small CLI improvements (#3030) 2018-12-08 11:49:43 +01:00