spaCy/spacy/cli/ud
Sofie 66016ac289 Batch UD evaluation script (#3174)
* running UD eval

* printing timing of tokenizer: tokens per second

* timing of default English model

* structured output and parameterization to compare different runs

* additional flag to allow evaluation without parsing info

* printing verbose log of errors for manual inspection

* printing over- and undersegmented cases (and combos)

* add under and oversegmented numbers to Score and structured output

* print high-freq over/under segmented words and word shapes

* printing examples as part of the structured output

* print the results to file

* batch run of different models and treebanks per language

* cleaning up code

* command-line script to process all languages in spaCy & UD

* heuristic to remove blinded corpora and option to run one single best per language

* pathlib instead of os for file paths
2019-01-27 06:01:02 +01:00
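The commit message above mentions tokenizer timing in tokens per second and counts of over- and undersegmented cases. The following is a minimal sketch of how such measurements could be computed. It is not the code in run_eval.py: the function names (`tokens_per_second`, `segmentation_errors`, `split_by`) are hypothetical, and the definition used here, where a token counts as oversegmented only if the other tokenization splits it cleanly into several pieces, is an assumption made for illustration.

```python
import time
from typing import Callable, Iterable, List, Set, Tuple


def tokens_per_second(tokenize: Callable[[str], List[str]], texts: Iterable[str]) -> float:
    """Hypothetical helper: tokenizer throughput in tokens per second."""
    n_tokens = 0
    start = time.perf_counter()
    for text in texts:
        n_tokens += len(tokenize(text))
    elapsed = time.perf_counter() - start
    # Guard against a zero elapsed time on very small inputs.
    return n_tokens / elapsed if elapsed > 0 else float("inf")


def _spans_and_cuts(tokens: List[str]) -> Tuple[List[Tuple[int, int]], Set[int]]:
    """Character spans and cut points, with offsets counted without whitespace."""
    spans, cuts, pos = [], {0}, 0
    for tok in tokens:
        spans.append((pos, pos + len(tok)))
        pos += len(tok)
        cuts.add(pos)
    return spans, cuts


def split_by(tokens_a: List[str], tokens_b: List[str]) -> List[str]:
    """Tokens of `tokens_a` that `tokens_b` cleanly splits into several pieces."""
    spans_a, _ = _spans_and_cuts(tokens_a)
    _, cuts_b = _spans_and_cuts(tokens_b)
    return [
        tok
        for tok, (start, end) in zip(tokens_a, spans_a)
        # Both edges must be cut points in b, with at least one cut strictly inside.
        if start in cuts_b and end in cuts_b and any(start < c < end for c in cuts_b)
    ]


def segmentation_errors(gold: List[str], pred: List[str]) -> Tuple[List[str], List[str]]:
    """Return (oversegmented gold tokens, undersegmented system tokens)."""
    if "".join(gold) != "".join(pred):
        raise ValueError("gold and predicted tokens must cover the same characters")
    over = split_by(gold, pred)   # gold tokens the system split apart
    under = split_by(pred, gold)  # system tokens that merge several gold tokens
    return over, under
```

With gold tokens `["don't", "stop", "now", "."]` and system tokens `["do", "n't", "stop", "now."]`, this sketch reports `don't` as oversegmented and `now.` as undersegmented. Any callable that maps a string to a list of tokens can be passed to `tokens_per_second`, for example `str.split`.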
__init__.py 💫 New JSON helpers, training data internals & CLI rewrite (#2932) 2018-11-30 20:16:14 +01:00
conll17_ud_eval.py Batch UD evaluation script (#3174) 2019-01-27 06:01:02 +01:00
run_eval.py Batch UD evaluation script (#3174) 2019-01-27 06:01:02 +01:00
ud_run_test.py Remove unused cytoolz / itertools imports 2018-12-03 02:12:07 +01:00
ud_train.py Remove unused cytoolz / itertools imports 2018-12-03 02:12:07 +01:00