spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-01-22 00:04:20 +03:00

Sofie 66016ac289 Batch UD evaluation script (#3174)	2019-01-27 06:01:02 +01:00
* running UD eval
* printing timing of tokenizer: tokens per second
* timing of default English model
* structured output and parameterization to compare different runs
* additional flag to allow evaluation without parsing info
* printing verbose log of errors for manual inspection
* printing over- and undersegmented cases (and combo's)
* add under and oversegmented numbers to Score and structured output
* print high-freq over/under segmented words and word shapes
* printing examples as part of the structured output
* print the results to file
* batch run of different models and treebanks per language
* cleaning up code
* commandline script to process all languages in spaCy & UD
* heuristic to remove blinded corpora and option to run one single best per language
* pathlib instead of os for file paths
__init__.py	💫 New JSON helpers, training data internals & CLI rewrite (#2932)	2018-11-30 20:16:14 +01:00
conll17_ud_eval.py	Batch UD evaluation script (#3174)	2019-01-27 06:01:02 +01:00
run_eval.py	Batch UD evaluation script (#3174)	2019-01-27 06:01:02 +01:00
ud_run_test.py	Remove unused cytoolz / itertools imports	2018-12-03 02:12:07 +01:00
ud_train.py	Remove unused cytoolz / itertools imports	2018-12-03 02:12:07 +01:00