mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 01:46:28 +03:00
66016ac289
* running UD eval * printing timing of tokenizer: tokens per second * timing of default English model * structured output and parameterization to compare different runs * additional flag to allow evaluation without parsing info * printing verbose log of errors for manual inspection * printing over- and undersegmented cases (and combo's) * add under and oversegmented numbers to Score and structured output * print high-freq over/under segmented words and word shapes * printing examples as part of the structured output * print the results to file * batch run of different models and treebanks per language * cleaning up code * commandline script to process all languages in spaCy & UD * heuristic to remove blinded corpora and option to run one single best per language * pathlib instead of os for file paths |
||
---|---|---|
.. | ||
converters | ||
schemas | ||
ud | ||
__init__.py | ||
convert.py | ||
debug_data.py | ||
download.py | ||
evaluate.py | ||
info.py | ||
init_model.py | ||
link.py | ||
package.py | ||
pretrain.py | ||
profile.py | ||
train.py | ||
validate.py |