spaCy/spacy/cli
Adriane Boyd a4b32b9552
Handle missing reference values in scorer (#6286)
* Handle missing reference values in scorer

Handle missing values in reference doc during scoring where it is
possible to detect an unset state for the attribute. If no reference
docs contain annotation, `None` is returned instead of a score. `spacy
evaluate` displays `-` for missing scores and the missing scores are
saved as `None`/`null` in the metrics.

Attributes without unset states:

* `token.head`: relies on `token.dep` to recognize unset values
* `doc.cats`: unable to handle missing annotation

Additional changes:

* add optional `has_annotation` check to `score_scans` to replace
`doc.sents` hack
* update `score_token_attr_per_feat` to handle missing and empty morph
representations
* fix bug in `Doc.has_annotation` for normalization of `IS_SENT_START`
vs. `SENT_START`

* Fix import

* Update return types
2020-11-03 15:47:18 +01:00
..
project Enable commit check via env var 2020-10-05 20:51:15 +02:00
templates TextCat updates and fixes (#6263) 2020-10-18 14:50:41 +02:00
__init__.py Move init labels to init pipeline module 2020-09-29 18:09:33 +02:00
_util.py update output of debug config command 2020-10-06 09:47:23 +02:00
convert.py add utf8 encoding for opening file 2020-10-09 16:03:14 +02:00
debug_config.py Fix small issues, resolve_dot_names and debug model 2020-09-29 20:38:35 +02:00
debug_data.py Tidy up and auto-format 2020-10-03 17:20:18 +02:00
debug_model.py Fix small issues, resolve_dot_names and debug model 2020-09-29 20:38:35 +02:00
download.py Fix download CLI [ci skip] 2020-09-24 14:43:56 +02:00
evaluate.py Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
info.py accuracy -> performance 2020-09-24 14:32:35 +02:00
init_config.py Update config resolution to use new Thinc 2020-09-27 22:21:31 +02:00
init_pipeline.py Fix success message [ci skip] 2020-10-23 16:11:54 +02:00
package.py Use updated run_command 2020-09-20 16:21:43 +02:00
pretrain.py Fix typos and refactor CLI logging 2020-09-28 21:17:10 +02:00
profile.py Update docs links in codebase 2020-09-04 12:58:50 +02:00
train.py TextCat updates and fixes (#6263) 2020-10-18 14:50:41 +02:00
validate.py Update docs links in codebase 2020-09-04 12:58:50 +02:00