spaCy/website/docs/api
Adriane Boyd a4b32b9552
Handle missing reference values in scorer (#6286)
* Handle missing reference values in scorer

Handle missing values in reference doc during scoring where it is
possible to detect an unset state for the attribute. If no reference
docs contain annotation, `None` is returned instead of a score. `spacy
evaluate` displays `-` for missing scores and the missing scores are
saved as `None`/`null` in the metrics.

Attributes without unset states:

* `token.head`: relies on `token.dep` to recognize unset values
* `doc.cats`: unable to handle missing annotation

Additional changes:

* add optional `has_annotation` check to `score_scans` to replace
`doc.sents` hack
* update `score_token_attr_per_feat` to handle missing and empty morph
representations
* fix bug in `Doc.has_annotation` for normalization of `IS_SENT_START`
vs. `SENT_START`

* Fix import

* Update return types
2020-11-03 15:47:18 +01:00
..
architectures.md TextCat updates and fixes (#6263) 2020-10-18 14:50:41 +02:00
attributeruler.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
cli.md Update docs [ci skip] 2020-10-09 12:04:52 +02:00
corpus.md Integrate file readers 2020-10-02 01:36:06 +02:00
cython-classes.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython-structs.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython.md Update docs [ci skip] 2020-09-12 17:05:10 +02:00
data-formats.md Update docs [ci skip] 2020-10-06 10:31:48 +02:00
dependencymatcher.md doc fixes 2020-09-12 17:38:54 +02:00
dependencyparser.md Update docs 2020-10-03 16:08:24 +02:00
doc.md Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
docbin.md small UX fix for DocBin (#6167) 2020-10-02 15:43:32 +02:00
entitylinker.md Update docs [ci skip] 2020-10-13 11:38:52 +02:00
entityrecognizer.md Update docs 2020-10-03 16:08:24 +02:00
entityruler.md Update docs [ci skip] 2020-10-06 10:31:48 +02:00
example.md Proofreading 2020-09-24 13:15:28 +02:00
index.md Update v3 docs 2020-07-03 16:48:21 +02:00
kb.md Define candidate generator in EL config (#5876) 2020-08-18 16:10:36 +02:00
language.md set_kb method for entity_linker 2020-10-08 10:34:01 +02:00
lemmatizer.md Fix Lemmatizer.get_lookups_config 2020-10-03 17:16:10 +02:00
lexeme.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
lookups.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
matcher.md Merge pull request #6135 from walterhenry/develop-proof 2020-09-29 20:49:06 +02:00
morphologizer.md Update docs 2020-10-03 16:08:24 +02:00
morphology.md Proofreading 2020-09-24 13:15:28 +02:00
phrasematcher.md doc fixes 2020-09-12 17:38:54 +02:00
pipe.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
pipeline-functions.md Proofreading 2020-09-28 16:50:15 +02:00
scorer.md Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
sentencerecognizer.md Merge branch 'develop' into feature/prepare 2020-09-29 20:53:05 +02:00
sentencizer.md Update docs [ci skip] 2020-10-09 10:36:06 +02:00
span.md Proofreading 2020-09-28 16:50:15 +02:00
stringstore.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
tagger.md Update docs 2020-10-03 16:08:24 +02:00
textcategorizer.md Fix label initialization of textcat component (#6190) 2020-10-03 17:07:38 +02:00
tok2vec.md Merge branch 'develop' into feature/prepare 2020-09-29 20:53:05 +02:00
token.md Update docs [ci skip] 2020-10-02 13:24:33 +02:00
tokenizer.md Update docs [ci skip] 2020-10-02 13:24:33 +02:00
top-level.md Update data augmenters (#6196) 2020-10-04 17:46:29 +02:00
transformer.md Update docs and install extras [ci skip] 2020-10-08 10:58:50 +02:00
vectors.md Proofreading 2020-09-28 16:50:15 +02:00
vocab.md Proofreading 2020-09-28 16:50:15 +02:00