spaCy/spacy/pipeline
Matthew Honnibal 6f5e308d17
Support negative examples in partial NER annotations (#8106)
* Support a cfg field in transition system

* Make NER 'has gold' check use right alignment for span

* Pass 'negative_samples_key' property into NER transition system

* Add field for negative samples to NER transition system

* Check neg_key in NER has_gold

* Support negative examples in NER oracle

* Test for negative examples in NER

* Fix name of config variable in NER

* Remove vestiges of old-style partial annotation

* Remove obsolete tests

* Add comment noting lack of support for negative samples in parser

* Additions to "neg examples" PR (#8201)

* add custom error and test for deprecated format

* add test for unlearning an entity

* add break also for Begin's cost

* add negative_samples_key property on Parser

* rename

* extend docs & fix some older docs issues

* add subclass constructors, clean up tests, fix docs

* add flaky test with ValueError if gold parse was not found

* remove ValueError if n_gold == 0

* fix docstring

* Hack in environment variables to try out training

* Remove hack

* Remove NER hack, and support 'negative O' samples

* Fix O oracle

* Fix transition parser

* Remove 'not O' from oracle

* Fix NER oracle

* check for spans in both gold.ents and gold.spans and raise if so, to prevent memory access violation

* use set instead of list in consistency check

Co-authored-by: svlandeg <sofie.vanlandeghem@gmail.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-06-17 17:33:00 +10:00
..
_parser_internals Support negative examples in partial NER annotations (#8106) 2021-06-17 17:33:00 +10:00
__init__.py multi-label textcat component (#6474) 2021-01-06 13:07:14 +11:00
attributeruler.py Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
dep_parser.pyx Support negative examples in partial NER annotations (#8106) 2021-06-17 17:33:00 +10:00
entity_linker.py Merge remote-tracking branch 'upstream/develop' into chore/develop-into-master-v3.1 2021-06-15 15:05:17 +02:00
entityruler.py Merge remote-tracking branch 'upstream/develop' into chore/develop-into-master-v3.1 2021-06-15 15:05:17 +02:00
functions.py Tidy up and auto-format 2021-02-13 12:55:56 +11:00
lemmatizer.py Use warnings.warn instead of logger.warning 2021-06-04 17:44:08 +02:00
morphologizer.pyx Replace negative rows with 0 in StaticVectors (#7674) 2021-04-22 18:04:15 +10:00
multitask.pyx Replace negative rows with 0 in StaticVectors (#7674) 2021-04-22 18:04:15 +10:00
ner.pyx Support negative examples in partial NER annotations (#8106) 2021-06-17 17:33:00 +10:00
pipe.pxd TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
pipe.pyx Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
sentencizer.pyx Re-refactor Sentencizer with Pipe API (#7176) 2021-02-26 09:48:14 +01:00
senter.pyx Replace negative rows with 0 in StaticVectors (#7674) 2021-04-22 18:04:15 +10:00
tagger.pyx Replace negative rows with 0 in StaticVectors (#7674) 2021-04-22 18:04:15 +10:00
textcat_multilabel.py Resizable textcat (#7862) 2021-06-16 11:45:00 +02:00
textcat.py Resizable textcat (#7862) 2021-06-16 11:45:00 +02:00
tok2vec.py Ensemble textcat with listener (#8012) 2021-05-31 18:21:06 +10:00
trainable_pipe.pxd Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
trainable_pipe.pyx Resizable textcat (#7862) 2021-06-16 11:45:00 +02:00
transition_parser.pxd TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
transition_parser.pyx Support negative examples in partial NER annotations (#8106) 2021-06-17 17:33:00 +10:00