spaCy/examples/training
Sofie Van Landeghem 0ba1b5eebc CLI scripts for entity linking (wikipedia & generic) (#4091)
* document token ent_kb_id

* document span kb_id

* update pipeline documentation

* prior and context weights as bool's instead

* entitylinker api documentation

* drop for both models

* finish entitylinker documentation

* small fixes

* documentation for KB

* candidate documentation

* links to api pages in code

* small fix

* frequency examples as counts for consistency

* consistent documentation about tensors returned by predict

* add entity linking to usage 101

* add entity linking infobox and KB section to 101

* entity-linking in linguistic features

* small typo corrections

* training example and docs for entity_linker

* predefined nlp and kb

* revert back to similarity encodings for simplicity (for now)

* set prior probabilities to 0 when excluded

* code clean up

* bugfix: deleting kb ID from tokens when entities were removed

* refactor train el example to use either model or vocab

* pretrain_kb example for example kb generation

* add to training docs for KB + EL example scripts

* small fixes

* error numbering

* ensure the language of vocab and nlp stay consistent across serialization

* equality with =

* avoid conflict in errors file

* add error 151

* final adjustements to the train scripts - consistency

* update of goldparse documentation

* small corrections

* push commit

* turn kb_creator into CLI script (wip)

* proper parameters for training entity vectors

* wikidata pipeline split up into two executable scripts

* remove context_width

* move wikidata scripts in bin directory, remove old dummy script

* refine KB script with logs and preprocessing options

* small edits

* small improvements to logging of EL CLI script
2019-08-13 15:38:59 +02:00
..
conllu.py Remove unused cytoolz / itertools imports 2018-12-03 02:12:07 +01:00
ner_multitask_objective.py Auto-format examples 2018-12-02 04:26:26 +01:00
pretrain_kb.py CLI scripts for entity linking (wikipedia & generic) (#4091) 2019-08-13 15:38:59 +02:00
pretrain_textcat.py Auto-format examples 2018-12-02 04:26:26 +01:00
rehearsal.py Update rehearsal example 2019-02-24 16:17:41 +01:00
train_entity_linker.py CLI scripts for entity linking (wikipedia & generic) (#4091) 2019-08-13 15:38:59 +02:00
train_intent_parser.py Auto-format examples 2018-12-02 04:26:26 +01:00
train_ner.py Test and update examples [ci skip] 2019-03-16 14:15:49 +01:00
train_new_entity_type.py Update compatibility [ci skip] 2019-04-01 16:25:16 +02:00
train_parser.py Test and update examples [ci skip] 2019-03-16 14:15:49 +01:00
train_tagger.py Test and update examples [ci skip] 2019-03-16 14:15:49 +01:00
train_textcat.py Bug fixes and options for TextCategorizer (#3472) 2019-03-23 16:44:44 +01:00
training-data.json Update Example input JSON file to adhere to specification. (#3243) 2019-02-07 16:18:01 +01:00
vocab-data.jsonl Use even smaller examle size 2017-10-30 19:46:45 +01:00