spaCy/spacy/tests/regression
Sofie Van Landeghem 2d249a9502 KB extensions and better parsing of WikiData (#4375)
* fix overflow error on windows

* more documentation & logging fixes

* md fix

* 3 different limit parameters to play with execution time

* bug fixes directory locations

* small fixes

* exclude dev test articles from prior probabilities stats

* small fixes

* filtering wikidata entities, removing numeric and meta items

* adding aliases from wikidata also to the KB

* fix adding WD aliases

* adding also new aliases to previously added entities

* fixing comma's

* small doc fixes

* adding subclassof filtering

* append alias functionality in KB

* prevent appending the same entity-alias pair

* fix for appending WD aliases

* remove date filter

* remove unnecessary import

* small corrections and reformatting

* remove WD aliases for now (too slow)

* removing numeric entities from training and evaluation

* small fixes

* shortcut during prediction if there is only one candidate

* add counts and fscore logging, remove FP NER from evaluation

* fix entity_linker.predict to take docs instead of single sentences

* remove enumeration sentences from the WP dataset

* entity_linker.update to process full doc instead of single sentence

* spelling corrections and dump locations in readme

* NLP IO fix

* reading KB is unnecessary at the end of the pipeline

* small logging fix

* remove empty files
2019-10-14 12:28:53 +02:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue1-1000.py KB extensions and better parsing of WikiData (#4375) 2019-10-14 12:28:53 +02:00
test_issue1001-1500.py Refactor lemmatizer and data table integration (#4353) 2019-10-01 21:36:03 +02:00
test_issue1501-2000.py Refactor lemmatizer and data table integration (#4353) 2019-10-01 21:36:03 +02:00
test_issue2001-2500.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue2501-3000.py Refactor lemmatizer and data table integration (#4353) 2019-10-01 21:36:03 +02:00
test_issue3001-3500.py Ensure training doesn't crash with empty batches (#4360) 2019-10-02 12:50:47 +02:00
test_issue3521.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3526.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3531.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3540.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3549.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3555.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3611.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3625.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3803.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3830.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3839.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3869.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3879.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3880.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3882.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3951.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3959.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3962.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue3972.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4002.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4030.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4042.py Fix test imports 2019-09-29 17:34:56 +02:00
test_issue4054.py Fix test imports 2019-09-29 17:34:56 +02:00
test_issue4120.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4133.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4190.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4267.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4272.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4278.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4313.py Revert #4334 2019-09-29 17:32:12 +02:00
test_issue4348.py Ensure training doesn't crash with empty batches (#4360) 2019-10-02 12:50:47 +02:00
test_issue4367.py Bugfix initializing DocBin with attributes (#4368) 2019-10-03 14:48:45 +02:00
test_issue4373.py Make PhraseMatcher.vocab consistent with Matcher.vocab (closes #4373) 2019-10-04 12:18:41 +02:00