spaCy/website/docs/api
Ines Montani cf65a80f36 Refactor lemmatizer and data table integration (#4353)
* Move test

* Allow default in Lookups.get_table

* Start with blank tables in Lookups.from_bytes

* Refactor lemmatizer to hold instance of Lookups

* Get lookups table within the lemmatization methods to make sure it references the correct table (even if the table was replaced or modified, e.g. when loading a model from disk)
* Deprecate other arguments on Lemmatizer.__init__ and expect Lookups for consistency
* Remove old and unsupported Lemmatizer.load classmethod
* Refactor language-specific lemmatizers to inherit as much as possible from base class and override only what they need

* Update tests and docs

* Fix more tests

* Fix lemmatizer

* Upgrade pytest to try and fix weird CI errors

* Try pytest 4.6.5
2019-10-01 21:36:03 +02:00
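The commit message above says the lemmatizer now holds a Lookups instance and fetches the table inside the lemmatization methods, so that a replaced or modified table is picked up. The following is a minimal sketch of that idea in plain Python. `ToyLookups` and `ToyLemmatizer` are hypothetical stand-ins written for illustration, not spaCy's classes, and the table name `lemma_lookup` is an assumption here.

```python
class ToyLookups:
    """Hypothetical stand-in for a container of named tables (not spaCy's Lookups)."""

    def __init__(self):
        self._tables = {}

    def add_table(self, name, data=None):
        # Adding a table under an existing name replaces the old table object.
        self._tables[name] = dict(data or {})
        return self._tables[name]

    def get_table(self, name, default=None):
        # Return the default instead of raising when the table is missing.
        return self._tables.get(name, default)


class ToyLemmatizer:
    """Hypothetical lemmatizer that keeps the container, not a table reference."""

    def __init__(self, lookups):
        self.lookups = lookups

    def lookup(self, string):
        # Fetch the table at call time, so a table that was added or replaced
        # after construction is the one that gets used.
        table = self.lookups.get_table("lemma_lookup", {})
        return table.get(string, string)


lookups = ToyLookups()
lemmatizer = ToyLemmatizer(lookups)

before = lemmatizer.lookup("going")  # no table yet, falls back to the input
lookups.add_table("lemma_lookup", {"going": "go"})
after = lemmatizer.lookup("going")   # the newly added table is now consulted

print(before, after)
```

Had the toy lemmatizer stored the table itself in `__init__`, it would keep pointing at the old (or missing) table after `add_table` ran, which is the situation the commit message describes avoiding.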
annotation.md Update lemma data documentation [ci skip] 2019-10-01 13:22:13 +02:00
cli.md Update CLI docs [ci skip] 2019-09-28 13:12:30 +02:00
cython-classes.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
cython-structs.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
cython.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
dependencyparser.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
doc.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
docbin.md Update DocBin and add docs 2019-09-18 20:23:21 +02:00
entitylinker.md Fix typos and formatting [ci skip] 2019-10-01 12:30:04 +02:00
entityrecognizer.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
entityruler.md Fix typos and formatting [ci skip] 2019-10-01 12:30:04 +02:00
goldcorpus.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
goldparse.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
index.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
kb.md Documentation for Entity Linking (#4065) 2019-09-12 11:38:34 +02:00
language.md Fix docs consistency [ci skip] 2019-09-14 16:23:37 +02:00
lemmatizer.md Refactor lemmatizer and data table integration (#4353) 2019-10-01 21:36:03 +02:00
lexeme.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
lookups.md Fix typos and formatting [ci skip] 2019-10-01 12:30:04 +02:00
matcher.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
phrasematcher.md Document PhraseMatcher.remove [ci skip] 2019-09-27 16:34:53 +02:00
pipeline-functions.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
scorer.md Add textcat to train CLI (#4226) 2019-09-15 22:31:31 +02:00
sentencizer.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
span.md Merge branch 'master' into develop 2019-09-12 18:03:01 +02:00
stringstore.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
tagger.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
textcategorizer.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
token.md Remove u-strings and fix formatting [ci skip] 2019-09-12 16:11:15 +02:00
tokenizer.md Remove LEMMA from exception examples [ci skip] 2019-09-12 16:26:27 +02:00
top-level.md Remove LEMMA from exception examples [ci skip] 2019-09-12 16:26:27 +02:00
vectors.md Fix tag [ci skip] 2019-09-26 16:22:33 +02:00
vocab.md Update vectors name docs [ci skip] 2019-09-26 16:21:32 +02:00