spaCy/spacy/pipeline
Adriane Boyd 2702db9fef
Recommend lookups tables from URLs or other loaders (#12283)
* Recommend lookups tables from URLs or other loaders

Shift away from the `lookups` extra (which isn't removed, just no longer
mentioned) and recommend loading data from the `spacy-lookups-data` repo
or other sources rather than the `spacy-lookups-data` package.

If the tables can't be loaded from the `lookups` registry in the
lemmatizer, show how to specify the tables in `[initialize]` rather than
recommending the `spacy-lookups-data` package.
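
As a rough illustration of supplying tables directly instead of installing the package, the sketch below builds a toy `lemma_lookup` table in memory and hands it to the lemmatizer's `initialize` method. The table contents and the `build_lookup_pipeline` helper are hypothetical and used for illustration only. This is not the `[initialize]` config that the commit refers to, just a Python-side equivalent under those assumptions.

```python
import spacy
from spacy.lookups import Lookups

# Toy table for illustration only (an assumption, not real data); real tables
# would come from the spacy-lookups-data repo or another loader.
TOY_LEMMA_LOOKUP = {"dogs": "dog", "ran": "run"}


def build_lookup_pipeline(table):
    """Hypothetical helper: blank English pipeline with a lookup lemmatizer."""
    nlp = spacy.blank("en")
    # "lookup" mode only needs a single table named "lemma_lookup".
    lemmatizer = nlp.add_pipe("lemmatizer", config={"mode": "lookup"})
    lookups = Lookups()
    lookups.add_table("lemma_lookup", table)
    # Pass the tables explicitly rather than relying on the lookups registry.
    lemmatizer.initialize(lookups=lookups)
    return nlp


nlp = build_lookup_pipeline(TOY_LEMMA_LOOKUP)
lemmas = [token.lemma_ for token in nlp("dogs ran")]
print(lemmas)
```

Tokens missing from the table fall back to their surface form in this mode, so a small table degrades gracefully rather than failing.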

* Add tests for some rule-based lemmatizers

* Apply suggestions from code review

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

2023-07-31 15:54:35 +02:00
| Name | Last commit | Date |
| --- | --- | --- |
| `_edit_tree_internals` | ci: add cython linter (#12694) | 2023-07-19 12:03:31 +02:00 |
| `_parser_internals` | cython fixes and cleanup | 2023-07-19 17:41:29 +02:00 |
| `__init__.py` | Merge remote-tracking branch 'upstream/master' into sync-v4-master-20230612 | 2023-06-12 15:57:10 +02:00 |
| `attribute_ruler.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `dep_parser.py` | Merge branch 'upstream_master' into sync_v4 | 2023-07-19 16:37:31 +02:00 |
| `edit_tree_lemmatizer.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `entity_linker.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `entityruler.py` | Configure isort to use the Black profile, recursively isort the spacy module (#12721) | 2023-06-14 17:48:41 +02:00 |
| `functions.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `lemmatizer.py` | Recommend lookups tables from URLs or other loaders (#12283) | 2023-07-31 15:54:35 +02:00 |
| `morphologizer.pyx` | cython fixes and cleanup | 2023-07-19 17:41:29 +02:00 |
| `ner.py` | Merge branch 'upstream_master' into sync_v4 | 2023-07-19 16:37:31 +02:00 |
| `pipe.pxd` | TrainablePipe (#6213) | 2020-10-08 21:33:49 +02:00 |
| `pipe.pyi` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `pipe.pyx` | cython fixes and cleanup | 2023-07-19 17:41:29 +02:00 |
| `sentencizer.pyx` | merge fixes | 2023-07-19 16:38:29 +02:00 |
| `senter.pyx` | Merge branch 'upstream_master' into sync_v4 | 2023-07-19 16:37:31 +02:00 |
| `span_finder.py` | SpanFinder: set default max_length to 25 (#12791) | 2023-07-06 09:55:34 +02:00 |
| `span_ruler.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `spancat.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `tagger.pyx` | Merge branch 'upstream_master' into sync_v4 | 2023-07-19 16:37:31 +02:00 |
| `textcat_multilabel.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `textcat.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `tok2vec.py` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `trainable_pipe.pxd` | isort all the things | 2023-06-26 11:41:03 +02:00 |
| `trainable_pipe.pyx` | cython fixes and cleanup | 2023-07-19 17:41:29 +02:00 |
| `transition_parser.pyx` | cython fixes and cleanup | 2023-07-19 17:41:29 +02:00 |