spaCy/spacy/cli
Paul O'Leary McCann cbd85c9608 Change GPU efficient textcat to use CNN, not BOW in generated configs (#11900)
* Change GPU efficient textcat to use CNN, not BOW

If you generate a config with a textcat component using GPU
(transformers), the defaut option (efficiency) uses a BOW architecture,
which does not use tok2vec features. While that can make sense as part
of a larger pipeline, in the case of just a transformer and a textcat,
that means the transformer is doing a lot of work for no purpose.

This changes it so that the CNN architecture is used instead. It could
also be changed to be the same as the accuracy config, which uses the
ensemble architecture.

* Add the transformer when using a textcat with GPU

* Switch ubuntu-latest to ubuntu-20.04 in main tests (#11928)

* Switch ubuntu-latest to ubuntu-20.04 in main tests

* Only use 20.04 for 3.6

* Require thinc v8.1.7

* Require thinc v8.1.8

* Break up longer expression

---------

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-09 10:32:51 +01:00
..
project Partially work around pending deprecation of pkg_resources (#12368) 2023-03-09 10:32:01 +01:00
templates Change GPU efficient textcat to use CNN, not BOW in generated configs (#11900) 2023-03-09 10:32:51 +01:00
__init__.py Add a spacy benchmark speed subcommand (#11902) 2023-01-12 11:55:21 +01:00
_util.py Have logging calls use string formatting types (#12215) 2023-02-02 11:15:22 +01:00
apply.py Auto-format code with black (#12019) 2022-12-23 12:44:07 +01:00
assemble.py Tidy up with flake8: imports, comparisons, etc. 2021-06-28 12:08:15 +02:00
benchmark_speed.py Add a spacy benchmark speed subcommand (#11902) 2023-01-12 11:55:21 +01:00
convert.py fix processing of "auto" in convert (#12050) 2023-01-05 10:21:00 +01:00
debug_config.py Fix references to config file in the docs & UX (#9961) 2022-01-04 14:31:26 +01:00
debug_data.py trainable_lemmatizer in debug data (#11419) 2023-01-26 17:36:50 +01:00
debug_diff.py Add debug diff command in spaCy CLI (#10502) 2022-04-07 10:48:45 +02:00
debug_model.py Tidy up with flake8: imports, comparisons, etc. 2021-06-28 12:08:15 +02:00
download.py Add a way to get the URL to download a pipeline to the CLI (#11175) 2022-09-02 11:58:21 +02:00
evaluate.py Add a spacy benchmark speed subcommand (#11902) 2023-01-12 11:55:21 +01:00
find_threshold.py find-threshold: CLI command for multi-label classifier threshold tuning (#11280) 2022-11-25 11:44:55 +01:00
info.py Partially work around pending deprecation of pkg_resources (#12368) 2023-03-09 10:32:01 +01:00
init_config.py Workaround for Typer optional default values with Python calls (#10788) 2022-06-17 12:15:36 +02:00
init_pipeline.py Add support for floret vectors (#8909) 2021-10-27 14:08:31 +02:00
package.py Adjust return type of registry.find (#12227) 2023-02-15 12:32:53 +01:00
pretrain.py Add parent argument (#11210) 2022-07-26 14:35:18 +02:00
profile.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00
train.py Add docs section for spacy.cli.train.train (#9545) 2021-10-29 10:36:34 +02:00
validate.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00