spaCy/spacy/pipeline
Daniël de Kok da7ad97519
Update TextCatBOW to use the fixed SparseLinear layer (#13149)
* Update `TextCatBOW` to use the fixed `SparseLinear` layer

A while ago, we fixed the `SparseLinear` layer to use all available
parameters: https://github.com/explosion/thinc/pull/754

This change updates `TextCatBOW` to `v3` which uses the new
`SparseLinear_v2` layer. This results in a sizeable improvement on a
text categorization task that was tested.

While at it, this `spacy.TextCatBOW.v3` also adds the `length_exponent`
option to make it possible to change the hidden size. Ideally, we'd just
have an option called `length`. But the way that `TextCatBOW` uses
hashes results in a non-uniform distribution of parameters when the
length is not a power of two.

* Replace TexCatBOW `length_exponent` parameter by `length`

We now round up the length to the next power of two if it isn't
a power of two.

* Remove some tests for TextCatBOW.v2

* Fix missing import
2023-11-29 09:11:54 +01:00
..
_edit_tree_internals Add profile=False to currently unprofiled cython 2023-09-28 17:09:41 +02:00
_parser_internals Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
legacy Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
__init__.py Update __all__ fields (#13063) 2023-10-16 10:17:47 +02:00
attributeruler.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
dep_parser.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
edit_tree_lemmatizer.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
entity_linker.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
entityruler.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
functions.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
lemmatizer.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
morphologizer.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
multitask.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
ner.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
pipe.pxd TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
pipe.pyi Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
pipe.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
sentencizer.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
senter.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
span_finder.py SpanFinder: set default max_length to 25 (#12791) 2023-07-06 09:55:34 +02:00
span_ruler.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
spancat.py SpanFinder into spaCy from experimental (#12507) 2023-06-07 15:52:28 +02:00
tagger.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
textcat_multilabel.py Update TextCatBOW to use the fixed SparseLinear layer (#13149) 2023-11-29 09:11:54 +01:00
textcat.py Update TextCatBOW to use the fixed SparseLinear layer (#13149) 2023-11-29 09:11:54 +01:00
tok2vec.py Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
trainable_pipe.pxd Configure isort to use the Black profile, recursively isort the spacy module (#12721) 2023-06-14 17:48:41 +02:00
trainable_pipe.pyx Remove profile=True from currently profiled cython 2023-09-28 17:09:41 +02:00
transition_parser.pxd ci: add cython linter (#12694) 2023-07-19 12:03:31 +02:00
transition_parser.pyx Add profile=False to currently unprofiled cython 2023-09-28 17:09:41 +02:00