spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-03-07 05:11:27 +03:00

Daniël de Kok da7ad97519 Update `TextCatBOW` to use the fixed `SparseLinear` layer (#13149)	2023-11-29 09:11:54 +01:00

* Update `TextCatBOW` to use the fixed `SparseLinear` layer

  A while ago, we fixed the `SparseLinear` layer to use all available parameters: https://github.com/explosion/thinc/pull/754

  This change updates `TextCatBOW` to `v3`, which uses the new `SparseLinear_v2` layer. This results in a sizeable improvement on a text categorization task that was tested.

  While at it, this `spacy.TextCatBOW.v3` also adds the `length_exponent` option to make it possible to change the hidden size. Ideally, we'd just have an option called `length`. But the way that `TextCatBOW` uses hashes results in a non-uniform distribution of parameters when the length is not a power of two.

* Replace TextCatBOW `length_exponent` parameter by `length`

  We now round up the length to the next power of two if it isn't a power of two.

* Remove some tests for TextCatBOW.v2

* Fix missing import
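
The commit message says that `length` is rounded up to the next power of two when it is not already one. A minimal sketch of that rounding step is shown here; the helper name `next_power_of_two` is hypothetical and is not taken from the spaCy or Thinc source, which may implement the rounding differently.

```python
def next_power_of_two(length: int) -> int:
    """Round `length` up to the nearest power of two.

    Hypothetical helper for illustration only; not spaCy's actual code.
    """
    if length < 1:
        raise ValueError("length must be a positive integer")
    # (length - 1).bit_length() is the number of bits needed to represent
    # length - 1, so shifting 1 left by that amount gives the smallest
    # power of two that is greater than or equal to length.
    return 1 << (length - 1).bit_length()


if __name__ == "__main__":
    for requested in (1, 3, 1000, 262144, 262145):
        print(requested, "->", next_power_of_two(requested))
```

A value that is already a power of two is returned unchanged, so only other lengths are adjusted upward.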
101	Inline displaCy visualizations in docs (#13050) [ci skip]	2023-10-06 14:22:43 +02:00
_benchmarks-models.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
embeddings-transformers.mdx	Support registered vectors (#12492)	2023-08-01 15:46:08 +02:00
facts-figures.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
index.mdx	Drop support for python 3.6 (#13009)	2023-09-25 14:48:38 +02:00
large-language-models.mdx	Add Mistral mentions. (#13037)	2023-10-05 14:44:38 +02:00
layers-architectures.mdx	Update `TextCatBOW` to use the fixed `SparseLinear` layer (#13149)	2023-11-29 09:11:54 +01:00
linguistic-features.mdx	Clarify EL example in docs (#13071)	2023-10-31 21:58:29 +01:00
models.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
processing-pipelines.mdx	Update `TextCatBOW` to use the fixed `SparseLinear` layer (#13149)	2023-11-29 09:11:54 +01:00
projects.mdx	Remove pathy dependency, update docs for cloudpathlib in Weasel (#13035)	2023-10-05 08:50:22 +02:00
rule-based-matching.mdx	Inline displaCy visualizations in docs (#13050) [ci skip]	2023-10-06 14:22:43 +02:00
saving-loading.mdx	Add preferred use of build for package CLI (#13109)	2023-11-08 17:35:24 +01:00
spacy-101.mdx	WEB-27 Add `alt` tags to images (#12166)	2023-01-24 13:56:14 +01:00
training.mdx	fix training.batch_size example (#12963)	2023-09-06 16:38:13 +02:00
v2-1.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v2-2.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v2-3.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v2.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v3-1.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v3-2.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v3-3.mdx	Inline displaCy visualizations in docs (#13050) [ci skip]	2023-10-06 14:22:43 +02:00
v3-4.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
v3-5.mdx	Revert "Fix FUZZY operator definition (#12318)" (#12336)	2023-02-27 09:48:36 +01:00
v3-6.mdx	Docs for v3.6.0 (#12792)	2023-07-06 12:58:25 +02:00
v3-7.mdx	Docs for v3.7.0 (#13029)	2023-10-01 21:40:07 +02:00
v3.mdx	Website migration from Gatsby to Next (#12058)	2023-01-11 17:30:07 +01:00
visualizers.mdx	Inline displaCy visualizations in docs (#13050) [ci skip]	2023-10-06 14:22:43 +02:00