mirror of
https://github.com/explosion/spaCy.git
synced 2025-01-08 08:16:34 +03:00
da7ad97519
* Update `TextCatBOW` to use the fixed `SparseLinear` layer A while ago, we fixed the `SparseLinear` layer to use all available parameters: https://github.com/explosion/thinc/pull/754 This change updates `TextCatBOW` to `v3` which uses the new `SparseLinear_v2` layer. This results in a sizeable improvement on a text categorization task that was tested. While at it, this `spacy.TextCatBOW.v3` also adds the `length_exponent` option to make it possible to change the hidden size. Ideally, we'd just have an option called `length`. But the way that `TextCatBOW` uses hashes results in a non-uniform distribution of parameters when the length is not a power of two. * Replace TexCatBOW `length_exponent` parameter by `length` We now round up the length to the next power of two if it isn't a power of two. * Remove some tests for TextCatBOW.v2 * Fix missing import |
||
---|---|---|
.. | ||
101 | ||
_benchmarks-models.mdx | ||
embeddings-transformers.mdx | ||
facts-figures.mdx | ||
index.mdx | ||
large-language-models.mdx | ||
layers-architectures.mdx | ||
linguistic-features.mdx | ||
models.mdx | ||
processing-pipelines.mdx | ||
projects.mdx | ||
rule-based-matching.mdx | ||
saving-loading.mdx | ||
spacy-101.mdx | ||
training.mdx | ||
v2-1.mdx | ||
v2-2.mdx | ||
v2-3.mdx | ||
v2.mdx | ||
v3-1.mdx | ||
v3-2.mdx | ||
v3-3.mdx | ||
v3-4.mdx | ||
v3-5.mdx | ||
v3-6.mdx | ||
v3-7.mdx | ||
v3.mdx | ||
visualizers.mdx |