mirror of
https://github.com/explosion/spaCy.git
synced 2025-11-02 17:07:49 +03:00
* Update `TextCatBOW` to use the fixed `SparseLinear` layer A while ago, we fixed the `SparseLinear` layer to use all available parameters: https://github.com/explosion/thinc/pull/754 This change updates `TextCatBOW` to `v3` which uses the new `SparseLinear_v2` layer. This results in a sizeable improvement on a text categorization task that was tested. While at it, this `spacy.TextCatBOW.v3` also adds the `length_exponent` option to make it possible to change the hidden size. Ideally, we'd just have an option called `length`. But the way that `TextCatBOW` uses hashes results in a non-uniform distribution of parameters when the length is not a power of two. * Replace TexCatBOW `length_exponent` parameter by `length` We now round up the length to the next power of two if it isn't a power of two. * Remove some tests for TextCatBOW.v2 * Fix missing import |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| test_analysis.py | ||
| test_annotates_on_update.py | ||
| test_attributeruler.py | ||
| test_edit_tree_lemmatizer.py | ||
| test_entity_linker.py | ||
| test_entity_ruler.py | ||
| test_functions.py | ||
| test_initialize.py | ||
| test_lemmatizer.py | ||
| test_models.py | ||
| test_morphologizer.py | ||
| test_pipe_factories.py | ||
| test_pipe_methods.py | ||
| test_sentencizer.py | ||
| test_senter.py | ||
| test_span_finder.py | ||
| test_span_ruler.py | ||
| test_spancat.py | ||
| test_tagger.py | ||
| test_textcat.py | ||
| test_tok2vec.py | ||