spaCy/spacy/ml/models
Paul O'Leary McCann 00d481dd12 Stack the mention scorer
In the reference implementations, there's usually a function to build a
ffnn of arbitrary depth, consisting of a stack of Linear >> Relu >>
Dropout. In practice the depth is always 1 in coref-hoi, but in earlier
iterations of the model, which are more similar to our model here (since
we aren't using attention or even necessarily BERT), using a small depth
like 2 was common. This hard-codes a stack of 2.

In brief tests this allows similar performance to the unstacked version
with much smaller embedding sizes.

The depth of the stack could be made into a hyperparameter.
2021-08-09 18:04:42 +09:00
..
__init__.py Merge branch 'master' into feature/coref 2021-07-10 18:10:40 +09:00
coref_util.py Minor speedup 2021-07-21 19:50:10 +09:00
coref.py Stack the mention scorer 2021-08-09 18:04:42 +09:00
entity_linker.py Update flake8 version in reqs and CI 2021-06-28 11:29:36 +02:00
multi_task.py applying suggestion to avoid mypy errors (#8265) 2021-06-02 19:25:30 +10:00
parser.py TransitionBasedParser.v1 to legacy (#8586) 2021-07-06 15:26:45 +02:00
spancat.py Fix span embeds 2021-07-10 21:38:53 +09:00
tagger.py consistently use registry as callable 2021-03-02 17:56:28 +01:00
textcat.py Resizable textcat (#7862) 2021-06-16 11:45:00 +02:00
tok2vec.py Replace negative rows with 0 in StaticVectors (#7674) 2021-04-22 18:04:15 +10:00