spaCy/spacy/ml
Paul O'Leary McCann 00d481dd12 Stack the mention scorer
In the reference implementations, there's usually a function to build a
ffnn of arbitrary depth, consisting of a stack of Linear >> Relu >>
Dropout. In practice the depth is always 1 in coref-hoi, but in earlier
iterations of the model, which are more similar to our model here (since
we aren't using attention or even necessarily BERT), using a small depth
like 2 was common. This hard-codes a stack of 2.

In brief tests this allows similar performance to the unstacked version
with much smaller embedding sizes.

The depth of the stack could be made into a hyperparameter.
2021-08-09 18:04:42 +09:00
..
models Stack the mention scorer 2021-08-09 18:04:42 +09:00
__init__.py Tidy up and auto-format 2020-06-20 14:15:04 +02:00
_character_embed.py Register CharEmbed layer (#7805) 2021-04-19 18:39:34 +10:00
_precomputable_affine.py TransitionBasedParser.v1 to legacy (#8586) 2021-07-06 15:26:45 +02:00
extract_ngrams.py register extract_ngrams layer (#8358) 2021-06-14 10:30:30 +02:00
extract_spans.py Add SpanCategorizer component (#6747) 2021-06-24 12:35:27 +02:00
featureextractor.py Fix import 2020-10-02 01:12:34 +02:00
parser_model.pxd The Parser is now a Pipe (2) (#5844) 2020-07-30 23:30:54 +02:00
parser_model.pyx The Parser is now a Pipe (2) (#5844) 2020-07-30 23:30:54 +02:00
staticvectors.py Set up GPU CI testing (#7293) 2021-04-22 14:58:29 +02:00
tb_framework.py TransitionBasedParser.v1 to legacy (#8586) 2021-07-06 15:26:45 +02:00