mirror of
https://github.com/explosion/spaCy.git
synced 2025-12-07 10:14:22 +03:00
This rewrites the loss to not use the Thinc crossentropy code at all. The main difference here is that the negative predictions are being masked out (= marginalized over), but negative gradient is still being reflected. I'm still not sure this is exactly right but models seem to train reliably now. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| coref_util.py | ||
| coref.py | ||
| entity_linker.py | ||
| multi_task.py | ||
| parser.py | ||
| tagger.py | ||
| textcat.py | ||
| tok2vec.py | ||