spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-07-11 08:42:28 +03:00

History

Daniël de Kok b734e5314d Avoid `TrainablePipe.finish_update` getting called twice during training (#12450 ) * Avoid `TrainablePipe.finish_update` getting called twice during training PR #12136 fixed an issue where the tok2vec pipe was updated before gradient were accumulated. However, it introduced a new bug that cause `finish_update` to be called twice when using the training loop. This causes a fairly large slowdown. The `Language.update` method accepts the `sgd` argument for passing an optimizer. This argument has three possible values: - `Optimizer`: use the given optimizer to finish pipe updates. - `None`: use a default optimizer to finish pipe updates. - `False`: do not finish pipe updates. However, the latter option was not documented and not valid with the existing type of `sgd`. I assumed that this was a remnant of earlier spaCy versions and removed handling of `False`. However, with that change, we are passing `None` to `Language.update`. As a result, we were calling `finish_update` in both `Language.update` and in the training loop after all subbatches are processed. This change restores proper handling/use of `False`. Moreover, the role of `False` is now documented and added to the type to avoid future accidents. * Fix typo * Document defaults for `Language.update`		2023-03-30 09:30:42 +02:00
..
api	Avoid `TrainablePipe.finish_update` getting called twice during training (#12450 )	2023-03-30 09:30:42 +02:00
models	Backslash fixes in docs (#12213 )	2023-02-01 10:15:38 +01:00
usage	Merge branch 'master' into sync/master-into-v4	2023-03-02 16:24:15 +01:00
styleguide.mdx	Website migration from Gatsby to Next (#12058 )	2023-01-11 17:30:07 +01:00