15154 Commits

Author SHA1 Message Date
Matthew Honnibal
394862b0f4 Start rigging beam back up 2022-01-20 16:17:54 +01:00
Matthew Honnibal
07603a26ae Get non-reference forward func working 2022-01-20 16:13:24 +01:00
Matthew Honnibal
160dbc58ea Improve indexing on reference implementation 2022-01-20 16:12:52 +01:00
Matthew Honnibal
0cdbcd8b9a Add missing file 2022-01-20 16:11:46 +01:00
Matthew Honnibal
c45e5ac5b7 Fix missing prints 2022-01-20 16:11:30 +01:00
Matthew Honnibal
337b3f22b8 Get tests passing with reference implementation 2022-01-20 16:11:01 +01:00
svlandeg
f8672c4dc2 black formatting 2022-01-20 16:09:54 +01:00
Matthew Honnibal
5903138ab6 Add reference version 2022-01-20 16:08:07 +01:00
Matthew Honnibal
604ceb1da1 Fix redundant test. 2 failures 2022-01-20 15:57:14 +01:00
Matthew Honnibal
dea702b4b7 Hackishly fix resizing. 3 failures 2022-01-20 15:46:54 +01:00
Matthew Honnibal
dd03ad2e27 Fix empty doc in update 2022-01-20 15:46:01 +01:00
svlandeg
87cf72d1c8 pass nO through 2021-10-29 17:38:11 +02:00
svlandeg
1cc0d05812 fixes 2021-10-29 17:10:07 +02:00
svlandeg
dbaf68a439 formatting 2021-10-29 14:19:30 +02:00
svlandeg
87fb268f76 Merge remote-tracking branch 'upstream/master' into refactor/parser-gpu 2021-10-29 14:16:43 +02:00
svlandeg
753f9ee685 cleanup 2021-10-29 13:25:15 +02:00
Paul O'Leary McCann
006df1ae1f
Clarify error when words are of wrong type (#9541)
* Clarify error when words are of wrong type

See #9437

* Update docs

* Use try/except

* Apply suggestions from code review

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-10-29 12:08:40 +02:00
Paul O'Leary McCann
2fd8d616e7
Add docs section for spacy.cli.train.train (#9545)
* Add section for spacy.cli.train.train

* Add link from training page to train function

* Ensure path in train helper

* Update docs

Co-authored-by: Ines Montani <ines@ines.io>
2021-10-29 10:36:34 +02:00
Adriane Boyd
5477453ea3
Docs for thinc-apple-ops (#9549)
* Docs for thinc-apple-ops

* Ignore thinc-apple-ops in reqs tests

* Fix install quickstart

* Add cupy cuda 113, 114 extras

* Remove draft section

Co-authored-by: Ines Montani <ines@ines.io>
2021-10-29 10:35:31 +02:00
Philip Vollet
76173b0866
fixed typo and URL (#9560) 2021-10-29 13:57:44 +09:00
Adriane Boyd
72dc63b3fb
Update for python 3.10 (#9519)
* Update for python 3.10

* Update mac image

* Update build constraints for python 3.10

* Add extras for cupy cuda 11.3-11.5

* Remove cupy-cuda115 extra

* Require thinc>=8.0.12

* Switch CI to windows-2019

* Skip mypy for python 3.10
2021-10-28 15:32:06 +02:00
Matthew Honnibal
79d5957c47 Xfail. 6 failures 2021-10-27 23:26:07 +02:00
Matthew Honnibal
6b5302cdf3 More xfail. 7 failures 2021-10-27 23:24:33 +02:00
Matthew Honnibal
7309e49286 Xfail beam stuff. 9 failures 2021-10-27 23:21:55 +02:00
Matthew Honnibal
880182afdb Work on parser. 15 tests failing 2021-10-27 23:02:29 +02:00
Matthew Honnibal
af9a30b192 Keep working through errors 2021-10-27 17:13:11 +02:00
Matthew Honnibal
b67dd0cf89 Keep working through errors 2021-10-27 17:10:33 +02:00
Adriane Boyd
386dcada1c
Address random results in slow readers tests (#9544)
* Set random seed for dataset shuffling
* Use more dev examples for non-zero scores
2021-10-26 16:53:10 +02:00
Matthew Honnibal
c538eaf1c8 Work through tests 2021-10-26 01:21:51 +02:00
Matthew Honnibal
d765a4f8ee Cleaner handling of unseen classes 2021-10-25 22:34:29 +02:00
Matthew Honnibal
07a3581ff8 Support unseen classes in parser 2021-10-25 22:26:52 +02:00
Matthew Honnibal
4b5d1b53f6 Support unseen_classes in parser model 2021-10-25 22:21:17 +02:00
Matthew Honnibal
03018904ef Work on parser model 2021-10-25 16:11:58 +02:00
Matthew Honnibal
9c4a04d0c5 Uncython 2021-10-25 12:51:32 +02:00
Matthew Honnibal
1921e86813 Uncython ner.pyx and dep_parser.pyx 2021-10-25 12:51:14 +02:00
Matthew Honnibal
45ca12f07a Wire up parser model 2021-10-25 12:50:33 +02:00
Matthew Honnibal
71abe2e42d Wire up tb_framework to new parser model 2021-10-25 12:50:20 +02:00
Matthew Honnibal
0279aa036a Delete _precomputable_affine module 2021-10-25 12:28:57 +02:00
Matthew Honnibal
9b459f9ef2 Delete spacy.ml.parser_model 2021-10-25 12:28:31 +02:00
Matthew Honnibal
7b9c282469 Convert parser from cdef class 2021-10-25 12:28:13 +02:00
Matthew Honnibal
34aab9899f Prepare to remove parser_model.pyx 2021-10-25 12:22:46 +02:00
Matthew Honnibal
de8c88babb New progress on parser model refactor 2021-10-25 03:13:31 +02:00
Elia Robyn Lake (Robyn Speer)
fa70837f28
clarify how to connect pretraining to training (#9450)
* clarify how to connect pretraining to training

Signed-off-by: Elia Robyn Speer <elia@explosion.ai>

* Update website/docs/usage/embeddings-transformers.md

* Update website/docs/usage/embeddings-transformers.md

* Update website/docs/usage/embeddings-transformers.md

* Update website/docs/usage/embeddings-transformers.md

Co-authored-by: Elia Robyn Speer <elia@explosion.ai>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-10-22 13:15:47 +02:00
github-actions[bot]
b0b115ff39
Auto-format code with black (#9530)
Co-authored-by: explosion-bot <explosion-bot@users.noreply.github.com>
2021-10-22 13:03:10 +02:00
Sofie Van Landeghem
c9f28b6d08
Merge branch 'spacy.io' into master 2021-10-21 20:46:33 +02:00
Sofie Van Landeghem
c7ed631f3c
bump version to 3.1.4 (#9524) 2021-10-21 20:34:57 +02:00
Daniël de Kok
f31ac6fd4f
Print a warning when multiprocessing is used on a GPU (#9475)
* Raise an error when multiprocessing is used on a GPU

As reported in #5507, a confusing exception is thrown when
multiprocessing is used with a GPU model and the `fork` multiprocessing
start method:

cupy.cuda.runtime.CUDARuntimeError: cudaErrorInitializationError: initialization error

This change checks whether one of the models uses the GPU when
multiprocessing is used. If so, raise a friendly error message.

Even though multiprocessing can work on a GPU with the `spawn` method,
it quickly runs the GPU out-of-memory on real-world data. Also,
multiprocessing on a single GPU typically does not provide large
performance gains.

* Move GPU multiprocessing check to Language.pipe

* Warn rather than error when using multiprocessing with GPU models

* Improve GPU multiprocessing warning message.

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Reduce API assumptions

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update spacy/language.py

* Update spacy/language.py

* Test that warning is thrown with GPU + multiprocessing

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-10-21 16:14:23 +02:00
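
The behaviour described in the commit above (warn, rather than raise, when multiprocessing is combined with a GPU model) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper name `check_gpu_multiprocessing`, its parameters, and the warning text are all assumptions made for the example.

```python
import warnings


def check_gpu_multiprocessing(n_process: int, uses_gpu: bool) -> bool:
    """Warn (rather than raise) when several processes would share a GPU model.

    Hypothetical helper: the name, signature, and message are illustrative
    and are not taken from spaCy. Returns True if a warning was issued.
    """
    # Any value other than 1 is treated here as "more than one process".
    if n_process != 1 and uses_gpu:
        warnings.warn(
            "Multiprocessing with GPU models is not recommended and may "
            "lead to errors or out-of-memory conditions.",
            UserWarning,
            stacklevel=2,
        )
        return True
    return False
```

A warning leaves the decision with the caller: the `spawn` start method can still work on a GPU, so a hard error would block a setup that is slow or memory-hungry but not strictly invalid.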
Sofie Van Landeghem
5a38f79f18
Custom component types in spacy.ty (#9469)
* add custom protocols in spacy.ty

* add a test for the new types in spacy.ty

* import Example when type checking

* some type fixes

* put Protocol in compat

* revert update check back to hasattr

* runtime_checkable in compat as well
2021-10-21 15:31:06 +02:00
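
As a rough illustration of what a runtime-checkable protocol for component types looks like, the sketch below uses `typing.Protocol` and `typing.runtime_checkable` from the standard library (Python 3.8+). The protocol name `TrainableLike` and the two toy classes are assumptions for the example and are not the definitions added to `spacy.ty`.

```python
from typing import Any, Dict, Iterable, Protocol, runtime_checkable


@runtime_checkable
class TrainableLike(Protocol):
    """Hypothetical protocol: any object that exposes an `update` method."""

    def update(
        self, examples: Iterable[Any], *, drop: float = 0.0
    ) -> Dict[str, float]:
        ...


class Tagger:
    """Toy component that satisfies the protocol structurally."""

    def update(self, examples, *, drop=0.0):
        return {"tagger": 0.0}


class Tokenizer:
    """Toy component with no `update` method."""

    def __call__(self, text):
        return text.split()


# isinstance() on a runtime-checkable protocol only checks that the named
# members exist; it does not validate their signatures.
is_trainable_tagger = isinstance(Tagger(), TrainableLike)
is_trainable_tokenizer = isinstance(Tokenizer(), TrainableLike)

# The plain attribute check that the commit message mentions reverting to.
has_update = hasattr(Tagger(), "update")
```

For a single-method protocol like this one, the `isinstance` check and the `hasattr` check give the same answer. The protocol mainly helps static type checkers.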
Daniël de Kok
d0631e3005
Replace use_ops("numpy") by use_ops("cpu") in the parser (#9501)
* Replace use_ops("numpy") by use_ops("cpu") in the parser

This ensures that the best available CPU implementation is chosen
(e.g. Thinc Apple Ops on macOS).

* Run spaCy tests with thinc-apple-ops on macOS
2021-10-21 11:22:45 +02:00
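
The idea behind asking for `"cpu"` instead of `"numpy"` is that a generic name can resolve to the best CPU backend that is installed, while a specific name always pins one implementation. The sketch below shows that selection logic in isolation. The function `resolve_backend` and the backend label `"accelerated"` are hypothetical; this is not how Thinc implements `use_ops`.

```python
def resolve_backend(requested: str, accelerated_available: bool) -> str:
    """Map a requested backend name to the implementation that would be used.

    Hypothetical sketch: "cpu" means "best available CPU implementation",
    while any other name is taken literally.
    """
    if requested == "cpu":
        # Prefer the platform-specific implementation when it is installed,
        # otherwise fall back to the plain NumPy one.
        return "accelerated" if accelerated_available else "numpy"
    return requested
```

With this logic, `resolve_backend("numpy", True)` stays on the NumPy implementation even though a faster one is present, which is the situation the commit avoids by requesting `"cpu"`.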
Paul O'Leary McCann
28ecf399da
Remove some old version refs in the docs (#9448)
* Remove some old version refs in the docs

* Remove warning

* Update spacy/matcher/matcher.pyx

* Remove all references to the punctuation warning

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-10-21 11:17:59 +02:00