Paul O'Leary McCann
0f3c45646b
Update error number
...
This was changed by merge
2022-07-11 20:17:03 +09:00
Paul O'Leary McCann
7792229fa9
Merge branch 'master' into feature/coref
2022-07-11 20:16:23 +09:00
Adriane Boyd
5cb6f1ae51
CI: Install with two parallel build jobs ( #11111 )
2022-07-11 12:20:00 +02:00
Paul O'Leary McCann
1b3db149df
Merge branch 'fix/coref-alignment' into feature/coref
2022-07-11 19:12:03 +09:00
Paul O'Leary McCann
2c2791daa5
Merge pull request #11087 from polm/coref/doc-update
...
Update Coref Docs
2022-07-11 19:03:14 +09:00
Adriane Boyd
3701039c1f
Tweak build jobs setting, update install docs ( #11077 )
...
* Restrict SPACY_NUM_BUILD_JOBS to only override if set
* Update install docs
2022-07-08 19:21:17 +02:00
Peter Baumgartner
36cb2029a9
displaCy Spans Vertical Alignment Fix 2 ( #11092 )
...
* add in span render slot fix
* fix spacing off by one
* rm demo
* adjust comments
* fix whitespace and overlap issue
2022-07-08 19:20:13 +02:00
Richard Hudson
dc38a0f079
Change demo URL ( #11102 )
2022-07-08 19:19:48 +02:00
Adriane Boyd
66d6461c8f
Use thinc v8.1 ( #11101 )
2022-07-08 17:52:41 +02:00
Adriane Boyd
397197ec0e
Extend to mypy<0.970 ( #11100 )
2022-07-08 14:58:01 +02:00
Madeesh Kannan
f38aff4ec9
Add examples for new explosion bot commands ( #11082 )
...
* Add examples for new explosion bot commands
* Update extra/DEVELOPER_DOCS/ExplosionBot.md
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2022-07-08 13:36:12 +02:00
Adriane Boyd
be9e17c0e4
Add docs for compiling with build constraints ( #11081 )
2022-07-08 11:45:56 +02:00
github-actions[bot]
e7fd06bdbe
Auto-format code with black ( #11099 )
...
Co-authored-by: explosion-bot <explosion-bot@users.noreply.github.com>
2022-07-08 18:43:25 +09:00
Paul O'Leary McCann
2eee0d248e
Fix types
...
mypy now exits without an error, except for two apparently unrelated
ones about setup.py.
2022-07-08 18:29:14 +09:00
Nipun Sadvilkar
86ee26e3c2
Use pull_request_target
event for spaCy universe GA trigger ( #11097 )
2022-07-07 19:43:50 +05:30
Nipun Sadvilkar
bb3e11b9a1
Github Action for spaCy universe project alert ( #11090 )
2022-07-07 17:50:30 +05:30
Kenneth Enevoldsen
7b220afc29
Added asent to spacy universe ( #11078 )
...
* Added asent to spacy universe
* Update addition of asent following correction
2022-07-07 13:25:25 +09:00
Schero1994
c7c3fb1d0c
Merge pull request #11074 from Schero1994/feature/remove
...
Batch #2 | spaCy universe cleanup
2022-07-06 10:39:04 +02:00
Paul O'Leary McCann
ce49136458
Update NotImplementedError for coref component
2022-07-06 17:28:15 +09:00
Paul O'Leary McCann
5e405738d2
Update span predictor docstrings
2022-07-06 17:28:05 +09:00
Paul O'Leary McCann
c4de3e51a2
Remove old TODOs
2022-07-06 17:23:41 +09:00
Paul O'Leary McCann
da9c379355
Update docs
...
Parameter names in architecture docs were not updated after parameters
were renamed.
2022-07-06 17:13:31 +09:00
Paul O'Leary McCann
6f5cf838ec
Remove _spans_to_offsets
...
Basically the same as get_clusters_from_doc
2022-07-06 14:05:05 +09:00
Paul O'Leary McCann
8f598d7b01
Feedback from code review
2022-07-06 14:03:09 +09:00
Paul O'Leary McCann
63e27b5e44
Update spacy/ml/models/coref_util.py
...
Co-authored-by: kadarakos <kadar.akos@gmail.com>
2022-07-06 13:46:02 +09:00
Daniël de Kok
a06cbae70d
precompute_hiddens/Parser: do not look up CPU ops (3.4) ( #11069 )
...
* precompute_hiddens/Parser: do not look up CPU ops
`get_ops("cpu")` is quite expensive. To avoid this, we want to cache the
result as in #11068 . However, for 3.x we do not want to change the ABI.
So we avoid the expensive lookup by using NumpyOps. This should have a
minimal impact, since `get_ops("cpu")` was only used when the model ops
were `CupyOps`. If the ops are `AppleOps`, we are still passing through
the correct BLAS implementation.
* _NUMPY_OPS -> NUMPY_OPS
2022-07-05 10:53:42 +02:00
Adriane Boyd
78a84f0d78
Support env var for num build jobs ( #11073 )
2022-07-04 20:50:16 +02:00
Madeesh Kannan
d36d66b7ca
Increase test deadline to 30 minutes to prevent spurious test failures ( #11070 )
...
* Increase test deadline to 30 minutes to prevent spurious test failures
* Reduce deadline to 2 minutes
2022-07-04 18:37:09 +02:00
kadarakos
5240baccfe
dont use get_array_module ( #11056 )
2022-07-04 17:15:33 +02:00
Raphael Mitsch
e9eb59699f
NEL confidence threshold ( #11016 )
...
* Add base for NEL abstention threshold mechanism.
* Add abstention threshold to entity linker. Add test.
* Fix entity linking tests.
* Changed abstention default threshold from 0 to None.
* Fix default values for abstention thresholds.
* Fix mypy errors.
* Replace assertion with raise of proper error code.
* Simplify threshold check. Remove thresholding from EntityLinker_v1.
* Rename test.
* Update spacy/pipeline/entity_linker.py
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
* Update spacy/pipeline/entity_linker.py
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
* Make E1043 configurable.
* Update docs.
* Rephrase description in docs. Adjusting error code message.
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2022-07-04 17:05:21 +02:00
schaeran
b3165db41b
remove universe object: spacy-langdetect
2022-07-04 16:07:18 +02:00
schaeran
4e8a5994df
remove universe object: NLPre
2022-07-04 16:06:58 +02:00
schaeran
0e4a835468
remove universe object: num_fh
2022-07-04 16:06:38 +02:00
schaeran
5000a08a20
remove universe object: adam_qas
2022-07-04 16:06:20 +02:00
schaeran
60a35a2bb2
remove universe object: spacy_kenlm
2022-07-04 16:06:02 +02:00
schaeran
224f30c563
remove universe object: spacy-raspberry
2022-07-04 16:05:34 +02:00
schaeran
a9062ebf17
remove universe object: spacy-lookup
2022-07-04 16:05:11 +02:00
schaeran
9b823fc9e9
remove universe object: NeuroNER
2022-07-04 16:04:50 +02:00
schaeran
b94bcaa62f
remove universe object: spacy-vis
2022-07-04 16:04:29 +02:00
schaeran
880e7db44e
remove universe object: spacy_grammar
2022-07-04 16:04:06 +02:00
schaeran
6c036d1e25
remove universe object: spacy_hunspell
2022-07-04 16:03:30 +02:00
Madeesh Kannan
59c763eec1
StringStore
-related optimizations (#10938 )
...
* `strings`: More roubust type checking of keys/IDs, coerce `int`-like types to `hash_t`
* Preserve existing public API behaviour
* Fix return type
* Replace `bool` with `bint`, rename to `_try_coerce_to_hash`, replace `id` with `hash`
* Avoid unnecessary re-encoding and re-calculation of strings and hashs respectively
* Rename variables named `hash`
Add comment on early return
2022-07-04 15:04:03 +02:00
Paul O'Leary McCann
178feae00a
Add tests to give up with whitespace differences
...
Docs in Examples are allowed to have arbitrarily different whitespace.
Handling that properly would be nice but isn't required, but for now
check for it and blow up.
2022-07-04 19:37:42 +09:00
Paul O'Leary McCann
c7f333d593
Rename spans2ints > _spans_to_offsets
2022-07-04 19:28:35 +09:00
Paul O'Leary McCann
b09bbc7f5e
Fix alignment issues
...
I believe this resolves issues with tokenization mismatches.
2022-07-03 20:11:03 +09:00
Paul O'Leary McCann
cf33b48fe0
Update tests
2022-07-03 20:10:53 +09:00
Paul O'Leary McCann
fd574a89c4
Update overfitting test
2022-07-03 19:34:15 +09:00
Paul O'Leary McCann
a46bc03abb
Add failing test with tokenization mismatch
...
This test only fails due to the explicity assert False at the moment,
but the debug output shows that the learned spans are all off by one due
to misalignment. So the code still needs fixing.
2022-07-03 16:01:27 +09:00
Paul O'Leary McCann
619b1102e6
Use config to specify tok2vec_size
2022-07-03 15:32:35 +09:00
Paul O'Leary McCann
1a4dbb702d
Add basic span predictor tests
2022-07-03 15:13:15 +09:00