Adriane Boyd
36ff2a5441
Merge pull request #11200 from adrianeboyd/chore/reenable-model-tests
...
Revert "Temporarily skip tests that require models/compat"
2022-07-25 20:13:44 +02:00
Adriane Boyd
e5990db713
Revert "Temporarily skip tests that require models/compat"
...
This reverts commit d9320db7db
.
2022-07-25 18:12:18 +02:00
Paul O'Leary McCann
1c12812d1a
Replace link to old label ( #11188 )
2022-07-25 16:39:34 +09:00
Adriane Boyd
7a99fe3c65
Move sent-patterns to correct section of universe.json ( #11192 )
2022-07-25 09:14:50 +02:00
0xpeIpeI
93960dc4b5
[universe project] create English interpretation project ( #11184 )
...
* [add] my universe project setting
* [modify] A few adjustments
* [Modify] change package description
2022-07-24 19:01:04 +09:00
Dan Radenkovic
a5aa3a818f
fix docs ( #11123 )
2022-07-24 17:16:36 +09:00
Lucas Terriel
7ff52c02a1
Update meta for spacyfishing in spaCy Universe ( #11185 )
...
* add new logo for spacyfishing to update spacy universe
* change logo location
2022-07-24 17:10:29 +09:00
Maarten Grootendorst
1caa2d1d16
Added BERTopic to Spacy Universe ( #11159 )
...
* Added BERTopic to Spacy Universe
* Fix no render of visualization
2022-07-19 19:37:18 +09:00
Madeesh Kannan
ba18d2913d
Morphology
/Morphologizer
optimizations and refactoring (#11024 )
...
* `Morphology`: Refactor to use C types, reduce allocations, remove unused code
* `Morphologzier`: Avoid unnecessary sorting of morpho features
* `Morphologizer`: Remove execessive reallocations of labels, improve hash lookups of labels, coerce `numpy` numeric types to native ints
Update docs
* Remove unused method
* Replace `unique_ptr` usage with `shared_ptr`
* Add type annotations to internal Python methods, rename `hash` variable, fix typos
* Add comment to clarify implementation detail
* Fix return type
* `Morphology`: Stop early when splitting fields and values
2022-07-15 11:14:08 +02:00
Adriane Boyd
2235e3520c
Update binder version in docs ( #11124 )
2022-07-12 15:20:33 +02:00
Nicolai Bjerre Pedersen
2fa983aa2e
Fix span typings ( #11119 )
...
Add id, id_ to span.pyi.
2022-07-12 13:47:35 +02:00
Adriane Boyd
11f859c132
Docs for v3.4 ( #11057 )
...
* Add draft of v3.4 usage
* Add Croatian models
* Add Matcher min/max
* Update release notes
* Minor edits
* Add updates, tables
* Update pydantic/mypy versions
* Update version in README
* Fix sidebar
2022-07-11 15:36:31 +02:00
Adriane Boyd
d583626a82
Update build setup for aarch64 ( #11112 )
...
* Extend build constraints for aarch64
* Skip mypy for aarch64
2022-07-11 13:29:35 +02:00
Adriane Boyd
5cb6f1ae51
CI: Install with two parallel build jobs ( #11111 )
2022-07-11 12:20:00 +02:00
Adriane Boyd
3701039c1f
Tweak build jobs setting, update install docs ( #11077 )
...
* Restrict SPACY_NUM_BUILD_JOBS to only override if set
* Update install docs
2022-07-08 19:21:17 +02:00
Peter Baumgartner
36cb2029a9
displaCy Spans Vertical Alignment Fix 2 ( #11092 )
...
* add in span render slot fix
* fix spacing off by one
* rm demo
* adjust comments
* fix whitespace and overlap issue
2022-07-08 19:20:13 +02:00
Richard Hudson
dc38a0f079
Change demo URL ( #11102 )
2022-07-08 19:19:48 +02:00
Adriane Boyd
66d6461c8f
Use thinc v8.1 ( #11101 )
2022-07-08 17:52:41 +02:00
Adriane Boyd
397197ec0e
Extend to mypy<0.970 ( #11100 )
2022-07-08 14:58:01 +02:00
Madeesh Kannan
f38aff4ec9
Add examples for new explosion bot commands ( #11082 )
...
* Add examples for new explosion bot commands
* Update extra/DEVELOPER_DOCS/ExplosionBot.md
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2022-07-08 13:36:12 +02:00
Adriane Boyd
be9e17c0e4
Add docs for compiling with build constraints ( #11081 )
2022-07-08 11:45:56 +02:00
github-actions[bot]
e7fd06bdbe
Auto-format code with black ( #11099 )
...
Co-authored-by: explosion-bot <explosion-bot@users.noreply.github.com>
2022-07-08 18:43:25 +09:00
Nipun Sadvilkar
86ee26e3c2
Use pull_request_target
event for spaCy universe GA trigger ( #11097 )
2022-07-07 19:43:50 +05:30
Nipun Sadvilkar
bb3e11b9a1
Github Action for spaCy universe project alert ( #11090 )
2022-07-07 17:50:30 +05:30
Kenneth Enevoldsen
7b220afc29
Added asent to spacy universe ( #11078 )
...
* Added asent to spacy universe
* Update addition of asent following correction
2022-07-07 13:25:25 +09:00
Schero1994
c7c3fb1d0c
Merge pull request #11074 from Schero1994/feature/remove
...
Batch #2 | spaCy universe cleanup
2022-07-06 10:39:04 +02:00
Daniël de Kok
a06cbae70d
precompute_hiddens/Parser: do not look up CPU ops (3.4) ( #11069 )
...
* precompute_hiddens/Parser: do not look up CPU ops
`get_ops("cpu")` is quite expensive. To avoid this, we want to cache the
result as in #11068 . However, for 3.x we do not want to change the ABI.
So we avoid the expensive lookup by using NumpyOps. This should have a
minimal impact, since `get_ops("cpu")` was only used when the model ops
were `CupyOps`. If the ops are `AppleOps`, we are still passing through
the correct BLAS implementation.
* _NUMPY_OPS -> NUMPY_OPS
2022-07-05 10:53:42 +02:00
Adriane Boyd
78a84f0d78
Support env var for num build jobs ( #11073 )
2022-07-04 20:50:16 +02:00
Madeesh Kannan
d36d66b7ca
Increase test deadline to 30 minutes to prevent spurious test failures ( #11070 )
...
* Increase test deadline to 30 minutes to prevent spurious test failures
* Reduce deadline to 2 minutes
2022-07-04 18:37:09 +02:00
kadarakos
5240baccfe
dont use get_array_module ( #11056 )
2022-07-04 17:15:33 +02:00
Raphael Mitsch
e9eb59699f
NEL confidence threshold ( #11016 )
...
* Add base for NEL abstention threshold mechanism.
* Add abstention threshold to entity linker. Add test.
* Fix entity linking tests.
* Changed abstention default threshold from 0 to None.
* Fix default values for abstention thresholds.
* Fix mypy errors.
* Replace assertion with raise of proper error code.
* Simplify threshold check. Remove thresholding from EntityLinker_v1.
* Rename test.
* Update spacy/pipeline/entity_linker.py
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
* Update spacy/pipeline/entity_linker.py
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
* Make E1043 configurable.
* Update docs.
* Rephrase description in docs. Adjusting error code message.
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2022-07-04 17:05:21 +02:00
schaeran
b3165db41b
remove universe object: spacy-langdetect
2022-07-04 16:07:18 +02:00
schaeran
4e8a5994df
remove universe object: NLPre
2022-07-04 16:06:58 +02:00
schaeran
0e4a835468
remove universe object: num_fh
2022-07-04 16:06:38 +02:00
schaeran
5000a08a20
remove universe object: adam_qas
2022-07-04 16:06:20 +02:00
schaeran
60a35a2bb2
remove universe object: spacy_kenlm
2022-07-04 16:06:02 +02:00
schaeran
224f30c563
remove universe object: spacy-raspberry
2022-07-04 16:05:34 +02:00
schaeran
a9062ebf17
remove universe object: spacy-lookup
2022-07-04 16:05:11 +02:00
schaeran
9b823fc9e9
remove universe object: NeuroNER
2022-07-04 16:04:50 +02:00
schaeran
b94bcaa62f
remove universe object: spacy-vis
2022-07-04 16:04:29 +02:00
schaeran
880e7db44e
remove universe object: spacy_grammar
2022-07-04 16:04:06 +02:00
schaeran
6c036d1e25
remove universe object: spacy_hunspell
2022-07-04 16:03:30 +02:00
Madeesh Kannan
59c763eec1
StringStore
-related optimizations (#10938 )
...
* `strings`: More roubust type checking of keys/IDs, coerce `int`-like types to `hash_t`
* Preserve existing public API behaviour
* Fix return type
* Replace `bool` with `bint`, rename to `_try_coerce_to_hash`, replace `id` with `hash`
* Avoid unnecessary re-encoding and re-calculation of strings and hashs respectively
* Rename variables named `hash`
Add comment on early return
2022-07-04 15:04:03 +02:00
Paul O'Leary McCann
7c1bf2fa1f
Merge pull request #11062 from explosion/autoblack
...
Auto-format code with black
2022-07-03 14:35:53 +09:00
explosion-bot
7e55a51314
Auto-format code with black
2022-07-01 08:04:32 +00:00
Paul O'Leary McCann
e8fdbfc65e
Minor fix in Lemmatizer docs
2022-07-01 14:28:03 +09:00
Madeesh Kannan
eaf66e7431
Add NVTX ranges to TrainablePipe
components ( #10965 )
...
* `TrainablePipe`: Add NVTX range decorator
* Annotate `TrainablePipe` subclasses with NVTX ranges
* Export function signature to allow introspection of args in tests
* Revert "Annotate `TrainablePipe` subclasses with NVTX ranges"
This reverts commit d8684f7372
.
* Revert "Export function signature to allow introspection of args in tests"
This reverts commit f4405ca3ad
.
* Revert "`TrainablePipe`: Add NVTX range decorator"
This reverts commit 26536eb6b8
.
* Add `spacy.pipes_with_nvtx_range` pipeline callback
* Show warnings for all missing user-defined pipe functions that need to be annotated
Fix imports, typos
* Rename `DEFAULT_ANNOTATABLE_PIPE_METHODS` to `DEFAULT_NVTX_ANNOTATABLE_PIPE_METHODS`
Reorder import
* Walk model nodes directly whilst applying NVTX ranges
Ignore pipe method wrapper when applying range
2022-06-30 11:28:12 +02:00
Adriane Boyd
3fe9f47de4
Revert "disable failing test because Stanford servers are down ( #11015 )" ( #11054 )
...
This reverts commit f8116078ce
.
2022-06-30 11:24:54 +02:00
Adriane Boyd
3bc1fe0a78
Update cupy extras ( #11055 )
...
* Add cuda116 and cuda117 extras
* Revert "remove `cuda116` extra from install widget (#11012 )"
This reverts commit e7b498fb1f
.
* Add cuda117 to quickstart
2022-06-30 11:24:37 +02:00
Shen Qin
be00db6645
Addition of min_max quantifier in matcher {n,m} ( #10981 )
...
* Min_max_operators
1. Modified API and Usage for spaCy website to include min_max operator
2. Modified matcher.pyx to include min_max function {n,m} and its variants
3. Modified schemas.py to include min_max validation error
4. Added test cases to test_matcher_api.py, test_matcher_logic.py and test_pattern_validation.py
* attempt to fix mypy/pydantic compat issue
* formatting
* Update spacy/tests/matcher/test_pattern_validation.py
Co-authored-by: Source-Shen <82353723+Source-Shen@users.noreply.github.com>
Co-authored-by: svlandeg <svlandeg@github.com>
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2022-06-30 11:01:58 +02:00