Sofie Van Landeghem
8e6a3d58d8
fix typo ( #12543 )
2023-04-19 10:59:33 +02:00
TAN Long
923d24e885
perf(REL_OP): Replace some token.children with token.rights or token.lefts ( #12528 )
...
Co-authored-by: Tan Long <tanloong@foxmail.com>
2023-04-17 13:16:34 +02:00
TAN Long
8249ce3947
docs(REL_OP): modify docs for REL_OPs to match Semgrex's update on CoreNLP v4.5.2 ( #12531 )
...
Co-authored-by: Tan Long <tanloong@foxmail.com>
2023-04-17 13:14:26 +02:00
TAN Long
119f959218
docs(REL_OP): modify docs for REL_OPs to match Semgrex's update on CoreNLP v4.5.2 ( #12531 )
...
Co-authored-by: Tan Long <tanloong@foxmail.com>
2023-04-17 13:14:01 +02:00
andyjessen
02259fa195
Add category to spaCy project ( #12506 )
...
ScispaCy fits within biomedical domain. Consider adding this category.
2023-04-07 15:31:04 +02:00
Madeesh Kannan
027812a914
Docs
: Fix rule-based matching example that expands named entities (#12495 )
2023-04-06 11:46:35 +02:00
Edward
ebabd90ae5
Add more information to custom code docs ( #12491 )
...
* Add info to sections
* Update website/docs/usage/training.mdx
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-04-06 11:46:27 +02:00
Madeesh Kannan
6db20b354f
Docs
: Fix rule-based matching example that expands named entities (#12495 )
2023-04-06 11:45:58 +02:00
Edward
c95d320d28
Add more information to custom code docs ( #12491 )
...
* Add info to sections
* Update website/docs/usage/training.mdx
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-04-06 11:45:19 +02:00
Will Frey
215c86955e
Fix invalid ConsoleLogger.v3 example config ( #12498 )
...
Replace `progress_bar = "all_steps"` with `progress_bar = "eval"`, which is consistent with the default behavior for `spacy.ConsoleLogger.v1` and `spacy.ConsoleLogger.v2`.
2023-04-04 20:56:05 +02:00
Will Frey
8d4129e177
Fix invalid ConsoleLogger.v3 example config ( #12498 )
...
Replace `progress_bar = "all_steps"` with `progress_bar = "eval"`, which is consistent with the default behavior for `spacy.ConsoleLogger.v1` and `spacy.ConsoleLogger.v2`.
2023-04-04 20:53:07 +02:00
Edward
de32011e4c
Add model-last saving mechanism to pretraining ( #12459 )
...
* Adjust pretrain command
* chane naming and add finally block
* Add unit test
* Add unit test assertions
* Update spacy/training/pretrain.py
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* change finally block
* Add to docs
* Update website/docs/usage/embeddings-transformers.mdx
* Add flag to skip saving model-last
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-04-03 15:24:03 +02:00
Adriane Boyd
4a1ec332de
Add Span.kb_id/Span.id strings to Doc/DocBin serialization if set ( #12493 )
...
* Add Span.kb_id/Span.id strings to Doc/DocBin serialization if set
* Format
2023-04-03 15:11:12 +02:00
Adriane Boyd
4538ceb507
Remove redundant strings.add for Doc.char_span ( #12429 )
2023-04-03 11:38:56 +02:00
Adriane Boyd
476a2e7a0a
Allow cupy 12.0 for extras ( #12490 )
2023-03-31 13:48:15 +02:00
Adriane Boyd
69e20ce03d
Fix pickle for ngram suggester ( #12486 )
2023-03-31 13:43:51 +02:00
Adriane Boyd
140d53649d
Convert values to numpy for label smoothing tests ( #12472 )
2023-03-31 13:41:41 +02:00
Ye Lei (叶磊)
ce258670b7
Allow passing a Span to displacy.parse_deps ( #12477 )
...
* Allow passing a Span to displacy.parse_deps
* Update docstring
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update API docs
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-31 09:44:01 +02:00
Raphael Mitsch
d85df9d577
Fix Span.sents for edge case of Span being the only Span in the last sentence of a Doc. ( #12484 )
2023-03-29 18:54:47 +02:00
kadarakos
372a90885e
Fix spancat-singlelabel score ( #12469 )
...
* debug argmax sort and add span scores
* add missing tests for spanscores
2023-03-29 08:38:11 +02:00
Edward
08e9f2b98e
Add info to stringstore and vocab ( #12471 )
2023-03-27 13:15:36 +02:00
Edward
dba4e7bece
Add info to stringstore and vocab ( #12471 )
2023-03-27 13:15:14 +02:00
Adriane Boyd
2fba21be63
Restrict github workflows to explosion ( #12470 )
2023-03-27 12:44:04 +02:00
sloev / Johannes Valbjørn
0cd5980b53
add spacy_onnx_sentiment_english to universe ( #12422 )
...
* add spacy_onnx_sentiment_english to universe
* rename to sentimental-onix
* fix comma json error
* fix typo
* typo fix
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* mention need to download model before example works
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-27 11:35:32 +02:00
sloev / Johannes Valbjørn
fd072533e7
add spacy_onnx_sentiment_english to universe ( #12422 )
...
* add spacy_onnx_sentiment_english to universe
* rename to sentimental-onix
* fix comma json error
* fix typo
* typo fix
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* mention need to download model before example works
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-27 11:35:14 +02:00
Prajakta Darade
5a9c8aef8f
corrected example code ( #12466 )
2023-03-27 11:33:22 +02:00
Prajakta Darade
ae7779e830
corrected example code ( #12466 )
2023-03-27 11:32:49 +02:00
kadarakos
c48024159d
add explanation about overwriting behaviour ( #12464 )
...
* add explanation about overwriting behaviour
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* format
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-27 10:27:31 +02:00
kadarakos
d1474fdd91
add explanation about overwriting behaviour ( #12464 )
...
* add explanation about overwriting behaviour
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* format
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-27 10:27:11 +02:00
Adriane Boyd
fac457a509
Support floret for PretrainVectors ( #12435 )
...
* Support floret for PretrainVectors
* Format
2023-03-24 16:28:51 +01:00
Adriane Boyd
d0bd3f5ee4
Update Serbian tokenization for UD Serbian SET ( #12442 )
2023-03-24 16:26:40 +01:00
Vinit Ravishankar
28de85737f
Tagger label smoothing ( #12293 )
...
* add label smoothing
* use True/False instead of floats
* add entropy to debug data
* formatting
* docs
* change test to check difference in distributions
* Update website/docs/api/tagger.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update spacy/pipeline/tagger.pyx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* bool -> float
* update docs
* fix seed
* black
* update tests to use label_smoothing = 0.0
* set default to 0.0, update quickstart
* Update spacy/pipeline/tagger.pyx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* update morphologizer, tagger test
* fix morph docs
* add url to docs
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-22 12:17:56 +01:00
Ines Montani
f08eedb89d
Add user survey alert to the top ( #12452 )
...
* Add user survey alert to the top
* Shorter
---------
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-03-22 11:37:08 +01:00
Ines Montani
b479f8bfa5
Add user survey alert to the top ( #12452 )
...
* Add user survey alert to the top
* Shorter
---------
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-03-22 11:09:37 +01:00
Adriane Boyd
54c614e116
CI: Separate spacy universe validation into a separate workflow ( #12440 )
...
* Separate spacy universe validation into a separate workflow
* Fix new workflow name
2023-03-17 10:59:53 +01:00
Adriane Boyd
5f72d6c836
CI: Switch PR back to paths-ignore ( #12438 )
...
Switch PR tests back to paths-ignore but include changes to `.github`
for all PRs rather than trying to figure out complicated
includes+excludes. Changes to `.github` are relatively rare and should
not be a huge burden for the CI.
2023-03-17 10:01:49 +01:00
Adriane Boyd
4c5a3a2a7b
Remove autoblack workflow ( #12437 )
...
Now that all PRs have `black` formatting validation, we no longer need the
autoblack workflow.
2023-03-17 09:35:00 +01:00
Raphael Mitsch
96b61d0671
Fix EL failure with sentence-crossing entities ( #12398 )
...
* Add test reproducing EL failure in sentence-crossing entities.
* Format.
* Draft fix.
* Format.
* Fix case for len(ent.sents) == 1.
* Format.
* Format.
* Format.
* Fix mypy error.
* Merge EL sentence crossing tests.
* Remove unneeded sentencizer component.
* Fix or ignore mypy issues in test.
* Simplify ent.sents handling.
* Format. Update assert in ent.sents handling.
* Small rewrite
---------
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-03-14 22:02:49 +01:00
Adriane Boyd
2ce9a220db
Fix --verbose for spacy find-threshold ( #12418 )
2023-03-14 17:16:49 +01:00
Adriane Boyd
377f601bff
CI: Add all paths before excluding patterns ( #12419 )
2023-03-14 16:06:08 +01:00
Raphael Mitsch
e8cab4625c
Fix sentence indexing bug in Span.sents
( #12405 )
...
* Add test for partial sentences in ent.sents.
* Removed unneeded import.
* Format. Simplify code.
2023-03-14 10:21:53 +01:00
Adriane Boyd
ea6de64596
CI: Move CLI tests to ubuntu for speed ( #12409 )
2023-03-13 15:14:46 +01:00
Adriane Boyd
8f1280a514
Fix thinc-apple-ops test to run for python 3.11 ( #12408 )
2023-03-13 15:10:04 +01:00
Adriane Boyd
8ff9073161
CI: Move universe validation to validate job ( #12406 )
...
* CI: Move universe validation to validate job
* Fix indentation
* Update step name
2023-03-13 14:21:17 +01:00
Adriane Boyd
3c999f052e
Add GHA for CI tests ( #12403 )
...
* Add GHA for CI tests
* Reorder paths
2023-03-13 13:13:47 +01:00
Adriane Boyd
4d0fb098ee
Merge branch 'v3.5.x' into spacy.io
2023-03-10 09:41:00 +01:00
Adriane Boyd
8153bd573f
Merge pull request #12395 from adrianeboyd/backport/v3.5.1-2
...
Skip project clone tests if git is not available (#12394 )
2023-03-09 17:45:32 +01:00
Adriane Boyd
83056bb44c
Skip project clone tests if git is not available ( #12394 )
2023-03-09 16:42:33 +01:00
Adriane Boyd
f27bce67fd
Skip project clone tests if git is not available ( #12394 )
2023-03-09 16:41:21 +01:00
Adriane Boyd
03b320b3bd
Set version to v3.5.1 ( #12393 )
2023-03-09 12:40:28 +01:00