shadeMe
fbc299cfe9
Remove outdated info about default config
2023-08-24 13:29:53 +02:00
shadeMe
001926db50
Remove TODO comment
2023-08-23 11:13:44 +02:00
shadeMe
feda7fa63c
Merge branch 'website/curated-docs' of github.com:vin-ivar/spaCy into pr/vin-ivar/12677
2023-08-14 13:53:39 +02:00
shadeMe
8f1b3bdb4b
More formatting fixes
2023-08-14 13:52:40 +02:00
shadeMe
317816d689
Formatting fixes
2023-08-14 13:48:58 +02:00
shadeMe
0d9aa48865
Fill in the DocTransformerOutput section
2023-08-14 13:48:45 +02:00
shadeMe
921be30331
Add text to the model/tokenizer loader sections
2023-08-14 13:47:50 +02:00
shadeMe
eca4555c88
Remove placeholder text
2023-08-14 13:17:24 +02:00
shadeMe
2d834d12a5
Fix links
2023-08-14 13:16:27 +02:00
shadeMe
74b029d45e
Remove hyphen from the curated-transformers.mdx filename
2023-08-14 13:05:47 +02:00
shadeMe
979b723993
Update CLI command name, args
2023-08-14 13:03:02 +02:00
Madeesh Kannan
de136d6408
Apply suggestions from code review
...
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-08-11 10:37:21 +02:00
shadeMe
ca14547803
Add a note to the architectures section about the init fill-config-transformer CLI command
2023-08-09 12:40:28 +02:00
shadeMe
1d0604939b
Fix source URL
2023-08-08 14:26:32 +02:00
shadeMe
6e0f537c04
Update intro section of the pipeline component docs
2023-08-08 14:21:18 +02:00
shadeMe
13e1d8ca90
Update CLI command invocation syntax
2023-08-08 13:56:40 +02:00
shadeMe
3ab669ae6a
Add docs for init fill-config-transformer
2023-08-08 13:44:35 +02:00
shadeMe
0a0476cbfd
Fix transformer listener naming
2023-08-08 13:17:20 +02:00
shadeMe
0d2be9e96c
Set curated transformers API version to 3.7
2023-08-08 13:14:28 +02:00
shadeMe
fa809443de
Change debug pieces version tag to 3.7
2023-08-08 13:09:53 +02:00
shadeMe
d80e120779
Fix copy-paste typo
2023-08-08 13:08:43 +02:00
shadeMe
3bbd25ce8e
Merge branch 'website/curated-docs' of github.com:vin-ivar/spaCy into pr/vin-ivar/12677
2023-08-07 16:58:53 +02:00
shadeMe
985c1495dd
Remove type aliases
2023-08-07 16:58:50 +02:00
Madeesh Kannan
121c64818c
Doc fixes
...
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-08-07 16:26:19 +02:00
shadeMe
cca478152e
Fix duplicate entries in tables
2023-07-20 16:05:42 +02:00
shadeMe
a775fa25ad
Remove spacy-transformers-specific warning
2023-07-20 13:46:32 +02:00
shadeMe
d8722877cb
Fix piece_encoder entries
2023-07-20 13:15:37 +02:00
Madeesh Kannan
a282aec814
Remove mentions of Torchscript and quantization
...
Both are disabled in the initial release of `spacy-curated-transformers`.
2023-07-20 12:54:43 +02:00
vinit
b48ab353a1
fix typo
2023-05-26 14:48:02 +02:00
vinit
a633b88ef2
initial documentation run
2023-05-26 11:46:34 +02:00
vinit
1cbad4f3c9
initial
2023-05-24 17:24:49 +02:00
Victoria
6930a6bf45
Add spaCy VSCode extension materials ( #12592 )
2023-05-19 14:38:53 +02:00
Lj Miranda
58779c24ef
Remove shorthand for output-file in spacy apply ( #12636 )
...
The output-file argument is positional, so can't use a shorthand like -o.
2023-05-17 12:36:29 +02:00
Adriane Boyd
3dc445df8d
Fix new tags in docs for v3.5.x ( #12629 )
...
* Fix new tags in docs for v3.5.x
* Fix new tag
2023-05-15 12:06:58 +02:00
Basile Dura
2dd8825f09
docs: add comment on offset_x argument ( #12630 )
2023-05-15 11:42:47 +02:00
Adriane Boyd
3637148c4d
Add scorer option to return per-component scores ( #12540 )
...
* Add scorer option to return per-component scores
Add `per_component` option to `Language.evaluate` and `Scorer.score` to
return scores keyed by `tokenizer` (hard-coded) or by component name.
Add option to `evaluate` CLI to score by component. Per-component scores
can only be saved to JSON.
* Update help text and messages
2023-05-12 15:36:54 +02:00
Kenneth Enevoldsen
88680a6eed
docs: remove invalid huggingface-hub push argument ( #12624 )
2023-05-12 09:40:28 +02:00
Kenneth Enevoldsen
73698326df
Update inmemorylookupkb.mdx ( #12586 )
...
Example does not refer to the in memory lookup
2023-05-02 12:51:13 +02:00
Adriane Boyd
b60b027927
Add default option to MorphAnalysis.get ( #12545 )
...
* Add default to MorphAnalysis.get
Similar to `dict`, allow a `default` option for `MorphAnalysis.get` for
the user to provide a default return value if the field is not found.
The default return value remains `[]`, which is not the same as
`dict.get`, but is already established as this method's default return
value with the return type `List[str]`. However the new `default` option
does not enforce that the user-provided default is actually `List[str]`.
* Restore test case
2023-04-20 14:06:32 +02:00
TAN Long
119f959218
docs(REL_OP): modify docs for REL_OPs to match Semgrex's update on CoreNLP v4.5.2 ( #12531 )
...
Co-authored-by: Tan Long <tanloong@foxmail.com>
2023-04-17 13:14:01 +02:00
Madeesh Kannan
6db20b354f
Docs: Fix rule-based matching example that expands named entities (#12495 )
2023-04-06 11:45:58 +02:00
Edward
c95d320d28
Add more information to custom code docs ( #12491 )
...
* Add info to sections
* Update website/docs/usage/training.mdx
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-04-06 11:45:19 +02:00
Will Frey
8d4129e177
Fix invalid ConsoleLogger.v3 example config ( #12498 )
...
Replace `progress_bar = "all_steps"` with `progress_bar = "eval"`, which is consistent with the default behavior for `spacy.ConsoleLogger.v1` and `spacy.ConsoleLogger.v2`.
2023-04-04 20:53:07 +02:00
Edward
de32011e4c
Add model-last saving mechanism to pretraining ( #12459 )
...
* Adjust pretrain command
* chane naming and add finally block
* Add unit test
* Add unit test assertions
* Update spacy/training/pretrain.py
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* change finally block
* Add to docs
* Update website/docs/usage/embeddings-transformers.mdx
* Add flag to skip saving model-last
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-04-03 15:24:03 +02:00
Ye Lei (叶磊)
ce258670b7
Allow passing a Span to displacy.parse_deps ( #12477 )
...
* Allow passing a Span to displacy.parse_deps
* Update docstring
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update API docs
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-31 09:44:01 +02:00
Edward
dba4e7bece
Add info to stringstore and vocab ( #12471 )
2023-03-27 13:15:14 +02:00
Prajakta Darade
ae7779e830
corrected example code ( #12466 )
2023-03-27 11:32:49 +02:00
kadarakos
d1474fdd91
add explanation about overwriting behaviour ( #12464 )
...
* add explanation about overwriting behaviour
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update website/docs/api/spancategorizer.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* format
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-27 10:27:11 +02:00
Vinit Ravishankar
28de85737f
Tagger label smoothing ( #12293 )
...
* add label smoothing
* use True/False instead of floats
* add entropy to debug data
* formatting
* docs
* change test to check difference in distributions
* Update website/docs/api/tagger.mdx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* Update spacy/pipeline/tagger.pyx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* bool -> float
* update docs
* fix seed
* black
* update tests to use label_smoothing = 0.0
* set default to 0.0, update quickstart
* Update spacy/pipeline/tagger.pyx
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
* update morphologizer, tagger test
* fix morph docs
* add url to docs
---------
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2023-03-22 12:17:56 +01:00
Adriane Boyd
2ce9a220db
Fix --verbose for spacy find-threshold ( #12418 )
2023-03-14 17:16:49 +01:00