Adriane Boyd
0fb1881f36
Reformat processing pipelines
2021-03-18 13:31:42 +01:00
Adriane Boyd
acc58719da
Update custom similarity hooks example
2021-03-18 13:31:42 +01:00
Adriane Boyd
c9e1a9ac17
Add multiprocessing section
2021-03-18 13:31:42 +01:00
Adriane Boyd
9a254d3995
Include all en_core_web_sm components in examples
2021-03-18 13:31:42 +01:00
bsweileh
61472e7cb3
Update _training.md - Fix broken link on backpropagation ( #7431 )
* Update _training.md
Fix broken link on backpropagation
* Add agreement
add spacy contributor agreement
2021-03-15 09:21:35 +01:00
Adriane Boyd
d746ea6278
Add warning about GPU selection in Jupyter notebooks ( #7075 )
* Initial warning
* Update check
* Redo edit
* Move jupyter warning to helper method
* Add link with details to warnings
2021-03-09 15:35:21 +01:00
Sofie Van Landeghem
932887b950
textcat scoring fix and multi_label docs ( #6974 )
* add multi-label textcat to menu
* add infobox on textcat API
* add info to v3 migration guide
* small edits
* further fixes in doc strings
* add infobox to textcat architectures
* add textcat_multilabel to overview of built-in components
* spelling
* fix unrelated warn msg
* Add textcat_multilabel to quickstart [ci skip]
* remove separate documentation page for multilabel_textcategorizer
* small edits
* positive label clarification
* avoid duplicating information in self.cfg and fix textcat.score
* fix multilabel textcat too
* revert threshold to storage in cfg
* revert threshold stuff for multi-textcat
Co-authored-by: Ines Montani <ines@ines.io>
2021-03-09 23:04:22 +11:00
Ines Montani
dfb23a419e
Merge branch 'spacy.io' [ci skip]
2021-03-06 17:38:54 +11:00
graue70
7d085d5b1c
Fix typo in docs
2021-03-05 18:30:09 +01:00
svlandeg
d900c55061
consistently use registry as callable
2021-03-02 17:56:28 +01:00
svlandeg
08fd901a1b
kb.get_candidates renamed to get_alias_candidates
2021-02-25 20:09:36 +01:00
Ines Montani
24cecbb3f4
Merge pull request #7126 from adrianeboyd/docs/gpu-id-opt [ci skip]
Add tip about --gpu-id to training quickstart
2021-02-24 22:34:17 +11:00
Tocic
b1996a51a1
fix typo in models.md ( #7157 )
2021-02-22 09:00:38 +01:00
Adriane Boyd
7198be0f4b
Add tip about --gpu-id to training quickstart
2021-02-19 14:07:51 +01:00
Sofie Van Landeghem
709c9e75af
span.ent only returns first sentence ( #7084 )
* return first sentence when span contains sentence boundary
* docs fix
* small fixes
* cleanup
2021-02-19 23:02:38 +11:00
palandlom
9b82586699
var batch is useless ( #7111 )
It seems that nlp.update(examples) should be nlp.update(batch)
2021-02-18 09:44:22 +01:00
Ines Montani
fc4fb6eb3a
Make v2.x docs more prominent [ci skip]
2021-02-17 23:42:27 +11:00
Ines Montani
c08b3f294c
Support env vars and CLI overrides for project.yml
2021-02-10 13:45:27 +11:00
svlandeg
9a7f33c916
final 3.0 benchmark numbers
2021-02-09 21:28:33 +01:00
svlandeg
bb7482bef8
fix link
2021-02-08 18:39:59 +01:00
Ines Montani
433835d9b0
Merge pull request #6889 from adrianeboyd/docs/source-install-dup [ci skip]
2021-02-05 13:35:16 +11:00
Ines Montani
2cdfcd2d19
Update naming [ci skip]
2021-02-03 12:48:31 +11:00
Adriane Boyd
37a68a06ab
Update to recommend editable installs for source installs
2021-02-02 16:51:27 +01:00
Adriane Boyd
3a3e4daf60
Update install instructions
* Remove duplicate section about compiling from source
2021-02-02 14:44:15 +01:00
Pengcheng YIN
6fdc33203a
Fix a typo
2021-02-01 17:26:28 -05:00
Ines Montani
a59f3fcf5d
Make wheel the default format and update docs [ci skip]
2021-02-01 23:18:43 +11:00
Ines Montani
31b842d6ce
Update table [ci skip]
2021-02-01 14:17:52 +11:00
Ines Montani
7752f80f39
Update docs [ci skip]
2021-01-31 16:11:24 +11:00
Ines Montani
a8a1231ccd
Update README and docs [ci skip]
2021-01-31 12:36:04 +11:00
Ines Montani
ae07416fda
Merge branch 'website/v3-launch' into develop
2021-01-30 20:31:06 +11:00
Ines Montani
2332c4280b
Update and use unified --build option
2021-01-30 13:11:36 +11:00
Ines Montani
2609ba4e89
Support building wheel in spacy package
2021-01-30 11:54:02 +11:00
Ines Montani
95e958a229
Merge pull request #6852 from explosion/feature/replace-listeners
2021-01-30 00:58:08 +11:00
Ines Montani
7694f76dd1
Update warning and mention replace_listeners
2021-01-29 23:46:01 +11:00
Adriane Boyd
8b76cb8095
Rephrase transformers PyTorch instructions
2021-01-29 13:36:56 +01:00
Adriane Boyd
e3e87e7275
Update transformers install docs
* Recommend installing PyTorch separately
* Add instructions for `sentencepiece`
2021-01-29 13:27:43 +01:00
Ines Montani
99af9e7125
Update documentation
2021-01-29 18:45:48 +11:00
Ines Montani
35d79c0a5d
Adjust formatting [ci skip]
2021-01-27 13:31:25 +11:00
Ines Montani
5d79d1af50
Merge pull request #6796 from svlandeg/docs/benchmarks [ci skip]
2021-01-27 13:01:23 +11:00
Ines Montani
1ed7029d47
Update website for v3 launch
2021-01-27 12:39:47 +11:00
Adriane Boyd
61c9f8bf24
Remove transformers model max length section ( #6807 )
2021-01-25 19:59:34 +08:00
svlandeg
56064faed9
update caption
2021-01-23 00:57:00 +01:00
svlandeg
d7c0f40a96
update comment
2021-01-22 18:55:18 +01:00
svlandeg
a071279bc7
add speed comparison to docs
2021-01-22 18:46:35 +01:00
svlandeg
b132cb3036
update accuracies for new a1 models
2021-01-21 20:24:05 +01:00
Sofie Van Landeghem
e680efc7cc
Set annotations in update ( #6767 )
* bump to 3.0.0rc4
* do set_annotations in component update calls
* update docs and remove set_annotations flag
* fix EL test
2021-01-20 11:49:25 +11:00
Sofie Van Landeghem
57640aa838
warn when frozen components break listener pattern ( #6766 )
* warn when frozen components break listener pattern
* few notes in the documentation
* update arg name
* formatting
* cleanup
* specify listeners return type
2021-01-20 11:12:35 +11:00
Ines Montani
4a1029a9b6
Add infobox [ci skip]
2021-01-19 19:18:39 +11:00
Sofie Van Landeghem
fed8f48965
raise NotImplementedError when noun_chunks iterator is not implemented ( #6711 )
* raise NotImplementedError when noun_chunks iterator is not implemented
* bring back, fix and document span.noun_chunks
* formatting
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2021-01-17 19:56:05 +08:00
Adriane Boyd
bf0cdae8d4
Add token_splitter component ( #6726 )
* Add long_token_splitter component
Add a `long_token_splitter` component for use with transformer
pipelines. This component splits up long tokens like URLs into smaller
tokens. This is particularly relevant for pretrained pipelines with
`strided_spans`, since the user can't change the length of the span
`window` and may not wish to preprocess the input texts.
The `long_token_splitter` splits tokens that are at least
`long_token_length` characters long into smaller tokens of `split_length`
characters each.
Notes:
* Since this is intended for use as the first component in a pipeline,
the token splitter does not try to preserve any token annotation.
* API docs to come when the API is stable.
* Adjust API, add test
* Fix name in factory
2021-01-17 19:54:41 +08:00
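The splitting rule described in the `long_token_splitter` commit above can be sketched in plain Python. This is a minimal illustration, not the component's actual implementation: the function `split_long_token` is hypothetical, it assumes that lengths are measured in characters, and it borrows the parameter names `long_token_length` and `split_length` from the commit message. It works on a single string and, like the component as described, makes no attempt to preserve token annotation.

```python
from typing import List


def split_long_token(text: str, long_token_length: int, split_length: int) -> List[str]:
    """Split one token's text into fixed-size pieces if it is long enough.

    Hypothetical helper: lengths are assumed to be counted in characters.
    """
    if split_length <= 0:
        raise ValueError("split_length must be a positive integer")
    # Tokens shorter than the threshold are returned unchanged.
    if len(text) < long_token_length:
        return [text]
    # Otherwise cut the text into consecutive pieces of split_length
    # characters; the final piece may be shorter.
    return [text[i : i + split_length] for i in range(0, len(text), split_length)]


pieces = split_long_token("abcdefghij", long_token_length=8, split_length=4)
unchanged = split_long_token("abc", long_token_length=8, split_length=4)
```

Here `pieces` holds the three substrings of the ten-character input, and `unchanged` holds the short input as a single-element list. Joining the pieces always reproduces the original text, so no characters are lost by the split.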