Ines Montani
068b97a617
Merge pull request #7408 from adrianeboyd/bugfix/load-keyword-only
2021-03-13 04:25:50 +01:00
Adriane Boyd
3168103605
Fix type of spacy train --output in docs
2021-03-12 10:04:57 +01:00
Adriane Boyd
03e9e7b567
Add --code option to init fill-config
2021-03-12 10:03:57 +01:00
Adriane Boyd
124304b146
Add vocab kwarg back to spacy.load
...
* Additional minor formatting and docs cleanup
2021-03-11 10:58:59 +01:00
Adriane Boyd
84470d9b9e
Incorporate BILUO note from #7407
2021-03-11 10:11:21 +01:00
Adriane Boyd
4294bcf4ab
Align keyword-only in docs for init/util
2021-03-11 09:52:40 +01:00
Adriane Boyd
28726c25a1
Update docs for convert CLI and NER examples
2021-03-10 11:42:02 +01:00
Adriane Boyd
d746ea6278
Add warning about GPU selection in Jupyter notebooks ( #7075 )
...
* Initial warning
* Update check
* Redo edit
* Move jupyter warning to helper method
* Add link with details to warnings
2021-03-09 15:35:21 +01:00
Sofie Van Landeghem
932887b950
textcat scoring fix and multi_label docs ( #6974 )
...
* add multi-label textcat to menu
* add infobox on textcat API
* add info to v3 migration guide
* small edits
* further fixes in doc strings
* add infobox to textcat architectures
* add textcat_multilabel to overview of built-in components
* spelling
* fix unrelated warn msg
* Add textcat_multilabel to quickstart [ci skip]
* remove separate documentation page for multilabel_textcategorizer
* small edits
* positive label clarification
* avoid duplicating information in self.cfg and fix textcat.score
* fix multilabel textcat too
* revert threshold to storage in cfg
* revert threshold stuff for multi-textcat
Co-authored-by: Ines Montani <ines@ines.io>
2021-03-09 23:04:22 +11:00
Sofie Van Landeghem
cd70c3cb79
Fixing pretrain ( #7342 )
...
* initialize NLP with train corpus
* add more pretraining tests
* more tests
* function to fetch tok2vec layer for pretraining
* clarify parameter name
* test different objectives
* formatting
* fix check for static vectors when using vectors objective
* clarify docs
* logger statement
* fix init_tok2vec and proc.initialize order
* test training after pretraining
* add init_config tests for pretraining
* pop pretraining block to avoid config validation errors
* custom errors
2021-03-09 14:01:13 +11:00
Ines Montani
dfb23a419e
Merge branch 'spacy.io' [ci skip]
2021-03-06 17:38:54 +11:00
graue70
7d085d5b1c
Fix typo in docs
2021-03-05 18:30:09 +01:00
vincent d warmerdam
1b0d413e45
Removed Languages that were listed twice on Docs ( #7272 )
...
* removed languages that were listed twice
* sorted
* d0h
* the d0h strikes back when you dont hit save
2021-03-05 14:31:15 +01:00
svlandeg
682a6232e3
fix typo
2021-03-02 17:59:13 +01:00
svlandeg
d900c55061
consistently use registry as callable
2021-03-02 17:56:28 +01:00
graue70
0fddc0447c
Fix copy & paste error in API docs
2021-03-02 14:00:14 +01:00
Ines Montani
8f7c7b2658
Merge pull request #7211 from svlandeg/docs/el_update [ci skip]
...
kb.get_candidates renamed to get_alias_candidates
2021-02-27 11:51:22 +11:00
Ines Montani
408b94887a
Merge pull request #7207 from adrianeboyd/docs/get-noun-chunks [ci skip]
...
Extend docs related to Vocab.get_noun_chunks
2021-02-27 11:51:08 +11:00
svlandeg
248339039e
fix type in docs
2021-02-26 14:27:10 +01:00
svlandeg
08fd901a1b
kb.get_candidates renamed to get_alias_candidates
2021-02-25 20:09:36 +01:00
Adriane Boyd
6a37f343d5
Extend docs related to Vocab.get_noun_chunks
2021-02-25 16:38:21 +01:00
Ines Montani
d2c515354b
Auto-format [ci skip]
2021-02-24 22:37:32 +11:00
Ines Montani
9e8a7e08c1
Merge pull request #7115 from SergeyShk/ruts [ci skip]
2021-02-24 22:37:00 +11:00
Ines Montani
24cecbb3f4
Merge pull request #7126 from adrianeboyd/docs/gpu-id-opt [ci skip]
...
Add tip about --gpu-id to training quickstart
2021-02-24 22:34:17 +11:00
Ken
fa7ddc7f88
Update sentencizer documentation example with sentencizer pipe name ( #7185 )
2021-02-24 08:06:54 +01:00
Tocic
b1996a51a1
fix typo in models.md ( #7157 )
2021-02-22 09:00:38 +01:00
Sofie Van Landeghem
b92f81d5da
fix NEL config and IO, and n_sents functionality ( #7100 )
...
* fix NEL config and IO, and n_sents functionality
* add docs
* fix test
2021-02-22 14:49:52 +11:00
Sofie Van Landeghem
ba5a50f62b
NEL docs & UX ( #7129 )
...
* EL set_kb docs fix
* custom warning for set_kb mistake
2021-02-22 11:04:22 +11:00
Shkarin Sergey
22706ec9fb
Fixed universe.json
2021-02-20 08:02:38 +03:00
Adriane Boyd
7198be0f4b
Add tip about --gpu-id to training quickstart
2021-02-19 14:07:51 +01:00
Sofie Van Landeghem
709c9e75af
span.ent only returns first sentence ( #7084 )
...
* return first sentence when span contains sentence boundary
* docs fix
* small fixes
* cleanup
2021-02-19 23:02:38 +11:00
palandlom
9b82586699
var batch is useless ( #7111 )
...
It seems that nlp.update(examples) should be nlp.update(batch)
2021-02-18 09:44:22 +01:00
Ines Montani
fc4fb6eb3a
Make v2.x docs more prominent [ci skip]
2021-02-17 23:42:27 +11:00
Rajat
4e80ef3abb
updated code eg & description of contextualSpellCheck ( #7096 )
2021-02-17 13:26:43 +01:00
Shkarin Sergey
abac5dc203
Update universe.json
2021-02-15 15:01:46 +03:00
Ines Montani
4b729660bd
Merge pull request #7051 from MartinoMensio/dbpedia-spotlight [ci skip]
...
added spacy-dbpedia-spotlight
2021-02-14 14:06:08 +11:00
Ines Montani
06e66d4ced
Update languages.json [ci skip]
2021-02-13 12:33:17 +11:00
Martino Mensio
6c0c3d5ddc
added spacy-dbpedia-spotlight
2021-02-12 19:11:35 +01:00
Ines Montani
6b9026a219
Merge pull request #7000 from explosion/feature/project-yml-overrides
...
Support env vars and CLI overrides for project.yml
2021-02-11 12:31:45 +11:00
Peter Baumann
61b04a70d5
Run PhraseMatcher on Spans ( #6918 )
...
* Add regression test
* Run PhraseMatcher on Spans
* Add test for PhraseMatcher on Spans and Docs
* Add SCA
* Add test with 3 matches in Doc, 1 match in Span
* Update docs
* Use doc.length for find_matches in tokenizer
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-02-10 23:43:32 +11:00
Ines Montani
c08b3f294c
Support env vars and CLI overrides for project.yml
2021-02-10 13:45:27 +11:00
svlandeg
9a7f33c916
final 3.0 benchmark numbers
2021-02-09 21:28:33 +01:00
Ines Montani
ca3f8386d7
Merge pull request #6975 from svlandeg/fix/link [ci skip]
...
fix link
2021-02-09 14:34:11 +11:00
tarskiandhutch
e897e7aaad
Line 70: syntax error
...
Original config definition treated dictionary key as a function argument.
2021-02-08 15:24:57 -05:00
svlandeg
bb7482bef8
fix link
2021-02-08 18:39:59 +01:00
Sofie Van Landeghem
6ed423c16c
reduce memory load when reading all vectors from file ( #6945 )
...
* reduce memory load when reading all vectors from file
* one more small typo fix
2021-02-07 08:05:43 +08:00
Ines Montani
433835d9b0
Merge pull request #6889 from adrianeboyd/docs/source-install-dup [ci skip]
2021-02-05 13:35:16 +11:00
svlandeg
7cda5605a0
add type
2021-02-03 13:13:58 +01:00
svlandeg
94929c2b98
small doc fixes
2021-02-03 13:10:22 +01:00
Ines Montani
2cdfcd2d19
Update naming [ci skip]
2021-02-03 12:48:31 +11:00
Adriane Boyd
37a68a06ab
Update to recommend editable installs for source installs
2021-02-02 16:51:27 +01:00
Adriane Boyd
3a3e4daf60
Update install instructions
...
* Remove duplicate section about compiling from source
2021-02-02 14:44:15 +01:00
Ines Montani
ff6a21cd18
Update GitHub link [ci skip]
2021-02-02 14:27:46 +11:00
Pengcheng YIN
6fdc33203a
Fix a typo
2021-02-01 17:26:28 -05:00
Ines Montani
a59f3fcf5d
Make wheel the default format and update docs [ci skip]
2021-02-01 23:18:43 +11:00
Ines Montani
e17ea88e54
Fix config quickstart and download [ci skip]
2021-02-01 21:44:55 +11:00
Ines Montani
31b842d6ce
Update table [ci skip]
2021-02-01 14:17:52 +11:00
Ines Montani
4ca0f91506
Update labels [ci skip]
2021-01-31 20:10:56 +11:00
Ines Montani
7752f80f39
Update docs [ci skip]
2021-01-31 16:11:24 +11:00
Ines Montani
6a683970ea
Update Binder meta [ci skip]
2021-01-31 15:43:08 +11:00
Ines Montani
82da6aee08
Update labels [ci skip]
2021-01-31 15:28:52 +11:00
Ines Montani
a8a1231ccd
Update README and docs [ci skip]
2021-01-31 12:36:04 +11:00
Ines Montani
45c551037d
Update CLI docs [ci skip]
2021-01-30 21:50:23 +11:00
Ines Montani
ae07416fda
Merge branch 'website/v3-launch' into develop
2021-01-30 20:31:06 +11:00
Ines Montani
d07683873f
Merge branch 'master' into develop
2021-01-30 20:28:14 +11:00
Ines Montani
8626b82e49
Update images [ci skip]
2021-01-30 18:50:25 +11:00
Ines Montani
44dc987d85
Fix icon [ci skip]
2021-01-30 18:27:55 +11:00
Ines Montani
8d293a4c4b
Update website to support legacy state [ci skip]
2021-01-30 18:27:31 +11:00
Ines Montani
d3350afe45
Update docs and add support for legacy style
2021-01-30 17:43:12 +11:00
Ines Montani
2332c4280b
Update and use unified --build option
2021-01-30 13:11:36 +11:00
Ines Montani
2609ba4e89
Support building wheel in spacy package
2021-01-30 11:54:02 +11:00
Ines Montani
95e958a229
Merge pull request #6852 from explosion/feature/replace-listeners
2021-01-30 00:58:08 +11:00
Ines Montani
7694f76dd1
Update warning and mention replace_listeners
2021-01-29 23:46:01 +11:00
Adriane Boyd
8b76cb8095
Rephrase transformers PyTorch instructions
2021-01-29 13:36:56 +01:00
Ines Montani
095055ac48
Merge pull request #6855 from adrianeboyd/docs/trf-sentencepiece [ci skip]
...
Update transfomers install docs
2021-01-29 23:34:01 +11:00
Adriane Boyd
e3e87e7275
Update transfomers install docs
...
* Recommend installing PyTorch separately
* Add instructions for `sentencepiece`
2021-01-29 13:27:43 +01:00
Ines Montani
e766e8c56d
Apply suggestions from code review
...
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-01-29 21:41:17 +11:00
svlandeg
d7d838281c
adding new="3" mentions in the doc
2021-01-29 11:26:37 +01:00
Ines Montani
99af9e7125
Update documentation
2021-01-29 18:45:48 +11:00
Sofie Van Landeghem
24a697abb8
avoid empty aliases and improve UX and docs ( #6840 )
2021-01-29 08:51:40 +08:00
Sofie Van Landeghem
837a4f53c2
Error handling in nlp.pipe ( #6817 )
...
* add error handler for pipe methods
* add unit tests
* remove pipe method that are the same as their base class
* have Language keep track of a default error handler
* cleanup
* formatting
* small refactor
* add documentation
2021-01-29 08:51:21 +08:00
Ines Montani
ec5f55aa5b
Update config generation defaults and transformers ( #6832 )
2021-01-27 23:56:33 +11:00
Adriane Boyd
4096a79de7
Add alignment mode error and fix Doc.char_span docs ( #6820 )
...
* Raise an error on an unrecognized alignment mode rather than
defaulting to `strict`
* Fix the `Doc.char_span` API doc alignment mode details
2021-01-27 23:40:42 +11:00
Ines Montani
230e651ad6
Merge branch 'develop' into master-tmp
2021-01-27 13:26:29 +11:00
Ines Montani
634ae609b4
Adjust formatting [ci skip]
2021-01-27 13:08:00 +11:00
Ines Montani
d5ef245bb1
Merge pull request #6822 from jganseman/master [ci skip]
2021-01-27 13:04:30 +11:00
Ines Montani
5d79d1af50
Merge pull request #6796 from svlandeg/docs/benchmarks [ci skip]
2021-01-27 13:01:23 +11:00
Ines Montani
1ed7029d47
Update website for v3 launch
2021-01-27 12:39:47 +11:00
Adriane Boyd
c447aa2b98
Update --code arg in evaluate CLI docs
2021-01-26 15:30:46 +01:00
jganseman
907bce7a78
Merge pull request #1 from jganseman/patch-1
...
Patch 1
2021-01-26 11:12:30 +01:00
jganseman
8bc57ec372
also update is_oov in lexeme docs
2021-01-26 11:09:16 +01:00
jganseman
1f2b0ec168
proposing a more concise explanation for is_oov
...
proposing a more concise explanation for is_oov
2021-01-26 10:53:39 +01:00
Matthew Honnibal
f049df1715
Revert "Set annotations in update" ( #6810 )
...
* Revert "Set annotations in update (#6767 )"
This reverts commit e680efc7cc
.
* Fix version
* Update spacy/pipeline/entity_linker.py
* Update spacy/pipeline/entity_linker.py
* Update spacy/pipeline/tagger.pyx
* Update spacy/pipeline/tok2vec.py
* Update spacy/pipeline/tok2vec.py
* Update spacy/pipeline/transition_parser.pyx
* Update spacy/pipeline/transition_parser.pyx
* Update website/docs/api/multilabel_textcategorizer.md
* Update website/docs/api/tok2vec.md
* Update website/docs/usage/layers-architectures.md
* Update website/docs/usage/layers-architectures.md
* Update website/docs/api/transformer.md
* Update website/docs/api/textcategorizer.md
* Update website/docs/api/tagger.md
* Update spacy/pipeline/entity_linker.py
* Update website/docs/api/sentencerecognizer.md
* Update website/docs/api/pipe.md
* Update website/docs/api/morphologizer.md
* Update website/docs/api/entityrecognizer.md
* Update spacy/pipeline/entity_linker.py
* Update spacy/pipeline/multitask.pyx
* Update spacy/pipeline/tagger.pyx
* Update spacy/pipeline/tagger.pyx
* Update spacy/pipeline/textcat.py
* Update spacy/pipeline/textcat.py
* Update spacy/pipeline/textcat.py
* Update spacy/pipeline/tok2vec.py
* Update spacy/pipeline/trainable_pipe.pyx
* Update spacy/pipeline/trainable_pipe.pyx
* Update spacy/pipeline/transition_parser.pyx
* Update spacy/pipeline/transition_parser.pyx
* Update website/docs/api/entitylinker.md
* Update website/docs/api/dependencyparser.md
* Update spacy/pipeline/trainable_pipe.pyx
2021-01-25 22:18:45 +08:00
Adriane Boyd
61c9f8bf24
Remove transformers model max length section ( #6807 )
2021-01-25 19:59:34 +08:00
muratjumashev
7d0154a36e
Added language meta data
2021-01-25 00:42:19 +06:00
svlandeg
56064faed9
update caption
2021-01-23 00:57:00 +01:00
svlandeg
d7c0f40a96
update comment
2021-01-22 18:55:18 +01:00
svlandeg
a071279bc7
add speed comparison to docs
2021-01-22 18:46:35 +01:00
svlandeg
b132cb3036
update accuracies for new a1 models
2021-01-21 20:24:05 +01:00
Adriane Boyd
d0236136a2
Fix default config init in Transformer API docs ( #6781 )
2021-01-21 23:18:03 +08:00