Ines Montani
ae51f580c1
Fix handling of score_weights
2020-09-24 10:27:33 +02:00
Ines Montani
e2ffe51fb5
Update docs [ci skip]
2020-09-24 10:13:41 +02:00
Ines Montani
02008e9a55
Update docs [ci skip]
2020-09-23 22:02:31 +02:00
Ines Montani
c8bda92243
Update benchmarks [ci skip]
2020-09-23 20:05:02 +02:00
svlandeg
35dbc63578
Merge remote-tracking branch 'upstream/develop' into fix/nr_features
...
# Conflicts:
# spacy/ml/models/parser.py
# spacy/tests/serialize/test_serialize_config.py
# website/docs/api/architectures.md
2020-09-23 17:01:13 +02:00
Ines Montani
e4e7f5b00d
Update docs [ci skip]
2020-09-23 15:44:40 +02:00
svlandeg
6c85fab316
state_type and extra_state_tokens instead of nr_feature_tokens
2020-09-23 13:35:09 +02:00
Ines Montani
6ca06cb62c
Update docs and formatting [ci skip]
2020-09-23 10:14:27 +02:00
Ines Montani
60a317520a
Merge pull request #6109 from svlandeg/feature/2rename
2020-09-23 09:47:12 +02:00
Ines Montani
930b116f00
Update docs [ci skip]
2020-09-23 09:35:21 +02:00
svlandeg
b556a10808
rename converts in_to_out
2020-09-22 11:50:19 +02:00
Ines Montani
f9af7d365c
Update docs [ci skip]
2020-09-22 09:45:41 +02:00
Ines Montani
49e80dbcac
Merge pull request #6103 from explosion/chore/tidy-up-tests-docs-get-doc
2020-09-22 09:45:04 +02:00
Adriane Boyd
844db6ff12
Update architecture overview
2020-09-22 09:31:47 +02:00
Adriane Boyd
5fbb8dfcbc
Merge remote-tracking branch 'upstream/develop' into docs/various-v3-2
2020-09-22 09:22:58 +02:00
Ines Montani
67fbcb3da5
Tidy up tests and docs
2020-09-21 20:43:54 +02:00
Ines Montani
e548654aca
Update docs [ci skip]
2020-09-21 14:46:55 +02:00
Ines Montani
9d32cac736
Update docs [ci skip]
2020-09-21 10:55:36 +02:00
Adriane Boyd
cc71ec901f
Fix typo in saving and loading usage docs
2020-09-21 09:08:55 +02:00
Ines Montani
012b3a7096
Update docs [ci skip]
2020-09-20 17:44:58 +02:00
Ines Montani
554c9a2497
Update docs [ci skip]
2020-09-20 12:30:53 +02:00
Sofie Van Landeghem
39872de1f6
Introducing the gpu_allocator ( #6091 )
...
* rename 'use_pytorch_for_gpu_memory' to 'gpu_allocator'
* --code instead of --code-path
* update documentation
* avoid querying the "system" section directly
* add explanation of gpu_allocator to TF/PyTorch section in docs
* fix typo
* fix typo 2
* use set_gpu_allocator from thinc 8.0.0a34
* default null instead of empty string
2020-09-19 01:17:02 +02:00
Ines Montani
a127fa475e
Merge pull request #6078 from svlandeg/fix/corpus
2020-09-18 14:44:21 +02:00
Ines Montani
a0b4389a38
Update docs [ci skip]
2020-09-17 19:24:48 +02:00
Matthew Honnibal
6efb7688a6
Draft pretrain usage
2020-09-17 18:17:03 +02:00
Ines Montani
a2c8cda26f
Update docs [ci skip]
2020-09-17 17:12:51 +02:00
Matthew Honnibal
ec751068f3
Draft text for static vectors intro
2020-09-17 16:42:53 +02:00
svlandeg
c8c84f1ccd
Merge remote-tracking branch 'upstream/develop' into fix/corpus
2020-09-17 15:43:04 +02:00
Ines Montani
c8fa2247e3
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-09-17 12:34:15 +02:00
Ines Montani
6761028c6f
Update docs [ci skip]
2020-09-17 12:34:11 +02:00
svlandeg
0c35885751
generalize corpora, dot notation for dev and train corpus
2020-09-17 11:38:59 +02:00
svlandeg
781fae678b
Merge remote-tracking branch 'upstream/develop' into fix/corpus
2020-09-17 09:24:36 +02:00
Adriane Boyd
7e4cd7575c
Refactor Docs.is_ flags ( #6044 )
...
* Refactor Docs.is_ flags
* Add derived `Doc.has_annotation` method
* `Doc.has_annotation(attr)` returns `True` for partial annotation
* `Doc.has_annotation(attr, require_complete=True)` returns `True` for
complete annotation
* Add deprecation warnings to `is_tagged`, `is_parsed`, `is_sentenced`
and `is_nered`
* Add `Doc._get_array_attrs()`, which returns a full list of `Doc` attrs
for use with `Doc.to_array`, `Doc.to_bytes` and `Doc.from_docs`. The
list is the `DocBin` attributes list plus `SPACY` and `LENGTH`.
Notes on `Doc.has_annotation`:
* `HEAD` is converted to `DEP` because heads don't have an unset state
* Accept `IS_SENT_START` as a synonym of `SENT_START`
Additional changes:
* Add `NORM`, `ENT_ID` and `SENT_START` to default attributes for
`DocBin`
* In `Doc.from_array()` the presence of `DEP` causes `HEAD` to override
`SENT_START`
* In `Doc.from_array()` using `attrs` other than
`Doc._get_array_attrs()` (i.e., a user's custom list rather than our
default internal list) with both `HEAD` and `SENT_START` shows a warning
that `HEAD` will override `SENT_START`
* `set_children_from_heads` does not require dependency labels to set
sentence boundaries and sets `sent_start` for all non-sentence starts to
`-1`
* Fix call to set_children_form_heads
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-09-17 00:14:01 +02:00
svlandeg
51fa929f47
rewrite train_corpus to corpus.train in config
2020-09-15 21:58:04 +02:00
Ines Montani
b7faa38960
Update docs [ci skip]
2020-09-15 12:44:03 +02:00
Ines Montani
154752f9c2
Update docs and consistency [ci skip]
2020-09-15 00:32:49 +02:00
Ines Montani
85e5910102
Update docs [ci skip]
2020-09-13 23:09:19 +02:00
Ines Montani
5ebb2a2ac8
Update docs [ci skip]
2020-09-13 22:36:20 +02:00
Ines Montani
47acb45850
Update docs [ci skip]
2020-09-13 22:30:33 +02:00
Ines Montani
2e3d067a7b
Update docs [ci skip]
2020-09-13 19:29:06 +02:00
Ines Montani
99b26fe492
Update docs [ci skip]
2020-09-13 17:59:38 +02:00
Ines Montani
1316071086
Update docs [ci skip]
2020-09-13 11:31:50 +02:00
Ines Montani
368ecf705a
Update docs [ci skip]
2020-09-12 17:40:50 +02:00
Ines Montani
8b0dabe987
Update docs [ci skip]
2020-09-12 17:05:10 +02:00
Ines Montani
4fec8c39a3
Update project teaser [ci skip]
2020-09-10 13:23:03 +02:00
Ines Montani
763e302dcc
Update project widgets and examples [ci skip]
2020-09-10 13:04:16 +02:00
Ines Montani
908f3a4494
Update default projects repo [ci skip]
2020-09-10 11:42:14 +02:00
Ines Montani
2e567a47c2
Update docs and formatting
2020-09-09 21:26:10 +02:00
svlandeg
aa27e3f1f2
PyTorch spelling
2020-09-09 16:27:21 +02:00
svlandeg
a8aa9a8068
document Pipe API details, crossreferences etc
2020-09-09 15:56:27 +02:00
svlandeg
9a7c6cc61a
references to usage page on layers and architectures
2020-09-09 14:47:32 +02:00
svlandeg
e80898092b
Merge branch 'feature/more-layers-docs' of https://github.com/svlandeg/spaCy into feature/more-layers-docs
2020-09-09 14:44:28 +02:00
svlandeg
4c080b3a98
details on Thinc shape inference
2020-09-09 13:57:05 +02:00
svlandeg
39aa740777
Merge remote-tracking branch 'upstream/develop' into feature/more-layers-docs
2020-09-09 11:59:34 +02:00
svlandeg
e39242c4e6
formatting
2020-09-09 11:25:35 +02:00
Ines Montani
24053d83ec
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-09-09 11:20:14 +02:00
Ines Montani
406aed78ee
Update docs [ci skip]
2020-09-09 11:20:07 +02:00
Sofie Van Landeghem
8e7557656f
Renaming gold & annotation_setter ( #6042 )
...
* version bump to 3.0.0a16
* rename "gold" folder to "training"
* rename 'annotation_setter' to 'set_extra_annotations'
* formatting
2020-09-09 10:31:03 +02:00
svlandeg
a16afb79e3
add section on Thinc implementation details
2020-09-08 20:43:09 +02:00
svlandeg
1c476b4b41
how to register and use custom function
2020-09-08 20:22:20 +02:00
svlandeg
b35a26ea5d
example wrapped Torch model and chaining with Thinc
2020-09-08 18:32:58 +02:00
svlandeg
bd8f9b188b
small fixes
2020-09-08 17:24:36 +02:00
Ines Montani
d98ae9d918
Update docs [ci skip]
2020-09-08 10:33:48 +02:00
Ines Montani
c443c82722
Update docs [ci skip]
2020-09-05 13:41:10 +02:00
Ines Montani
b3e338d65e
Update docs [ci skip]
2020-09-04 20:58:36 +02:00
Ines Montani
157caf4dfa
WIP: update docs [ci skip]
2020-09-04 16:30:31 +02:00
Ines Montani
f174c7b1f3
Merge branch 'develop' into pr/6018
2020-09-04 15:54:49 +02:00
Ines Montani
864a697e63
Merge branch 'develop' into master-tmp
2020-09-04 13:15:36 +02:00
Adriane Boyd
b927893309
Merge branch 'develop' into feature/dependency-matcher-v3
2020-09-04 13:03:30 +02:00
Ines Montani
2189046869
Merge pull request #6024 from explosion/chore/registry-renaming
2020-09-04 10:54:10 +02:00
Ines Montani
b1eb98b15c
Remove todos [ci skip]
2020-09-03 17:43:58 +02:00
Ines Montani
23b7d9cfa3
Prefix span getters
2020-09-03 17:37:06 +02:00
Ines Montani
5afe6447cd
registry.assets -> registry.misc
2020-09-03 17:31:14 +02:00
Ines Montani
121809dd1e
Fix anchor [ci skip]
2020-09-03 16:49:56 +02:00
Ines Montani
25a595dc10
Fix typos and wording [ci skip]
2020-09-03 16:37:45 +02:00
Ines Montani
b5a0657fd6
"model" terminology consistency in docs
2020-09-03 13:13:03 +02:00
Ines Montani
b02ad8045b
Update docs [ci skip]
2020-09-03 10:10:13 +02:00
Ines Montani
1815c613c9
Update docs [ci skip]
2020-09-03 10:07:45 +02:00
Adriane Boyd
960d9cfadc
Officially support DependencyMatcher
...
Add official support for the `DependencyMatcher`. Redesign the pattern
specification. Fix and extend operator implementations. Update API docs
and add usage docs.
Patterns
--------
Refactor pattern structure to:
```
{
"LEFT_ID": str,
"REL_OP": str,
"RIGHT_ID": str,
"RIGHT_ATTRS": dict,
}
```
The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all
subsequent nodes contain all four keys.
New operators
-------------
Because of the way patterns are constructed from left to right, it's
helpful to have `follows` operators along with `precedes` operators. Add
operators for simple precedes / follows alongside immediate precedes /
follows.
* `.*`: precedes
* `;`: immediately follows
* `;*`: follows
Operator fixes
--------------
* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`,
all sibling operators)
* Linear precedence operators do not match nodes outside the same parse
Additional fixes
----------------
* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling
2020-09-02 17:45:29 +02:00
svlandeg
19298de352
small fix
2020-09-02 17:43:11 +02:00
svlandeg
bbaea530f6
sublayers paragraph
2020-09-02 17:36:22 +02:00
svlandeg
1be7ff02a6
swapping section
2020-09-02 15:26:07 +02:00
svlandeg
57e432ba2a
editor tip as Accordion instead of Infobox
2020-09-02 14:26:57 +02:00
svlandeg
d19ec6c67b
small rewrites in types paragraph
2020-09-02 14:25:18 +02:00
svlandeg
821b2d4e63
update examples
2020-09-02 14:15:50 +02:00
svlandeg
e29a33449d
rewrite intro, simpel Model example
2020-09-02 13:41:18 +02:00
svlandeg
422df9c2e2
Merge remote-tracking branch 'upstream/develop' into feature/docs-layers
...
# Conflicts:
# website/docs/usage/layers-architectures.md
2020-09-02 13:17:11 +02:00
Ines Montani
70238543c8
Update layers/arch docs structure [ci skip]
2020-09-02 13:04:35 +02:00
svlandeg
6fd7f140ec
custom-architectures section
2020-09-02 11:14:06 +02:00
svlandeg
3d9ae9286f
small fixes
2020-09-02 10:46:38 +02:00
Ines Montani
690bd77669
Add todos [ci skip]
2020-09-01 14:04:36 +02:00
Ines Montani
70b226f69d
Support ignore marker in project document [ci skip]
2020-09-01 12:49:04 +02:00
Ines Montani
9af82f3f11
Merge pull request #6003 from explosion/feature/matcher-as-spans
2020-08-31 17:50:56 +02:00
Sofie Van Landeghem
3ac620f09d
fix config example [ci skip]
2020-08-31 17:40:04 +02:00
Ines Montani
add9de5487
Deprecate (Phrase)Matcher.pipe
2020-08-31 17:01:24 +02:00
Ines Montani
bca6bf8dda
Update docs [ci skip]
2020-08-31 16:39:53 +02:00
Ines Montani
db9f8896f5
Add docs [ci skip]
2020-08-31 16:10:41 +02:00
svlandeg
e47ea88aeb
revert annotations refactor
2020-08-31 14:40:55 +02:00
svlandeg
13ee742fb4
example of custom logger
2020-08-31 14:24:41 +02:00
svlandeg
c18eb63483
Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs
...
# Conflicts:
# website/docs/usage/embeddings-transformers.md
2020-08-31 13:21:36 +02:00
Juan Gutiérrez
9002bea29f
Update suffixes example ( #5989 )
...
* Update suffixes example
The current example will throw `TypeError: can only concatenate list (not "tuple") to list`
* Signing Contributor Agreement
2020-08-31 12:44:56 +02:00
Sofie Van Landeghem
ec14744ee4
Rename Transformer listener ( #6001 )
...
* rename to spacy-transformers.TransformerListener
* add some more tok2vec tests
* use select_pipes
* fix docs - annotation setter was not changed in the end
2020-08-31 12:41:39 +02:00
Adriane Boyd
216efaf5f5
Restrict tokenizer exceptions to ORTH and NORM
2020-08-31 09:55:01 +02:00
Ines Montani
9b86312bab
Update docs [ci skip]
2020-08-29 18:43:19 +02:00
Adriane Boyd
870774f475
Merge branch 'develop' into docs/morph-usage-v3
2020-08-29 16:00:50 +02:00
Ines Montani
45f46a5c85
Merge pull request #5993 from explosion/feature/disabled-components
2020-08-29 15:58:41 +02:00
Adriane Boyd
f9ed31a757
Update usage docs for lemmatization and morphology
2020-08-29 15:56:50 +02:00
Ines Montani
bc0730be3f
Update docs [ci skip]
2020-08-29 12:53:14 +02:00
Ines Montani
450bf806b0
Merge pull request #5991 from adrianeboyd/docs/sent-usage-v3
...
Update sentence segmentation usage docs
2020-08-29 12:40:06 +02:00
Ines Montani
66d76f5126
Update docs
2020-08-29 12:36:05 +02:00
svlandeg
9f00a20ce4
proofreading and custom examples
2020-08-28 21:50:42 +02:00
svlandeg
5230529de2
add loggers registry & logger docs sections
2020-08-28 21:44:04 +02:00
Adriane Boyd
48df50533d
Update sentence segmentation usage docs
...
Update sentence segmentation usage docs to incorporate `senter`.
2020-08-28 10:58:16 +02:00
svlandeg
8cde6ccb7d
Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs
2020-08-27 19:56:09 +02:00
svlandeg
556e975a30
various fixes
2020-08-27 19:24:44 +02:00
svlandeg
329e490560
small import fixes
2020-08-27 14:50:43 +02:00
svlandeg
28e4ba7270
fix references to TransformerListener
2020-08-27 14:33:28 +02:00
svlandeg
4d37ac3f33
configure_custom_sent_spans example
2020-08-27 14:14:16 +02:00
svlandeg
c68169f83f
fix link
2020-08-27 10:19:43 +02:00
svlandeg
acc794c975
example of writing to other custom attribute
2020-08-27 10:10:10 +02:00
svlandeg
559b65f2e0
adjust references to null_annotation_setter to trfdata_setter
2020-08-27 09:43:32 +02:00
svlandeg
ec069627fe
rename to TransformerListener
2020-08-26 13:31:01 +02:00
Ines Montani
627617a079
Tidy up and add docs [ci skip]
2020-08-26 13:24:55 +02:00
svlandeg
15902c5aa2
fix link
2020-08-26 11:51:57 +02:00
Ines Montani
f31c4462ca
Update docs [ci skip]
2020-08-25 13:27:59 +02:00
Ines Montani
8ac5ef1284
Update docs
2020-08-25 11:54:37 +02:00
Matthew Honnibal
8038b87f04
Various small tweaks to project CLI ( #5965 )
...
* Fix up/download of http and local paths
* Support git_sparse_checkout for assets
* Fix scorer
* Handle already-present directories for git assets
* Improve convert command
* Fix support for existant files in git assets
* Support branches in git sparse checkout
* Format
* Fix git assets
* Document git block in assets
* Fix test
* Fix test
* Revert "Fix test"
This reverts commit cf3097260f
.
* Revert "Fix test"
This reverts commit 964d636e27
.
* Dont multiply p/r/f by 100
* Display scores * 100 during training
2020-08-25 00:30:52 +02:00
Matthew Honnibal
e559867605
Allow spacy project to push and pull to/from remote storage ( #5949 )
...
* Add utils for working with remote storage
* WIP add remote_cache for project
* WIP add push and pull commands
* Use pathy in remote_cache
* Updarte util
* Update remote_cache
* Update util
* Update project assets
* Update pull script
* Update push script
* Fix type annotation in util
* Work on remote storage
* Remove site and env hash
* Fix imports
* Fix type annotation
* Require pathy
* Require pathy
* Fix import
* Add a util to handle project variable substitution
* Import push and pull commands
* Fix pull command
* Fix push command
* Fix tarfile in remote_storage
* Improve printing
* Fiddle with status messages
* Set version to v3.0.0a9
* Draft docs for spacy project remote storages
* Update docs [ci skip]
* Use Thinc config to simplify and unify template variables
* Auto-format
* Don't import Pathy globally for now
Causes slow and annoying Google Cloud warning
* Tidy up test
* Tidy up and update tests
* Update to latest Thinc
* Update docs
* variables -> vars
* Update docs [ci skip]
* Update docs [ci skip]
Co-authored-by: Ines Montani <ines@ines.io>
2020-08-23 18:32:09 +02:00
Ines Montani
f27aecac14
Update formatting [ci skip]
2020-08-23 11:57:56 +02:00
Ines Montani
98a9e063b6
Update docs [ci skip]
2020-08-22 17:15:05 +02:00
Matthew Honnibal
8dfc4cbfe7
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-08-22 17:12:09 +02:00
Matthew Honnibal
048de64d4c
Suggest edits
2020-08-22 17:11:28 +02:00
Ines Montani
adcf790b96
Update docs[ci skip]
2020-08-22 17:04:16 +02:00
Ines Montani
37ebff6997
Update docs [ci skip]
2020-08-22 16:47:03 +02:00
Matthew Honnibal
8685229891
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-08-22 16:06:59 +02:00
Matthew Honnibal
d97695d09d
Update embeddings-transformers.md
2020-08-22 15:41:35 +02:00
Ines Montani
c7c9b0451f
Update docs [ci skip]
2020-08-22 13:52:52 +02:00
Ines Montani
71aeae89c5
Merge pull request #5948 from svlandeg/feature/docs-docs-docs [ci skip]
2020-08-22 12:18:47 +02:00
Ines Montani
27f81109d6
Update docs [ci skip]
2020-08-21 20:02:18 +02:00
Ines Montani
f102164a1f
Update docs [ci skip]
2020-08-21 19:34:06 +02:00
svlandeg
1b7cfa7347
Merge remote-tracking branch 'upstream/develop' into feature/docs-docs-docs
2020-08-21 18:36:18 +02:00
svlandeg
942adf0f4d
comma
2020-08-21 18:36:02 +02:00
svlandeg
262552010d
context manager with space (for consistency)
2020-08-21 18:34:02 +02:00
svlandeg
da48c6a2a2
several small updates
2020-08-21 18:25:26 +02:00
svlandeg
ad2332d4b7
alphabetize registries
2020-08-21 18:10:31 +02:00
svlandeg
c6659e37d8
small fixes
2020-08-21 18:02:20 +02:00
svlandeg
518a1f97f3
remove outdated TODO's
2020-08-21 17:55:15 +02:00
Ines Montani
2cc4640385
Update docs [ci skip]
2020-08-21 16:21:55 +02:00
Ines Montani
74cb6d39d0
Update docs [ci skip]
2020-08-21 16:11:38 +02:00
Ines Montani
aa6a7cd6e7
Update docs and consistency [ci skip]
2020-08-21 13:49:18 +02:00
Ines Montani
52bd3a8b48
Update docs [ci skip]
2020-08-21 13:22:59 +02:00
Ines Montani
e60442d83a
Adjust label casing in displaCy NER visualizer ( resolves #4866 )
...
- Accept any case for label names in ents and colors option, even if actual predicted label uses different casing
- Don't text-transform: uppercase visually, if it's important to users that the label is represented as-is in the UI
2020-08-21 11:51:31 +02:00
Ines Montani
04e4d59235
Update docs [ci skip]
2020-08-20 16:17:25 +02:00
Ines Montani
6ad59d59fe
Merge branch 'develop' of https://github.com/explosion/spaCy into develop [ci skip]
2020-08-20 11:20:58 +02:00
Ines Montani
fb51b55eb9
Add comment [ci skip]
2020-08-20 11:20:43 +02:00
Ines Montani
2253d26b82
Update vectors and similarity docs [ci skip]
2020-08-19 21:18:26 +02:00
Ines Montani
15e6feed01
Update docs [ci skip]
2020-08-19 20:37:54 +02:00
svlandeg
d8f6abdc23
add linking TODO back in
2020-08-19 18:00:35 +02:00
svlandeg
169b5bcda0
Merge remote-tracking branch 'upstream/develop' into feature/update-docs
...
# Conflicts:
# website/docs/usage/training.md
2020-08-19 17:58:25 +02:00
svlandeg
7119295a8a
badgers intro
2020-08-19 17:53:22 +02:00
svlandeg
4906a2ae6c
custom functions intro
2020-08-19 17:32:35 +02:00
svlandeg
7a2e6a96f5
fix typo
2020-08-19 16:54:16 +02:00
svlandeg
648499157a
rename "custom models" to "custom functions"
2020-08-19 16:53:51 +02:00
Ines Montani
63921161c8
Update docs [ci skip]
2020-08-19 16:04:21 +02:00
svlandeg
d3a8321172
fix typos
2020-08-19 15:12:12 +02:00
Ines Montani
225f8866a1
Fix consistency
2020-08-19 12:47:57 +02:00
Ines Montani
9c25656ccc
Update docs [ci skip]
2020-08-19 12:14:41 +02:00
Ines Montani
2285e59765
Merge pull request #5933 from svlandeg/feature/more-v3-docs [ci skip]
2020-08-19 11:29:02 +02:00
Ines Montani
13291e97ba
Update docs [ci skip]
2020-08-19 00:28:37 +02:00
svlandeg
6ed67d495a
format
2020-08-18 19:43:20 +02:00
svlandeg
f9fe5eb323
clean up example
2020-08-18 19:35:23 +02:00
svlandeg
a8acedd4ba
example of custom reader and batcher
2020-08-18 19:15:16 +02:00
svlandeg
abba639565
Merge remote-tracking branch 'upstream/develop' into feature/more-v3-docs
2020-08-18 18:55:12 +02:00
Ines Montani
82f0e20318
Update docs and consistency [ci skip]
2020-08-18 14:39:40 +02:00
Matthew Honnibal
b72bd1767f
Remove todo
2020-08-18 13:52:22 +02:00
Matthew Honnibal
574fd53289
Add precision/recall description
2020-08-18 13:51:08 +02:00
Matthew Honnibal
96a9c65f97
Add model architectures intro
2020-08-18 13:50:55 +02:00
svlandeg
f7b76d2d83
Merge remote-tracking branch 'upstream/develop' into feature/more-v3-docs
2020-08-18 11:57:52 +02:00
svlandeg
8dcda351ec
typo's and quick note on default values
2020-08-18 10:23:27 +02:00
Ines Montani
ef6cf3b276
Update docs [ci skip]
2020-08-18 01:29:34 +02:00
Ines Montani
728fec0194
Update docs [ci skip]
2020-08-18 00:49:19 +02:00
Ines Montani
9299166c75
Merge pull request #5925 from explosion/docs/vectors [ci skip]
...
Update the 'vectors' docs page
2020-08-17 21:45:09 +02:00
svlandeg
4fe4bab1c9
typo fixes
2020-08-17 17:10:15 +02:00
svlandeg
da80c18660
merge develop into branch
2020-08-17 16:57:18 +02:00
Ines Montani
3ae5e02f4f
Update docs, types and API consistency
2020-08-17 16:45:24 +02:00
Matthew Honnibal
052d82aa4e
Suggest vectors changes
2020-08-17 15:32:30 +02:00
svlandeg
961e818be6
p/r definitions
2020-08-17 15:02:39 +02:00
svlandeg
319692aa53
fix typos
2020-08-17 14:05:48 +02:00
Matthew Honnibal
be07567ac6
Update transformers page
2020-08-16 20:29:50 +02:00
Matthew Honnibal
8e5f99ee25
Update transformer docs intro. Also write system requirements
2020-08-16 20:13:24 +02:00
Ines Montani
a570c304df
Update quickstart, template and docs
2020-08-15 14:50:29 +02:00
Ines Montani
950832f087
Tidy up pipes ( #5906 )
...
* Tidy up pipes
* Fix init, defaults and raise custom errors
* Update docs
* Update docs [ci skip]
* Apply suggestions from code review
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
* Tidy up error handling and validation, fix consistency
* Simplify get_examples check
* Remove unused import [ci skip]
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-08-11 23:29:31 +02:00
Ines Montani
b7ec06e331
Update docs [ci skip]
2020-08-11 20:57:23 +02:00
Ines Montani
10f42e3a39
Update docs [ci skip]
2020-08-11 00:09:49 +02:00
Ines Montani
2778d04377
Update docs [ci skip]
2020-08-10 23:41:09 +02:00
Ines Montani
023ba7ae26
Update docs
2020-08-10 17:13:11 +02:00
Ines Montani
12052bd8f6
Update docs [ci skip]
2020-08-10 01:20:10 +02:00
Ines Montani
d611cbef43
Update docs [ci skip]
2020-08-10 00:42:26 +02:00
Ines Montani
c044460823
Update docs [ci skip]
2020-08-10 00:01:38 +02:00
Ines Montani
05dcab10aa
Fix typo
2020-08-09 22:34:03 +02:00
Ines Montani
8d2baa153d
Update tokenizer docs and add test
2020-08-09 15:24:01 +02:00
Ines Montani
3901b088ff
Update graphics and 101 [ci skip]
2020-08-07 17:14:13 +02:00
Ines Montani
5e1421e5a6
Update docs [ci skip]
2020-08-07 16:23:12 +02:00
Ines Montani
b7e34c1451
Update docs [ci skip]
2020-08-07 16:13:13 +02:00
Ines Montani
e829d3bf14
Update docs [ci skip]
2020-08-07 15:46:20 +02:00
svlandeg
824f4b2107
casing consistent
2020-08-06 23:20:13 +02:00
Ines Montani
e5995904d6
Update docs
2020-08-06 19:30:43 +02:00
Ines Montani
5d417d3b19
WIP: Update docs [ci skip]
2020-08-06 13:10:15 +02:00
Ines Montani
06e80d95cd
Sync develop with nightly docs state ( #5883 )
...
Co-authored-by: svlandeg <sofie.vanlandeghem@gmail.com>
2020-08-06 00:28:14 +02:00
Ines Montani
50311a4d37
Update docs [ci skip]
2020-08-05 20:29:53 +02:00
Ines Montani
cdec46493f
Update docs
2020-08-05 15:00:54 +02:00
Ines Montani
4c055f0aa7
Add init CLI and init config ( #5854 )
...
* Add init CLI and init config draft
* Improve config validation
* Auto-format
* Don't export anything in debug config
* Update docs
2020-08-02 15:18:30 +02:00
Ines Montani
b40f44419b
Simplify pipe analysis
...
- remove unused code
- don't print by default
- integrate attrs info into analysis output
2020-08-01 13:40:06 +02:00
Ines Montani
98c6a85c8b
Update docs [ci skip]
2020-07-31 18:55:38 +02:00
Ines Montani
e9e8fa2466
Update docs and types
2020-07-31 17:02:54 +02:00
Ines Montani
160f1a5f94
Update docs [ci skip]
2020-07-31 13:26:39 +02:00
Ines Montani
3449c45fd9
Update docs [ci skip]
2020-07-29 19:48:26 +02:00
Ines Montani
9c80cb673d
Update docs [ci skip]
2020-07-29 19:41:34 +02:00
Ines Montani
9f69afdd1e
Update docs [ci skip]
2020-07-29 19:09:44 +02:00
Ines Montani
7a21775cd0
Merge pull request #5834 from explosion/feature/vectors
2020-07-29 18:49:26 +02:00
Ines Montani
158d8c1e48
Update docs [ci skip]
2020-07-29 18:44:10 +02:00
Matthew Honnibal
f7adc9d3b7
Start rewriting vectors docs
2020-07-29 17:10:06 +02:00
Ines Montani
e0ffe36e79
Update docstrings, docs and types
2020-07-29 11:36:42 +02:00
Ines Montani
d8b519c23c
API docs, docstrings and argument consistency
2020-07-27 18:11:45 +02:00
Ines Montani
7dd53d0964
Fix typo [ci skip]
2020-07-27 00:34:00 +02:00
Ines Montani
7adbaf9a5b
Update docs [ci skip]
2020-07-27 00:29:45 +02:00
Matthew Honnibal
fb5dbe30b5
Trim training 101
2020-07-26 13:43:22 +02:00
Matthew Honnibal
e6a7deb7cc
Edits to the training 101 section
2020-07-26 13:42:08 +02:00
Ines Montani
c288dba8e7
Update docs [ci skip]
2020-07-25 18:51:12 +02:00
Li Zhe
a69eb445dc
fix the wrong hash url in adding-languages.md file ( #5810 )
...
* fix the wrong hash url in adding-languages.md file
change the #101 url hash path to #language-data
* filled in the spaCy Contributor Agreement
filled in the spaCy Contributor Agreement
2020-07-25 13:13:38 +02:00
Adriane Boyd
d3385f4be2
Add Morphology and MorphAnalysis to overview
2020-07-21 13:06:22 +02:00
Ines Montani
644074b954
Merge branch 'develop' into master-tmp
2020-07-20 14:58:04 +02:00
Adriane Boyd
39ebcd9ec9
Refactor Chinese tokenizer configuration ( #5736 )
...
* Refactor Chinese tokenizer configuration
Refactor `ChineseTokenizer` configuration so that it uses a single
`segmenter` setting to choose between character segmentation, jieba, and
pkuseg.
* replace `use_jieba`, `use_pkuseg`, `require_pkuseg` with the setting
`segmenter` with the supported values: `char`, `jieba`, `pkuseg`
* make the default segmenter plain character segmentation `char` (no
additional libraries required)
* Fix Chinese serialization test to use char default
* Warn if attempting to customize other segmenter
Add a warning if `Chinese.pkuseg_update_user_dict` is called when
another segmenter is selected.
2020-07-19 13:34:37 +02:00
Adriane Boyd
cd5af72c9a
Update pkuseg version ( #5774 )
...
* Update pkuseg version in Chinese tokenizer warnings
* Update pkuseg version in `Makefile`
* Remove warning about python3.8 wheels in docs
2020-07-19 11:09:49 +02:00
Ines Montani
872938ec76
Merge pull request #5747 from explosion/feature/refactor-config-args
2020-07-14 00:00:22 +02:00
Ines Montani
5f6f4ff594
Remove object subclassing
2020-07-12 14:03:23 +02:00
Ines Montani
3f948b9c74
Update docs
2020-07-12 12:32:28 +02:00
Ines Montani
7b5717cac3
Merge branch 'develop' into feature/refactor-config-args
2020-07-10 22:50:07 +02:00
Ines Montani
e6a6587a9a
Update projects.md [ci skip]
2020-07-10 22:41:27 +02:00
Ines Montani
f2cd982e7b
Update training.md
2020-07-10 22:34:27 +02:00
Ines Montani
52e9b5b472
Fix formatting
2020-07-09 23:25:58 +02:00
Ines Montani
28cdae898a
Update projects.md
2020-07-09 22:35:54 +02:00
Ines Montani
7bcf9f7cfb
Document new features
2020-07-09 21:10:36 +02:00
Ines Montani
ea01831f6a
Update projects docs etc.
2020-07-09 19:43:25 +02:00
Ines Montani
2298e129e6
Update example and training docs
2020-07-07 20:30:12 +02:00
svlandeg
2b60e894cb
fix component constructors, update, begin_training, reference to GoldParse
2020-07-07 19:17:19 +02:00
Ines Montani
bb3ee38cf9
Update WIP
2020-07-06 22:22:37 +02:00
Ines Montani
44790c1c32
Update docs and add keyword-only tag
2020-07-06 18:14:57 +02:00
Ines Montani
a35236e5f0
Update v3 docs WIP [ci skip]
2020-07-06 15:57:44 +02:00
Ines Montani
63247cbe87
Update v3 docs [ci skip]
2020-07-05 16:11:16 +02:00