Ines Montani
154752f9c2
Update docs and consistency [ci skip]
2020-09-15 00:32:49 +02:00
Sofie Van Landeghem
3216a33149
positive_label config for textcat ( #6062 )
...
* hook up positive_label in textcat
* unit tests
* documentation
* formatting
* tests
* fix typo
* move verify_config to after begin_training
* revert accidential commit
2020-09-14 17:08:00 +02:00
Ines Montani
9afb1d9965
Merge pull request #6063 from svlandeg/feature/doc_cleanup [ci skip]
2020-09-14 10:35:43 +02:00
Ines Montani
47acb45850
Update docs [ci skip]
2020-09-13 22:30:33 +02:00
Ines Montani
2e3d067a7b
Update docs [ci skip]
2020-09-13 19:29:06 +02:00
Sofie Van Landeghem
744df9814a
define threshold for scoring textcat in TextCat config ( #6055 )
...
* define threshold for scoring textcat in TextCat config
* fix unit test and documentation
2020-09-13 14:15:52 +02:00
svlandeg
c4f324d5f1
doc fixes
2020-09-12 17:38:54 +02:00
Ines Montani
8b0dabe987
Update docs [ci skip]
2020-09-12 17:05:10 +02:00
Ines Montani
0b2e07215d
Support overwriting name on spacy package
2020-09-11 11:38:28 +02:00
svlandeg
97d99f7efa
Merge remote-tracking branch 'upstream/develop' into feature/doc-fixes
2020-09-10 11:51:34 +02:00
Ines Montani
15bc3a37b4
Add --branch to project clone
2020-09-10 11:08:15 +02:00
Ines Montani
b7afd09d27
Update formatting [ci skip]
2020-09-10 11:07:09 +02:00
svlandeg
9073d99fc9
fix link to shape inference section
2020-09-10 10:22:59 +02:00
Ines Montani
1955aaaa20
Merge pull request #6045 from svlandeg/feature/more-layers-docs [ci skip]
2020-09-09 21:46:40 +02:00
Ines Montani
2e567a47c2
Update docs and formatting
2020-09-09 21:26:10 +02:00
svlandeg
c89e07927e
document individual component API pages
2020-09-09 16:18:38 +02:00
Sofie Van Landeghem
cb66ea7400
Remove simple_ner code ( #6041 )
...
* remove simple_ner code
* remove unused _biluo and _iob files
2020-09-09 16:11:27 +02:00
svlandeg
a8aa9a8068
document Pipe API details, crossreferences etc
2020-09-09 15:56:27 +02:00
svlandeg
39aa740777
Merge remote-tracking branch 'upstream/develop' into feature/more-layers-docs
2020-09-09 11:59:34 +02:00
Sofie Van Landeghem
8e7557656f
Renaming gold & annotation_setter ( #6042 )
...
* version bump to 3.0.0a16
* rename "gold" folder to "training"
* rename 'annotation_setter' to 'set_extra_annotations'
* formatting
2020-09-09 10:31:03 +02:00
Marek Grzenkowicz
a26f864ed3
Clarify how to choose pretrained weights files ( closes #6027 ) [ci skip] ( #6039 )
2020-09-08 21:13:50 +02:00
svlandeg
bd8f9b188b
small fixes
2020-09-08 17:24:36 +02:00
Ines Montani
157caf4dfa
WIP: update docs [ci skip]
2020-09-04 16:30:31 +02:00
Ines Montani
f174c7b1f3
Merge branch 'develop' into pr/6018
2020-09-04 15:54:49 +02:00
Ines Montani
864a697e63
Merge branch 'develop' into master-tmp
2020-09-04 13:15:36 +02:00
Adriane Boyd
b927893309
Merge branch 'develop' into feature/dependency-matcher-v3
2020-09-04 13:03:30 +02:00
Ines Montani
4daf138136
Fix alphabetic ordering [ci skip]
2020-09-03 23:01:50 +02:00
Ines Montani
23b7d9cfa3
Prefix span getters
2020-09-03 17:37:06 +02:00
Ines Montani
5afe6447cd
registry.assets -> registry.misc
2020-09-03 17:31:14 +02:00
Ines Montani
c063e55eb7
Add prefix to batchers
2020-09-03 17:30:41 +02:00
Ines Montani
804f120361
Don't use registered function version in title
2020-09-03 17:29:47 +02:00
Ines Montani
c53b1433b9
Adjust more arguments [ci skip]
2020-09-03 17:12:24 +02:00
Ines Montani
25a595dc10
Fix typos and wording [ci skip]
2020-09-03 16:37:45 +02:00
Ines Montani
b5a0657fd6
"model" terminology consistency in docs
2020-09-03 13:13:03 +02:00
Adriane Boyd
960d9cfadc
Officially support DependencyMatcher
...
Add official support for the `DependencyMatcher`. Redesign the pattern
specification. Fix and extend operator implementations. Update API docs
and add usage docs.
Patterns
--------
Refactor pattern structure to:
```
{
"LEFT_ID": str,
"REL_OP": str,
"RIGHT_ID": str,
"RIGHT_ATTRS": dict,
}
```
The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all
subsequent nodes contain all four keys.
New operators
-------------
Because of the way patterns are constructed from left to right, it's
helpful to have `follows` operators along with `precedes` operators. Add
operators for simple precedes / follows alongside immediate precedes /
follows.
* `.*`: precedes
* `;`: immediately follows
* `;*`: follows
Operator fixes
--------------
* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`,
all sibling operators)
* Linear precedence operators do not match nodes outside the same parse
Additional fixes
----------------
* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling
2020-09-02 17:45:29 +02:00
svlandeg
bbaea530f6
sublayers paragraph
2020-09-02 17:36:22 +02:00
Ines Montani
9af82f3f11
Merge pull request #6003 from explosion/feature/matcher-as-spans
2020-08-31 17:50:56 +02:00
Ines Montani
3929431af1
Update docs [ci skip]
2020-08-31 17:06:33 +02:00
Ines Montani
add9de5487
Deprecate (Phrase)Matcher.pipe
2020-08-31 17:01:24 +02:00
svlandeg
2c3b64a567
console logging example
2020-08-31 16:56:13 +02:00
Ines Montani
bca6bf8dda
Update docs [ci skip]
2020-08-31 16:39:53 +02:00
Ines Montani
db9f8896f5
Add docs [ci skip]
2020-08-31 16:10:41 +02:00
svlandeg
fe6c08218e
fixes
2020-08-31 14:51:49 +02:00
svlandeg
0e0abb0378
fix
2020-08-31 14:50:29 +02:00
svlandeg
56ba691ecd
small fixes
2020-08-31 14:46:00 +02:00
svlandeg
e47ea88aeb
revert annotations refactor
2020-08-31 14:40:55 +02:00
svlandeg
2c90a06fee
some more information about the loggers
2020-08-31 13:43:17 +02:00
svlandeg
c18eb63483
Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs
...
# Conflicts:
# website/docs/usage/embeddings-transformers.md
2020-08-31 13:21:36 +02:00
Sofie Van Landeghem
ec14744ee4
Rename Transformer listener ( #6001 )
...
* rename to spacy-transformers.TransformerListener
* add some more tok2vec tests
* use select_pipes
* fix docs - annotation setter was not changed in the end
2020-08-31 12:41:39 +02:00
Ines Montani
9b86312bab
Update docs [ci skip]
2020-08-29 18:43:19 +02:00
Adriane Boyd
870774f475
Merge branch 'develop' into docs/morph-usage-v3
2020-08-29 16:00:50 +02:00
Ines Montani
45f46a5c85
Merge pull request #5993 from explosion/feature/disabled-components
2020-08-29 15:58:41 +02:00
Adriane Boyd
f9ed31a757
Update usage docs for lemmatization and morphology
2020-08-29 15:56:50 +02:00
Ines Montani
450bf806b0
Merge pull request #5991 from adrianeboyd/docs/sent-usage-v3
...
Update sentence segmentation usage docs
2020-08-29 12:40:06 +02:00
Ines Montani
66d76f5126
Update docs
2020-08-29 12:36:05 +02:00
svlandeg
5230529de2
add loggers registry & logger docs sections
2020-08-28 21:44:04 +02:00
Adriane Boyd
48df50533d
Update sentence segmentation usage docs
...
Update sentence segmentation usage docs to incorporate `senter`.
2020-08-28 10:58:16 +02:00
svlandeg
72a87095d9
add loggers registry
2020-08-27 20:26:28 +02:00
svlandeg
aa9e0c9c39
small fix
2020-08-27 19:56:52 +02:00
svlandeg
8cde6ccb7d
Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs
2020-08-27 19:56:09 +02:00
svlandeg
556e975a30
various fixes
2020-08-27 19:24:44 +02:00
Ines Montani
ff4175e839
Add more info to debug config
2020-08-27 18:17:58 +02:00
svlandeg
559b65f2e0
adjust references to null_annotation_setter to trfdata_setter
2020-08-27 09:43:32 +02:00
Ines Montani
696f167478
Add diff example to docs [ci skip]
2020-08-26 15:57:54 +02:00
Adriane Boyd
90d88729e0
Add AttributeRuler.score ( #5963 )
...
* Add AttributeRuler.score
Add scoring for TAG / POS / MORPH / LEMMA if these are present in the
assigned token attributes.
Add default score weights (that don't really make a lot of sense) so
that the scores are in the default config in some form.
* Update docs
2020-08-26 15:39:30 +02:00
svlandeg
ec069627fe
rename to TransformerListener
2020-08-26 13:31:01 +02:00
Ines Montani
627617a079
Tidy up and add docs [ci skip]
2020-08-26 13:24:55 +02:00
svlandeg
15902c5aa2
fix link
2020-08-26 11:51:57 +02:00
svlandeg
feb86d5206
clarify default
2020-08-26 11:21:30 +02:00
Ines Montani
8ac5ef1284
Update docs
2020-08-25 11:54:37 +02:00
Matthew Honnibal
e559867605
Allow spacy project to push and pull to/from remote storage ( #5949 )
...
* Add utils for working with remote storage
* WIP add remote_cache for project
* WIP add push and pull commands
* Use pathy in remote_cache
* Updarte util
* Update remote_cache
* Update util
* Update project assets
* Update pull script
* Update push script
* Fix type annotation in util
* Work on remote storage
* Remove site and env hash
* Fix imports
* Fix type annotation
* Require pathy
* Require pathy
* Fix import
* Add a util to handle project variable substitution
* Import push and pull commands
* Fix pull command
* Fix push command
* Fix tarfile in remote_storage
* Improve printing
* Fiddle with status messages
* Set version to v3.0.0a9
* Draft docs for spacy project remote storages
* Update docs [ci skip]
* Use Thinc config to simplify and unify template variables
* Auto-format
* Don't import Pathy globally for now
Causes slow and annoying Google Cloud warning
* Tidy up test
* Tidy up and update tests
* Update to latest Thinc
* Update docs
* variables -> vars
* Update docs [ci skip]
* Update docs [ci skip]
Co-authored-by: Ines Montani <ines@ines.io>
2020-08-23 18:32:09 +02:00
Ines Montani
c7c9b0451f
Update docs [ci skip]
2020-08-22 13:52:52 +02:00
Ines Montani
71aeae89c5
Merge pull request #5948 from svlandeg/feature/docs-docs-docs [ci skip]
2020-08-22 12:18:47 +02:00
Ines Montani
f102164a1f
Update docs [ci skip]
2020-08-21 19:34:06 +02:00
svlandeg
1b7cfa7347
Merge remote-tracking branch 'upstream/develop' into feature/docs-docs-docs
2020-08-21 18:36:18 +02:00
svlandeg
dc98f69b57
alphabetize registries
2020-08-21 18:10:21 +02:00
svlandeg
518a1f97f3
remove outdated TODO's
2020-08-21 17:55:15 +02:00
svlandeg
e92bd6e1c1
alphabetize training lists
2020-08-21 17:42:19 +02:00
Ines Montani
74cb6d39d0
Update docs [ci skip]
2020-08-21 16:11:38 +02:00
Matthew Honnibal
f5bcc10268
Update architectures
2020-08-21 15:34:54 +02:00
Matthew Honnibal
7ed8f4504b
Update API docs for architectures
2020-08-21 15:22:19 +02:00
Ines Montani
52bd3a8b48
Update docs [ci skip]
2020-08-21 13:22:59 +02:00
Ines Montani
e60442d83a
Adjust label casing in displaCy NER visualizer ( resolves #4866 )
...
- Accept any case for label names in ents and colors option, even if actual predicted label uses different casing
- Don't text-transform: uppercase visually, if it's important to users that the label is represented as-is in the UI
2020-08-21 11:51:31 +02:00
Ines Montani
04e4d59235
Update docs [ci skip]
2020-08-20 16:17:25 +02:00
Sofie Van Landeghem
410b54e10e
Update website/docs/api/data-formats.md
...
Co-authored-by: Ines Montani <ines@ines.io>
2020-08-20 11:15:34 +02:00
svlandeg
ae719b354f
fix typos
2020-08-20 10:20:40 +02:00
svlandeg
f728c00cbb
Merge remote-tracking branch 'upstream/develop' into feature/update-more-docs
...
# Conflicts:
# website/docs/api/data-formats.md
2020-08-20 10:02:13 +02:00
svlandeg
229033831a
add explanation of raw_text
2020-08-20 10:00:45 +02:00
Ines Montani
ea6640ea72
Merge pull request #5939 from explosion/feature/thinc-v8.0.0a28
...
Update Thinc and config variables
2020-08-19 21:14:36 +02:00
svlandeg
09f3cfc985
add version
2020-08-19 19:58:45 +02:00
svlandeg
7d9f00bdbf
waltzing schedule
2020-08-19 19:53:00 +02:00
Ines Montani
3dd390b1a1
Update Thinc and config variables
2020-08-19 19:46:12 +02:00
svlandeg
85b39639e1
small fix
2020-08-19 19:17:36 +02:00
svlandeg
169b5bcda0
Merge remote-tracking branch 'upstream/develop' into feature/update-docs
...
# Conflicts:
# website/docs/usage/training.md
2020-08-19 17:58:25 +02:00
svlandeg
7119295a8a
badgers intro
2020-08-19 17:53:22 +02:00
svlandeg
648499157a
rename "custom models" to "custom functions"
2020-08-19 16:53:51 +02:00
Ines Montani
63921161c8
Update docs [ci skip]
2020-08-19 16:04:21 +02:00
svlandeg
60fedb8518
fix 2 more API lines
2020-08-19 14:55:32 +02:00
svlandeg
2dfd919585
add kb_loader and get_candidates back to EL API
2020-08-19 14:52:49 +02:00
Ines Montani
225f8866a1
Fix consistency
2020-08-19 12:47:57 +02:00