Ines Montani
be99f1e4de
Remove output dirs before training (#6204)
* Remove output dirs before training
* Re-raise error if cleaning fails
2020-10-05 20:11:16 +02:00
Matthew Honnibal
e50047f1c5
Check lengths match
2020-10-05 20:02:45 +02:00
Ines Montani
582701519e
Remove __release__ flag
2020-10-05 20:00:49 +02:00
Ines Montani
d58fb42707
Add spacy_version option and validation for project.yml
2020-10-05 20:00:42 +02:00
Matthew Honnibal
db84d175c3
Fix test
2020-10-05 19:59:30 +02:00
Matthew Honnibal
cdd2b79b6d
Remove deprecated MultiHashEmbed
2020-10-05 19:58:18 +02:00
Matthew Honnibal
6dcc4a0ba6
Simplify MultiHashEmbed signature
2020-10-05 19:57:45 +02:00
Adriane Boyd
d2806f11f2
Update to spacy-pkuseg==0.0.26 in Makefile
2020-10-05 18:08:32 +02:00
svlandeg
193e0d5a98
add docs for entity_ruler.initialize
2020-10-05 18:04:08 +02:00
svlandeg
3ac3447eee
cleanup
2020-10-05 17:50:37 +02:00
svlandeg
9eb813a35d
Merge remote-tracking branch 'upstream/develop' into fix/patterns-init
2020-10-05 17:49:44 +02:00
Adriane Boyd
f102ef6b54
Read features.msgpack instead of features.pkl
2020-10-05 17:47:39 +02:00
svlandeg
4e3ace4b8c
is_trainable method
2020-10-05 17:43:42 +02:00
Ines Montani
84fedcebab
Make args keyword-only [ci skip]
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-10-05 17:07:35 +02:00
Matthew Honnibal
71e73ed0a6
Merge branch 'develop' into feature/embed-features
2020-10-05 17:00:05 +02:00
Matthew Honnibal
3ee3649b52
Fix augment
2020-10-05 16:59:49 +02:00
Matthew Honnibal
22937d25a9
Merge branch 'develop' into feature/embed-features
2020-10-05 16:42:17 +02:00
Matthew Honnibal
8deed614e9
Fix augment
2020-10-05 16:41:45 +02:00
Matthew Honnibal
4ed3e037df
Fix augment
2020-10-05 16:40:55 +02:00
Matthew Honnibal
9f1bc3f24c
Fix augment
2020-10-05 16:40:23 +02:00
svlandeg
dc06912c76
prevent loss keyerror for non-trainable components
2020-10-05 16:33:28 +02:00
Adriane Boyd
187234648c
Revert back to "default" as default for pkuseg_user_dict
2020-10-05 16:24:28 +02:00
svlandeg
65abd77779
add finish_update to Pipe
2020-10-05 16:23:33 +02:00
Matthew Honnibal
90040aacec
Fix merge
2020-10-05 16:12:01 +02:00
Matthew Honnibal
93a98e8c3e
Merge branch 'develop' into feature/embed-features
2020-10-05 15:51:31 +02:00
Matthew Honnibal
eb9ba61517
Format
2020-10-05 15:29:49 +02:00
Matthew Honnibal
7d93575f35
spacy/tests/
2020-10-05 15:28:12 +02:00
Matthew Honnibal
f4ca9a39cb
spacy/tests/
2020-10-05 15:27:06 +02:00
Matthew Honnibal
f2f1deca66
spacy/tests/
2020-10-05 15:24:33 +02:00
Matthew Honnibal
8ec79ad3fa
Allow configuration of MultiHashEmbed features
Update arguments to MultiHashEmbed layer so that the attributes can be
controlled. A kind of tricky scheme is used to allow optional
specification of the rows. I think it's an okay balance between
flexibility and convenience.
2020-10-05 15:22:00 +02:00
Ines Montani
7946fd84bb
Merge pull request #6200 from adrianeboyd/bugfix/vocab-disk-lookups-vectors
Always serialize lookups and vectors to disk
2020-10-05 15:15:25 +02:00
Ines Montani
8171e28b20
Remove logging [ci skip]
This would be fired on each example, which is wrong
2020-10-05 15:09:52 +02:00
svlandeg
251b3eb4e5
add initialize method for entity_ruler
2020-10-05 14:59:13 +02:00
Sofie Van Landeghem
f4f49f5877
update blis (#6198)
* allow higher blis version
* fix typo
* bump to 3.0.0a34
* fix pins in other files
2020-10-05 14:58:56 +02:00
Adriane Boyd
5d19dfc9d3
Update Chinese tokenizer for spacy-pkuseg fork
2020-10-05 14:21:53 +02:00
Matthew Honnibal
6a9d14e35a
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-10-05 14:17:41 +02:00
Matthew Honnibal
d2b9aafb8c
Fix augmenter
2020-10-05 14:14:49 +02:00
Ines Montani
6260fa3c10
Merge pull request #6201 from svlandeg/fix/error_nr
2020-10-05 14:00:57 +02:00
Ines Montani
6958510bda
Include spaCy version check in project CLI
2020-10-05 13:53:07 +02:00
Ines Montani
20f2a17a09
Merge test_misc and test_util
2020-10-05 13:45:57 +02:00
svlandeg
fd2d48556c
fix E902 and E903 numbering
2020-10-05 13:43:32 +02:00
Ines Montani
e3acad6264
Update docs [ci skip]
2020-10-05 13:06:20 +02:00
Ines Montani
0f64556c04
Merge pull request #6197 from svlandeg/feature/pipe-docs [ci skip]
2020-10-05 11:55:40 +02:00
Ines Montani
1c641e41c3
Remove unused import [ci skip]
2020-10-05 11:50:11 +02:00
Ines Montani
2acbec2d2b
Merge pull request #6199 from adrianeboyd/bugfix/ru-uk-lemmatizer-init
Update ru/uk lemmatizers for new nlp.initialize
2020-10-05 11:48:47 +02:00
Adriane Boyd
03cfb2d2f4
Always serialize lookups and vectors to disk
2020-10-05 09:40:20 +02:00
Adriane Boyd
b0b93854cb
Update ru/uk lemmatizers for new nlp.initialize
2020-10-05 09:27:16 +02:00
svlandeg
9a6c9b133b
various small fixes
2020-10-05 01:05:37 +02:00
svlandeg
52b660e9dc
initialize and update explanation
2020-10-05 00:39:36 +02:00
Ines Montani
549758f67d
Adjust test for now
2020-10-04 23:16:09 +02:00