Matthew Honnibal
|
648532d647
|
Don't assume blas methods are present
|
2018-03-16 02:48:20 +01:00 |
|
Matthew Honnibal
|
d55620041b
|
Switch parser to gemm from thinc.openblas
|
2018-03-13 02:10:58 +01:00 |
|
Matthew Honnibal
|
4b72c38556
|
Fix dropout bug in beam parser
|
2018-03-10 23:16:40 +01:00 |
|
Matthew Honnibal
|
3d6487c734
|
Support dropout in beam parse
|
2018-03-10 22:41:55 +01:00 |
|
Matthew Honnibal
|
661873ee4c
|
Randomize the rebatch size in parser
|
2018-02-21 21:02:07 +01:00 |
|
Matthew Honnibal
|
ea2fc5d45f
|
Improve length and freq cutoffs in parser
|
2018-02-21 16:00:38 +01:00 |
|
Matthew Honnibal
|
e5757d4bf0
|
Add labels property to parser
|
2018-02-21 16:00:00 +01:00 |
|
Matthew Honnibal
|
8f06903e09
|
Fix multitask objectives
|
2018-02-17 18:41:36 +01:00 |
|
Matthew Honnibal
|
d1246c95fb
|
Fix model loading when using multitask objectives
|
2018-02-17 18:11:36 +01:00 |
|
Matthew Honnibal
|
7d5c720fc3
|
Fix multitask objective when no pipeline provided
|
2018-02-15 23:50:21 +01:00 |
|
Claudiu-Vlad Ursache
|
e28de12cbd
|
Ensure files opened in from_disk are closed
Fixes [issue 1706](https://github.com/explosion/spaCy/issues/1706).
|
2018-02-13 20:49:43 +01:00 |
|
Matthew Honnibal
|
f74a802d09
|
Test and fix #1919: Error resuming training
|
2018-02-02 02:32:40 +01:00 |
|
Matthew Honnibal
|
85c942a6e3
|
Dont overwrite pretrained_dims setting from cfg. Fixes #1727
|
2018-01-23 19:10:49 +01:00 |
|
Matthew Honnibal
|
fe4748fc38
|
Merge pull request #1870 from avadhpatel/master
Model Load Performance Improvement by more than 5x
|
2018-01-22 00:05:15 +01:00 |
|
Avadh Patel
|
a517df55c8
|
Small fix
Signed-off-by: Avadh Patel <avadh4all@gmail.com>
|
2018-01-21 15:20:45 -06:00 |
|
Avadh Patel
|
5b5029890d
|
Merge branch 'perfTuning' into perfTuningMaster
Signed-off-by: Avadh Patel <avadh4all@gmail.com>
|
2018-01-21 15:20:00 -06:00 |
|
Matthew Honnibal
|
203d2ea830
|
Allow multitask objectives to be added to the parser and NER more easily
|
2018-01-21 19:37:02 +01:00 |
|
Avadh Patel
|
75903949da
|
Updated model building after suggestion from Matthew
Signed-off-by: Avadh Patel <avadh4all@gmail.com>
|
2018-01-18 06:51:57 -06:00 |
|
Avadh Patel
|
fe879da2a1
|
Do not train model if its going to be loaded from disk
This saves significant time in loading a model from disk.
Signed-off-by: Avadh Patel <avadh4all@gmail.com>
|
2018-01-17 06:16:07 -06:00 |
|
Avadh Patel
|
2146faffee
|
Do not train model if its going to be loaded from disk
This saves significant time in loading a model from disk.
Signed-off-by: Avadh Patel <avadh4all@gmail.com>
|
2018-01-17 06:04:22 -06:00 |
|
Matthew Honnibal
|
d274d3a3b9
|
Let beam forward use minibatches
|
2017-11-15 00:51:42 +01:00 |
|
Matthew Honnibal
|
855872f872
|
Remove state hashing
|
2017-11-14 23:36:46 +01:00 |
|
Matthew Honnibal
|
2512ea9eeb
|
Fix memory leak in beam parser
|
2017-11-14 02:11:40 +01:00 |
|
Matthew Honnibal
|
ca73d0d8fe
|
Cleanup states after beam parsing, explicitly
|
2017-11-13 18:18:26 +01:00 |
|
Matthew Honnibal
|
25859dbb48
|
Return optimizer from begin_training, creating if necessary
|
2017-11-06 14:26:49 +01:00 |
|
Matthew Honnibal
|
2b35bb76ad
|
Fix tensorizer on GPU
|
2017-11-05 15:34:40 +01:00 |
|
Matthew Honnibal
|
3ca16ddbd4
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-04 00:25:02 +01:00 |
|
Matthew Honnibal
|
98c29b7912
|
Add padding vector in parser, to make gradient more correct
|
2017-11-04 00:23:23 +01:00 |
|
Matthew Honnibal
|
13c8881d2f
|
Expose parser's tok2vec model component
|
2017-11-03 20:20:59 +01:00 |
|
Matthew Honnibal
|
7fea845374
|
Remove print statement
|
2017-11-03 14:04:51 +01:00 |
|
Matthew Honnibal
|
a5b05f85f0
|
Set Doc.tensor attribute in parser
|
2017-11-03 11:21:00 +01:00 |
|
Matthew Honnibal
|
7698903617
|
Fix GPU usage
|
2017-10-31 02:33:16 +01:00 |
|
Matthew Honnibal
|
b713d10d97
|
Switch to 13 features in parser
|
2017-10-28 23:01:14 +00:00 |
|
Matthew Honnibal
|
5414e2f14b
|
Use missing features in parser
|
2017-10-28 16:45:54 +00:00 |
|
Matthew Honnibal
|
64e4ff7c4b
|
Merge 'tidy-up' changes into branch. Resolve conflicts
|
2017-10-28 13:16:06 +02:00 |
|
Explosion Bot
|
b22e42af7f
|
Merge changes to parser and _ml
|
2017-10-28 11:52:10 +02:00 |
|
ines
|
b4d226a3f1
|
Tidy up syntax
|
2017-10-27 19:45:57 +02:00 |
|
ines
|
e33b7e0b3c
|
Tidy up parser and ML
|
2017-10-27 14:39:30 +02:00 |
|
Matthew Honnibal
|
531142a933
|
Merge remote-tracking branch 'origin/develop' into feature/better-parser
|
2017-10-27 12:34:48 +00:00 |
|
Matthew Honnibal
|
75a637fa43
|
Remove redundant imports from _ml
|
2017-10-27 10:19:56 +00:00 |
|
Matthew Honnibal
|
bb25bdcd92
|
Adjust call to scatter_add for the new version
|
2017-10-27 01:16:55 +00:00 |
|
Matthew Honnibal
|
90d1d9b230
|
Remove obsolete parser code
|
2017-10-26 13:22:45 +02:00 |
|
Matthew Honnibal
|
35977bdbb9
|
Update better-parser branch with develop
|
2017-10-26 00:55:53 +00:00 |
|
ines
|
18aae423fb
|
Remove import of non-existing function
|
2017-10-25 15:54:10 +02:00 |
|
ines
|
5117a7d24d
|
Fix whitespace
|
2017-10-25 15:54:02 +02:00 |
|
Matthew Honnibal
|
075e8118ea
|
Update from develop
|
2017-10-25 12:45:21 +02:00 |
|
Matthew Honnibal
|
dd5b2d8fa3
|
Check for out-of-memory when calling calloc. Closes #1446
|
2017-10-24 12:40:47 +02:00 |
|
Matthew Honnibal
|
e7556ff048
|
Fix non-maxout parser
|
2017-10-23 18:16:23 +02:00 |
|
Matthew Honnibal
|
1036798155
|
Make parser consistent if maxout==1
|
2017-10-20 16:24:16 +02:00 |
|
Matthew Honnibal
|
827cd8a883
|
Fix support of maxout pieces in parser
|
2017-10-20 03:07:17 +02:00 |
|
Matthew Honnibal
|
a8850b4282
|
Remove redundant PrecomputableMaxouts class
|
2017-10-19 20:27:34 +02:00 |
|
Matthew Honnibal
|
b00d0a2c97
|
Fix bias in parser
|
2017-10-19 18:42:11 +02:00 |
|
Matthew Honnibal
|
b54b4b8a97
|
Make parser_maxout_pieces hyper-param work
|
2017-10-19 13:45:18 +02:00 |
|
Matthew Honnibal
|
15e5a04a8d
|
Clean up more depth=0 conditional code
|
2017-10-19 01:48:43 +02:00 |
|
Matthew Honnibal
|
906c50ac59
|
Fix loop typing, that caused error on windows
|
2017-10-19 01:48:39 +02:00 |
|
Matthew Honnibal
|
960788aaa2
|
Eliminate dead code in parser, and raise errors for obsolete options
|
2017-10-19 00:42:34 +02:00 |
|
Matthew Honnibal
|
bbfd7d8d5d
|
Clean up parser multi-threading
|
2017-10-19 00:25:21 +02:00 |
|
Matthew Honnibal
|
f018f2030c
|
Try optimized parser forward loop
|
2017-10-18 21:48:00 +02:00 |
|
Matthew Honnibal
|
633a75c7e0
|
Break parser batches into sub-batches, sorted by length.
|
2017-10-18 21:45:01 +02:00 |
|
Matthew Honnibal
|
908f44c3fe
|
Disable history features by default
|
2017-10-12 14:56:11 +02:00 |
|
Matthew Honnibal
|
cecfcc7711
|
Set default hyper params back to 'slow' settings
|
2017-10-12 13:12:26 +02:00 |
|
Matthew Honnibal
|
807e109f2b
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-10-11 02:47:59 -05:00 |
|
Matthew Honnibal
|
6e552c9d83
|
Prune number of non-projective labels more aggressiely
|
2017-10-11 02:46:44 -05:00 |
|
Matthew Honnibal
|
188f620046
|
Improve parser defaults
|
2017-10-11 09:43:48 +02:00 |
|
Matthew Honnibal
|
3065f12ef2
|
Make add parser label work for hidden_depth=0
|
2017-10-10 22:57:31 +02:00 |
|
Matthew Honnibal
|
8265b90c83
|
Update parser defaults
|
2017-10-09 21:55:20 -05:00 |
|
Matthew Honnibal
|
09d61ada5e
|
Merge pull request #1396 from explosion/feature/pipeline-management
💫 Improve pipeline and factory management
|
2017-10-10 04:29:54 +02:00 |
|
Matthew Honnibal
|
d8a2506023
|
Merge pull request #1401 from explosion/feature/add-parser-action
💫 Allow labels to be added to pre-trained parser and NER modes
|
2017-10-09 04:57:51 +02:00 |
|
Matthew Honnibal
|
d43a83e37a
|
Allow parser.add_label for pretrained models
|
2017-10-09 03:35:40 +02:00 |
|
Matthew Honnibal
|
20309fb9db
|
Make history features default to zero
|
2017-10-08 20:32:14 +02:00 |
|
Matthew Honnibal
|
42b401d08b
|
Change default hidden depth to 1
|
2017-10-07 21:05:21 -05:00 |
|
Matthew Honnibal
|
3d22ccf495
|
Update default hyper-parameters
|
2017-10-07 07:16:41 -05:00 |
|
Matthew Honnibal
|
0384f08218
|
Trigger nonproj.deprojectivize as a postprocess
|
2017-10-07 02:00:47 +02:00 |
|
Matthew Honnibal
|
8be46d766e
|
Remove print statement
|
2017-10-06 16:19:02 -05:00 |
|
Matthew Honnibal
|
8e731009fe
|
Fix parser config serialization
|
2017-10-06 13:50:52 -05:00 |
|
Matthew Honnibal
|
16ba6aa8a6
|
Fix parser config serialization
|
2017-10-06 13:17:31 -05:00 |
|
Matthew Honnibal
|
c66399d8ae
|
Fix depth definition with history features
|
2017-10-06 06:20:05 -05:00 |
|
Matthew Honnibal
|
21d11936fe
|
Fix significant train/test skew error in history feats
|
2017-10-06 06:08:50 -05:00 |
|
Matthew Honnibal
|
555d8c8bff
|
Fix beam history features
|
2017-10-05 22:21:50 -05:00 |
|
Matthew Honnibal
|
363aa47b40
|
Clean up dead parsing code
|
2017-10-05 21:53:49 -05:00 |
|
Matthew Honnibal
|
ca12764772
|
Enable history features for beam parser
|
2017-10-05 21:53:29 -05:00 |
|
Matthew Honnibal
|
e25ffcb11f
|
Move history size under feature flags
|
2017-10-05 19:38:13 -05:00 |
|
Matthew Honnibal
|
943af4423a
|
Make depth setting in parser work again
|
2017-10-04 20:06:05 -05:00 |
|
Matthew Honnibal
|
246612cb53
|
Merge remote-tracking branch 'origin/develop' into feature/parser-history-model
|
2017-10-03 16:56:42 -05:00 |
|
Matthew Honnibal
|
5454b20cd7
|
Update thinc imports for 6.9
|
2017-10-03 20:07:17 +02:00 |
|
Matthew Honnibal
|
4a59f6358c
|
Fix thinc imports
|
2017-10-03 19:21:26 +02:00 |
|
Matthew Honnibal
|
dc3c791947
|
Fix history size option
|
2017-10-03 13:41:23 +02:00 |
|
Matthew Honnibal
|
278a4c17c6
|
Fix history features
|
2017-10-03 13:27:10 +02:00 |
|
Matthew Honnibal
|
b50a359e11
|
Add support for history features in parsing models
|
2017-10-03 12:44:01 +02:00 |
|
Matthew Honnibal
|
cdb2d83e16
|
Pass dropout in parser
|
2017-09-28 18:47:13 -05:00 |
|
Matthew Honnibal
|
158e177cae
|
Fix default embed size
|
2017-09-28 08:25:23 -05:00 |
|
Matthew Honnibal
|
1a37a2c0a0
|
Update training defaults
|
2017-09-27 11:48:07 -05:00 |
|
Matthew Honnibal
|
3274b46a0d
|
Try to fix compile error on Windows
|
2017-09-26 09:05:53 -05:00 |
|
Matthew Honnibal
|
5056743ad5
|
Fix parser serialization
|
2017-09-26 06:44:56 -05:00 |
|
Matthew Honnibal
|
bf917225ab
|
Allow multi-task objectives during training
|
2017-09-26 05:42:52 -05:00 |
|
Matthew Honnibal
|
4348c479fc
|
Merge pre-trained vectors and noshare patches
|
2017-09-22 20:07:28 -05:00 |
|
Matthew Honnibal
|
0795857dcb
|
Fix beam parsing
|
2017-09-23 02:59:53 +02:00 |
|
Matthew Honnibal
|
d9124f1aa3
|
Add link_vectors_to_models function
|
2017-09-22 09:38:22 -05:00 |
|
Matthew Honnibal
|
20193371f5
|
Don't share CNN, to reduce complexities
|
2017-09-21 14:59:48 +02:00 |
|
Matthew Honnibal
|
24e85c2048
|
Pass values for CNN maxout pieces option
|
2017-09-20 19:16:12 -05:00 |
|
Matthew Honnibal
|
2489dcaccf
|
Fix serialization of parser
|
2017-09-19 23:42:12 +02:00 |
|
Matthew Honnibal
|
2b0efc77ae
|
Fix wiring of pre-trained vectors in parser loading
|
2017-09-17 05:47:34 -05:00 |
|
Matthew Honnibal
|
31c2e91c35
|
Fix wiring of pre-trained vectors in parser loading
|
2017-09-17 05:46:55 -05:00 |
|
Matthew Honnibal
|
43210abacc
|
Resolve fine-tuning conflict
|
2017-09-17 05:30:04 -05:00 |
|
Matthew Honnibal
|
5ff2491f24
|
Pass option for pre-trained vectors in parser
|
2017-09-16 12:47:21 -05:00 |
|
Matthew Honnibal
|
f730d07e4e
|
Fix prange error for Windows
|
2017-09-16 00:25:33 +02:00 |
|
Matthew Honnibal
|
8b481e0465
|
Remove redundant brackets
|
2017-09-15 10:38:08 +02:00 |
|
Matthew Honnibal
|
664c5af745
|
Revert padding in parser
|
2017-09-14 16:59:25 +02:00 |
|
Matthew Honnibal
|
c6395b057a
|
Improve parser feature extraction, for missing values
|
2017-09-14 16:18:02 +02:00 |
|
Matthew Honnibal
|
dcbf866970
|
Merge parser changes
|
2017-09-06 18:41:05 +02:00 |
|
Matthew Honnibal
|
24ff6b0ad9
|
Fix parsing and tok2vec models
|
2017-09-06 05:50:58 -05:00 |
|
Matthew Honnibal
|
33fa91feb7
|
Restore correctness of parser model
|
2017-09-04 21:19:30 +02:00 |
|
Matthew Honnibal
|
9d65d67985
|
Preserve model compatibility in parser, for now
|
2017-09-04 16:46:22 +02:00 |
|
Matthew Honnibal
|
789e1a3980
|
Use 13 parser features, not 8
|
2017-08-31 14:13:00 -05:00 |
|
Matthew Honnibal
|
4ceebde523
|
Fix gradient bug in parser
|
2017-08-30 17:32:56 -05:00 |
|
Matthew Honnibal
|
682346dd66
|
Restore optimized hidden_depth=0 for parser
|
2017-08-21 19:18:04 -05:00 |
|
Matthew Honnibal
|
62878e50db
|
Fix misalignment caued by filtering inputs at wrong point in parser
|
2017-08-20 15:59:28 -05:00 |
|
Matthew Honnibal
|
84b7ed49e4
|
Ensure updates aren't made if no gold available
|
2017-08-20 14:41:38 +02:00 |
|
Matthew Honnibal
|
ab28f911b4
|
Fix parser learning rates
|
2017-08-19 09:02:57 -05:00 |
|
Matthew Honnibal
|
c307a0ffb8
|
Restore patches from nn-beam-parser to spacy/syntax
|
2017-08-18 22:38:59 +02:00 |
|
Matthew Honnibal
|
5f81d700ff
|
Restore patches from nn-beam-parser to spacy/syntax
|
2017-08-18 22:23:03 +02:00 |
|
Matthew Honnibal
|
d456d2efe1
|
Fix conflicts in nn_parser
|
2017-08-18 20:55:58 +02:00 |
|
Matthew Honnibal
|
1cec1efca7
|
Fix merge conflicts in nn_parser from beam stuff
|
2017-08-18 20:50:49 +02:00 |
|
Matthew Honnibal
|
426f84937f
|
Resolve conflicts when merging new beam parsing stuff
|
2017-08-18 13:38:32 -05:00 |
|
Matthew Honnibal
|
f75420ae79
|
Unhack beam parsing, moving it under options instead of global flags
|
2017-08-18 13:31:15 -05:00 |
|
Matthew Honnibal
|
0209a06b4e
|
Update beam parser
|
2017-08-16 18:25:49 -05:00 |
|
Matthew Honnibal
|
210f6d5175
|
Fix efficiency error in batch parse
|
2017-08-15 03:19:03 -05:00 |
|
Matthew Honnibal
|
a8e4064dd8
|
Fix tensor gradient in parser
|
2017-08-15 03:14:36 -05:00 |
|
Matthew Honnibal
|
e420e0366c
|
Remove use of hash function in beam parser
|
2017-08-15 03:13:57 -05:00 |
|
Matthew Honnibal
|
52c180ecf5
|
Revert "Merge branch 'develop' of https://github.com/explosion/spaCy into develop"
This reverts commit ea8de11ad5 , reversing
changes made to 08e443e083 .
|
2017-08-14 13:00:23 +02:00 |
|
Matthew Honnibal
|
0ae045256d
|
Fix beam training
|
2017-08-13 18:02:05 -05:00 |
|
Matthew Honnibal
|
6a42cc16ff
|
Fix beam parser, improve efficiency of non-beam
|
2017-08-13 12:37:26 +02:00 |
|
Matthew Honnibal
|
17874fe491
|
Disable beam parsing
|
2017-08-12 19:35:40 -05:00 |
|
Matthew Honnibal
|
3e30712b62
|
Improve defaults
|
2017-08-12 19:24:17 -05:00 |
|
Matthew Honnibal
|
28e930aae0
|
Fixes for beam parsing. Not working
|
2017-08-12 19:22:52 -05:00 |
|
Matthew Honnibal
|
4638f4b869
|
Fix beam update
|
2017-08-12 17:15:16 -05:00 |
|
Matthew Honnibal
|
b353e4d843
|
Work on parser beam training
|
2017-08-12 14:47:45 -05:00 |
|
Matthew Honnibal
|
cd5ecedf6a
|
Try drop_layer in parser
|
2017-08-12 08:56:33 -05:00 |
|
Matthew Honnibal
|
1a59db1c86
|
Fix dropout and learn rate in parser
|
2017-08-12 05:44:39 -05:00 |
|
Matthew Honnibal
|
d01dc3704a
|
Adjust parser model
|
2017-08-09 20:06:33 -05:00 |
|
Matthew Honnibal
|
f37528ef58
|
Pass embed size for parser fine-tune. Use SELU
|
2017-08-09 17:52:53 -05:00 |
|
Matthew Honnibal
|
bbace204be
|
Gate parser fine-tuning behind feature flag
|
2017-08-09 16:40:42 -05:00 |
|
Matthew Honnibal
|
dbdd8afc4b
|
Fix parser fine-tune training
|
2017-08-08 15:46:07 -05:00 |
|
Matthew Honnibal
|
88bf1cf87c
|
Update parser for fine tuning
|
2017-08-08 15:34:17 -05:00 |
|
Matthew Honnibal
|
42bd26f6f3
|
Give parser its own tok2vec weights
|
2017-08-06 18:33:46 +02:00 |
|
Matthew Honnibal
|
bfffdeabb2
|
Fix parser batch-size bug introduced during cleanup
|
2017-08-06 14:10:48 +02:00 |
|
Matthew Honnibal
|
7f876a7a82
|
Clean up some unused code in parser
|
2017-08-06 00:00:21 +02:00 |
|
Matthew Honnibal
|
3da1063b36
|
Add beam decoding to parser, to allow NER uncertainties
|
2017-07-20 15:02:55 +02:00 |
|
Matthew Honnibal
|
6d0356e6cc
|
Whitespace
|
2017-06-04 14:55:24 -05:00 |
|
ines
|
6669583f4e
|
Use OrderedDict
|
2017-06-02 21:07:56 +02:00 |
|