ines
|
507ecb67af
|
Fix Spanish tag map
|
2017-11-05 19:23:34 +01:00 |
|
Matthew Honnibal
|
320008352b
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-05 18:46:15 +01:00 |
|
Matthew Honnibal
|
38109a0e4a
|
Register SentenceSegmenter in Language.factories
|
2017-11-05 18:45:57 +01:00 |
|
ines
|
975e1042ff
|
Fix Italian tag map
|
2017-11-05 18:34:09 +01:00 |
|
ines
|
6b2d6e4937
|
Fix Portuguese tag map
|
2017-11-05 18:31:00 +01:00 |
|
ines
|
fa2687fded
|
Fix Dutch tag map
|
2017-11-05 17:57:59 +01:00 |
|
ines
|
fb8990d916
|
Fix Spanish tag map
|
2017-11-05 17:48:46 +01:00 |
|
ines
|
9d13288f73
|
Fix French tag map
|
2017-11-05 17:47:59 +01:00 |
|
ines
|
54579805c5
|
Fix French tag map
|
2017-11-05 17:44:05 +01:00 |
|
Matthew Honnibal
|
2b35bb76ad
|
Fix tensorizer on GPU
|
2017-11-05 15:34:40 +01:00 |
|
Matthew Honnibal
|
6e5181bbaa
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-05 15:33:56 +01:00 |
|
Matthew Honnibal
|
6f438b17c1
|
Increment version to v2.0.0a19
|
2017-11-05 14:43:36 +01:00 |
|
Matthew Honnibal
|
225cc249c9
|
Pass string path to numpy, to fix #1479
|
2017-11-05 14:42:46 +01:00 |
|
Matthew Honnibal
|
00435d8f0c
|
Add extra beam parsing test
|
2017-11-05 14:39:57 +01:00 |
|
Matthew Honnibal
|
e777ea25bb
|
Merge pull request #1492 from uwol/develop
TextCategorizer return parameter fix
|
2017-11-05 14:13:04 +01:00 |
|
Matthew Honnibal
|
0d4bd6414e
|
Fix Italian tag map
|
2017-11-05 14:11:03 +01:00 |
|
ines
|
ef597622a6
|
Add Portuguese tag map
|
2017-11-05 13:58:34 +01:00 |
|
ines
|
793c62dfda
|
Add Dutch tag map
|
2017-11-05 13:48:07 +01:00 |
|
ines
|
f7485a09c8
|
Fix Italian tag map
|
2017-11-05 13:12:58 +01:00 |
|
uwol
|
a2162b8908
|
tensorizer return parameter fix
|
2017-11-05 12:25:10 +01:00 |
|
ines
|
0a27afbf86
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-04 23:32:52 +01:00 |
|
ines
|
3cef901834
|
Add tag map for French and Italian
|
2017-11-04 23:32:51 +01:00 |
|
Matthew Honnibal
|
cfb83c231c
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-04 23:08:19 +01:00 |
|
Matthew Honnibal
|
d185927998
|
Undo harmful pickling hacks on Language class
|
2017-11-04 23:07:03 +01:00 |
|
ines
|
6c15aafebd
|
Fix formatting
|
2017-11-04 23:07:02 +01:00 |
|
Matthew Honnibal
|
3ca16ddbd4
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-04 00:25:02 +01:00 |
|
Matthew Honnibal
|
e4ec4be948
|
Fix parser test
|
2017-11-04 00:23:45 +01:00 |
|
Matthew Honnibal
|
98c29b7912
|
Add padding vector in parser, to make gradient more correct
|
2017-11-04 00:23:23 +01:00 |
|
ines
|
5e7d98f72a
|
Remove test for #1491
|
2017-11-03 22:10:57 +01:00 |
|
ines
|
718f1c50fb
|
Add regression test for #1491
|
2017-11-03 21:11:20 +01:00 |
|
Matthew Honnibal
|
144a93c2a5
|
Back-off to tensor for similarity if no vectors
|
2017-11-03 20:56:33 +01:00 |
|
Matthew Honnibal
|
1e9634691a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-03 20:21:15 +01:00 |
|
Matthew Honnibal
|
13c8881d2f
|
Expose parser's tok2vec model component
|
2017-11-03 20:20:59 +01:00 |
|
Matthew Honnibal
|
17c63906f9
|
Update tensorizer component
|
2017-11-03 20:20:26 +01:00 |
|
Matthew Honnibal
|
2bf21cbe29
|
Update model after optimising it instead of waiting
|
2017-11-03 20:20:01 +01:00 |
|
Matthew Honnibal
|
d6e831bf89
|
Fix lemmatizer tests
|
2017-11-03 19:46:34 +01:00 |
|
ines
|
eef930c73e
|
Assert instead of print
|
2017-11-03 18:50:57 +01:00 |
|
ines
|
f0986df94b
|
Add test for #1488 (passes on v2.0.0a18?)
|
2017-11-03 14:44:36 +01:00 |
|
Matthew Honnibal
|
711278b667
|
Make test less flakey
|
2017-11-03 14:36:08 +01:00 |
|
Matthew Honnibal
|
7fea845374
|
Remove print statement
|
2017-11-03 14:04:51 +01:00 |
|
Matthew Honnibal
|
0a534ae96a
|
Fix test for backprop d_pad
|
2017-11-03 14:04:16 +01:00 |
|
Matthew Honnibal
|
33bd2428db
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-03 13:29:56 +01:00 |
|
Matthew Honnibal
|
6681058abd
|
Fix tensor extending in tagger
|
2017-11-03 13:29:36 +01:00 |
|
Matthew Honnibal
|
bd2cbdfa85
|
Make Morphology not fail on unknown tags
|
2017-11-03 13:29:09 +01:00 |
|
Matthew Honnibal
|
c9b118a7e9
|
Set softmax attr in tagger model
|
2017-11-03 11:22:01 +01:00 |
|
Matthew Honnibal
|
a5b05f85f0
|
Set Doc.tensor attribute in parser
|
2017-11-03 11:21:00 +01:00 |
|
Matthew Honnibal
|
62ed58935a
|
Add Doc.extend_tensor() method
|
2017-11-03 11:20:31 +01:00 |
|
Matthew Honnibal
|
d6fc39c8a6
|
Set Doc.tensor from Tagger
|
2017-11-03 11:20:05 +01:00 |
|
Matthew Honnibal
|
b3264aa5f0
|
Expose the softmax layer in the tagger model, to allow setting tensors
|
2017-11-03 11:19:51 +01:00 |
|
Matthew Honnibal
|
c2bbf076a4
|
Add document length cap for training
|
2017-11-03 01:54:54 +01:00 |
|
Matthew Honnibal
|
6771780d3f
|
Fix backprop of padding variable
|
2017-11-03 01:54:34 +01:00 |
|
Matthew Honnibal
|
54a716f2ec
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-03 00:55:20 +01:00 |
|
Matthew Honnibal
|
260e6ee3fb
|
Improve efficiency of backprop of padding variable
|
2017-11-03 00:49:11 +01:00 |
|
Matthew Honnibal
|
a22f96c3f1
|
Add test for backpropagating padding
|
2017-11-03 00:48:54 +01:00 |
|
ines
|
9baab241b4
|
Add skeleton language data for Turkish
|
2017-11-02 16:32:24 +01:00 |
|
ines
|
c6fea3e5f6
|
Add Romanian and Croatian skeletons (experimental)
Add language data templates to make it easier for others to contribute to the language support
|
2017-11-01 23:04:28 +01:00 |
|
ines
|
18c859500b
|
Add missing imports
|
2017-11-01 23:02:51 +01:00 |
|
ines
|
819e30a26e
|
Tidy up tokenizer exceptions
|
2017-11-01 23:02:45 +01:00 |
|
ines
|
3af281a334
|
Update test model name
|
2017-11-01 23:02:00 +01:00 |
|
Matthew Honnibal
|
b30dd36179
|
Allow Tagger.add_label() before training
|
2017-11-01 21:49:24 +01:00 |
|
Matthew Honnibal
|
eca41f0cf6
|
Fix filename conversion for conllu
|
2017-11-01 21:26:49 +01:00 |
|
Matthew Honnibal
|
e237472cdc
|
Fix tag and filename conversion for conllu
|
2017-11-01 21:25:33 +01:00 |
|
Matthew Honnibal
|
b84d99b281
|
Revert tagger.add_label() changes, to fix model
|
2017-11-01 21:10:45 +01:00 |
|
Matthew Honnibal
|
f5855e539b
|
Fix tagger model loading
|
2017-11-01 20:42:36 +01:00 |
|
Matthew Honnibal
|
624644adfe
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 20:26:41 +01:00 |
|
ines
|
5f661a1b3a
|
Remove tensorizer from pre-set pipe_names
|
2017-11-01 19:48:33 +01:00 |
|
Matthew Honnibal
|
190522efd3
|
Fix tagger when some tags aren't in Morphology
|
2017-11-01 19:27:49 +01:00 |
|
Matthew Honnibal
|
e85e31cfbd
|
Fix backprop of d_pad
|
2017-11-01 19:27:26 +01:00 |
|
Matthew Honnibal
|
759cc79185
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 19:00:19 +01:00 |
|
Matthew Honnibal
|
1ae40b50b4
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 17:07:02 +01:00 |
|
Matthew Honnibal
|
7ae1aacdb8
|
Fix add_label methods
|
2017-11-01 17:06:43 +01:00 |
|
ines
|
8c2260e18c
|
Move span tests to /doc
|
2017-11-01 16:56:35 +01:00 |
|
Matthew Honnibal
|
2ef7b59eb0
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 16:51:41 +01:00 |
|
ines
|
1d1f91a041
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 16:49:44 +01:00 |
|
ines
|
9659391944
|
Update deprecated methods and add warnings
|
2017-11-01 16:49:42 +01:00 |
|
ines
|
260cb37224
|
Catch deprecation warning
|
2017-11-01 16:49:18 +01:00 |
|
ines
|
5914faafbb
|
Fix .merge tests to not use deprecated API
|
2017-11-01 16:49:11 +01:00 |
|
ines
|
705a4e3e4a
|
Fix formatting
|
2017-11-01 16:44:08 +01:00 |
|
Matthew Honnibal
|
d17a12c71d
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 16:38:26 +01:00 |
|
Matthew Honnibal
|
9f9439667b
|
Don't create low-data text classifier if no vectors
|
2017-11-01 16:34:09 +01:00 |
|
Matthew Honnibal
|
e7a9174877
|
Add add_label methods to Tagger and TextCategorizer
|
2017-11-01 16:32:44 +01:00 |
|
ines
|
39e0586192
|
Add deprecated helper
Uses warning to show DeprecationWarning and custom stack trace
|
2017-11-01 16:32:36 +01:00 |
|
Matthew Honnibal
|
a7bf38bf31
|
Remove misleading comment on util.get_cuda_stream()
|
2017-11-01 13:57:25 +01:00 |
|
Matthew Honnibal
|
273e96b63f
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 13:27:35 +01:00 |
|
Matthew Honnibal
|
9e0ebee81c
|
Add Token.is_sent_start property, so can deprecate Token.sent_start
|
2017-11-01 13:27:14 +01:00 |
|
Matthew Honnibal
|
7e7116cdf7
|
Fix Doc.to_array when only one string attr provided
|
2017-11-01 13:26:43 +01:00 |
|
Matthew Honnibal
|
301fb2bb60
|
Implement Span.n_lefts and Span.n_rights
|
2017-11-01 13:25:12 +01:00 |
|
Matthew Honnibal
|
c047498f87
|
Fix vectors test
|
2017-11-01 13:24:47 +01:00 |
|
ines
|
9a5e7c6fe2
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 13:14:45 +01:00 |
|
ines
|
bfe17b7df1
|
Fix begin_training if get_gold_tuples is None
|
2017-11-01 13:14:31 +01:00 |
|
ines
|
affd3404ab
|
Remove old model command (now "vocab")
|
2017-11-01 13:14:03 +01:00 |
|
Matthew Honnibal
|
fdb4b8e456
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 02:07:17 +01:00 |
|
Matthew Honnibal
|
c48dd0e1d3
|
Fix vector pruning
|
2017-11-01 02:06:58 +01:00 |
|
ines
|
37e62ab0e2
|
Update vector meta in meta.json
|
2017-11-01 01:25:09 +01:00 |
|
ines
|
96b4aef0bf
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 01:10:53 +01:00 |
|
Matthew Honnibal
|
86eba61fae
|
Fix token.vector when vectors are missing
|
2017-11-01 00:47:35 +01:00 |
|
ines
|
5683fd65ed
|
Update docstrings
|
2017-11-01 00:42:39 +01:00 |
|
Matthew Honnibal
|
44bce8e53f
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-11-01 00:35:16 +01:00 |
|
Matthew Honnibal
|
c16310d156
|
Update vectors with find method
|
2017-11-01 00:34:55 +01:00 |
|
Ines Montani
|
d11659463b
|
Merge pull request #1152 from jimregan/develop-irish
[WIP] attempt a port from #1147
|
2017-11-01 00:23:43 +01:00 |
|