Commit Graph

7682 Commits

Author SHA1 Message Date
Matthew Honnibal
3ca16ddbd4 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-11-04 00:25:02 +01:00
Matthew Honnibal
e4ec4be948 Fix parser test 2017-11-04 00:23:45 +01:00
Matthew Honnibal
98c29b7912 Add padding vector in parser, to make gradient more correct 2017-11-04 00:23:23 +01:00
ines
2639ecd5f8 Add docs note on custom tokenizer rules (see #1491) 2017-11-03 23:33:18 +01:00
ines
5e7d98f72a Remove test for #1491 2017-11-03 22:10:57 +01:00
ines
718f1c50fb Add regression test for #1491 2017-11-03 21:11:20 +01:00
Matthew Honnibal
144a93c2a5 Back-off to tensor for similarity if no vectors 2017-11-03 20:56:33 +01:00
Matthew Honnibal
1e9634691a Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-11-03 20:21:15 +01:00
Matthew Honnibal
13c8881d2f Expose parser's tok2vec model component 2017-11-03 20:20:59 +01:00
Matthew Honnibal
17c63906f9 Update tensorizer component 2017-11-03 20:20:26 +01:00
Matthew Honnibal
2bf21cbe29 Update model after optimising it instead of waiting 2017-11-03 20:20:01 +01:00
Matthew Honnibal
d6e831bf89 Fix lemmatizer tests 2017-11-03 19:46:34 +01:00
ines
380f2441b4 Fix script includes 2017-11-03 18:51:03 +01:00
ines
eef930c73e Assert instead of print 2017-11-03 18:50:57 +01:00
ines
f0986df94b Add test for #1488 (passes on v2.0.0a18?) 2017-11-03 14:44:36 +01:00
Matthew Honnibal
711278b667 Make test less flakey 2017-11-03 14:36:08 +01:00
Matthew Honnibal
7fea845374 Remove print statement 2017-11-03 14:04:51 +01:00
Matthew Honnibal
0a534ae96a Fix test for backprop d_pad 2017-11-03 14:04:16 +01:00
Matthew Honnibal
33bd2428db Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-11-03 13:29:56 +01:00
Matthew Honnibal
6681058abd Fix tensor extending in tagger 2017-11-03 13:29:36 +01:00
Matthew Honnibal
bd2cbdfa85 Make Morphology not fail on unknown tags 2017-11-03 13:29:09 +01:00
Ines Montani
540374bf64 Merge pull request #1481 from IamJeffG/contributing-agreement-iamjeffg
Contributing agreement - IamJeffG
2017-11-03 12:49:21 +01:00
Ines Montani
58bb4bd484 Merge pull request #1486 from abhi18av/patch-1
Minor typo [ nad => and ]
2017-11-03 12:49:07 +01:00
Abhinav Sharma
2aaf5315f3 Filled the details of the contribution license 2017-11-03 16:56:58 +05:30
Abhinav Sharma
c740277f9f Minor typo [ nad => and ] 2017-11-03 16:30:44 +05:30
ines
7986ada1cb Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-11-03 11:29:36 +01:00
ines
1e16374687 Update models list to reflect spaCy v2.0.0a18 2017-11-03 11:29:34 +01:00
ines
a62b0727d8 Tidy up and always use bundle in built site for now
Just to be safe
2017-11-03 11:29:21 +01:00
ines
d0f88af5b6 Hide error earlier 2017-11-03 11:29:04 +01:00
Matthew Honnibal
c9b118a7e9 Set softmax attr in tagger model 2017-11-03 11:22:01 +01:00
Matthew Honnibal
a5b05f85f0 Set Doc.tensor attribute in parser 2017-11-03 11:21:00 +01:00
Matthew Honnibal
62ed58935a Add Doc.extend_tensor() method 2017-11-03 11:20:31 +01:00
Matthew Honnibal
d6fc39c8a6 Set Doc.tensor from Tagger 2017-11-03 11:20:05 +01:00
Matthew Honnibal
b3264aa5f0 Expose the softmax layer in the tagger model, to allow setting tensors 2017-11-03 11:19:51 +01:00
Matthew Honnibal
c2bbf076a4 Add document length cap for training 2017-11-03 01:54:54 +01:00
Matthew Honnibal
6771780d3f Fix backprop of padding variable 2017-11-03 01:54:34 +01:00
Matthew Honnibal
54a716f2ec Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-11-03 00:55:20 +01:00
Matthew Honnibal
260e6ee3fb Improve efficiency of backprop of padding variable 2017-11-03 00:49:11 +01:00
Matthew Honnibal
a22f96c3f1 Add test for backpropagating padding 2017-11-03 00:48:54 +01:00
ines
43512c68b2 Fix vector details in model overview 2017-11-02 20:04:13 +01:00
Jeffrey Gerard
3dc3f10a40 Contributing agreement - IamJeffG 2017-11-02 09:28:26 -07:00
ines
9baab241b4 Add skeleton language data for Turkish 2017-11-02 16:32:24 +01:00
ines
31e349a62c Update model families 2017-11-02 16:13:38 +01:00
ines
15cbc61a6e Adjust rendering of large numbers
1234 -> 1.2k
12345 -> 12.3k
123456 -> 123k
1234567 -> 1.2m
2017-11-02 16:13:18 +01:00
ines
391fce09d9 Update licenses 2017-11-01 23:04:40 +01:00
ines
c6fea3e5f6 Add Romanian and Croatian skeletons (experimental)
Add language data templates to make it easier for others to contribute to the language support
2017-11-01 23:04:28 +01:00
ines
18c859500b Add missing imports 2017-11-01 23:02:51 +01:00
ines
819e30a26e Tidy up tokenizer exceptions 2017-11-01 23:02:45 +01:00
ines
3af281a334 Update test model name 2017-11-01 23:02:00 +01:00
ines
408f450ce0 Tidy up 2017-11-01 23:01:12 +01:00
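
Note on commit 15cbc61a6e (Adjust rendering of large numbers): the commit message lists four input/output pairs but not the code behind them. The sketch below is one way to reproduce those four pairs in Python. The function name abbreviate_count and the threshold choices are assumptions made for illustration, not the implementation used in the spaCy website, and the behaviour for values other than the four listed is a guess.

```python
def abbreviate_count(n: int) -> str:
    """Abbreviate a non-negative integer for display (hypothetical helper).

    Thresholds are assumptions chosen to match the four examples
    given in the commit message; they are not taken from the spaCy source.
    """
    if n >= 1_000_000:
        # Millions: one decimal place, e.g. 1234567 -> "1.2m"
        return f"{n / 1_000_000:.1f}m"
    if n >= 100_000:
        # Hundreds of thousands: no decimals, e.g. 123456 -> "123k"
        return f"{n // 1000}k"
    if n >= 1000:
        # Thousands: one decimal place, e.g. 1234 -> "1.2k", 12345 -> "12.3k"
        return f"{n / 1000:.1f}k"
    # Below one thousand: shown unchanged
    return str(n)


for value in (1234, 12345, 123456, 1234567):
    print(value, "->", abbreviate_count(value))
# 1234 -> 1.2k
# 12345 -> 12.3k
# 123456 -> 123k
# 1234567 -> 1.2m
```

The hundreds-of-thousands branch truncates rather than rounds; the commit message does not say which was intended, and the single example (123456 -> 123k) is consistent with both.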