Jim O'Regan
|
b1b6123867
|
add ga_tokenizer
|
2017-09-11 10:31:41 +01:00 |
|
Jim O'Regan
|
9dfd301962
|
rearrange
|
2017-09-11 10:14:18 +01:00 |
|
Jim O'Regan
|
187be6d372
|
copy/paste error
|
2017-09-11 09:33:17 +01:00 |
|
Jim O'Regan
|
c283e9edfe
|
first stab at test
|
2017-09-11 08:57:48 +01:00 |
|
Jim O'Regan
|
1ee75ae337
|
Merge remote-tracking branch 'origin/develop' into develop-irish
|
2017-09-11 08:40:11 +01:00 |
|
Matthew Honnibal
|
456bb8a74c
|
Unxfail and close #1305
|
2017-09-06 19:14:17 +02:00 |
|
Matthew Honnibal
|
99e44fbdbb
|
Update regression test
|
2017-09-06 19:13:51 +02:00 |
|
Matthew Honnibal
|
5c3ff06924
|
Fix lemmatizer rules
|
2017-09-06 19:13:24 +02:00 |
|
Matthew Honnibal
|
dd9cab0faf
|
Fix type-check for int/long
|
2017-09-06 19:03:05 +02:00 |
|
Matthew Honnibal
|
497a9308a8
|
Xfail new lemmatizer test
|
2017-09-06 18:41:22 +02:00 |
|
Matthew Honnibal
|
dcbf866970
|
Merge parser changes
|
2017-09-06 18:41:05 +02:00 |
|
Matthew Honnibal
|
5384fff5ce
|
Add test for 1305: Incorrect lemmatization of VBZ for English
|
2017-09-06 18:40:18 +02:00 |
|
Matthew Honnibal
|
24ff6b0ad9
|
Fix parsing and tok2vec models
|
2017-09-06 05:50:58 -05:00 |
|
Matthew Honnibal
|
33fa91feb7
|
Restore correctness of parser model
|
2017-09-04 21:19:30 +02:00 |
|
Matthew Honnibal
|
e88a42e460
|
Increment version
|
2017-09-04 21:14:39 +02:00 |
|
Matthew Honnibal
|
9d65d67985
|
Preserve model compatibility in parser, for now
|
2017-09-04 16:46:22 +02:00 |
|
Matthew Honnibal
|
d5fbf27335
|
Fix test
|
2017-09-04 16:45:11 +02:00 |
|
Matthew Honnibal
|
7fdafcc4c4
|
Fix config loading in tagger
|
2017-09-04 16:38:49 +02:00 |
|
Matthew Honnibal
|
058372d120
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 16:27:53 +02:00 |
|
Matthew Honnibal
|
16e25ce3b5
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 09:26:53 -05:00 |
|
Matthew Honnibal
|
9f512e657a
|
Fix drop_layer calculation
|
2017-09-04 09:26:38 -05:00 |
|
Matthew Honnibal
|
cb4839033c
|
Fix loader for EN tests
|
2017-09-04 15:19:18 +02:00 |
|
Matthew Honnibal
|
382ce566eb
|
Fix deserialization bug
|
2017-09-04 15:19:01 +02:00 |
|
Matthew Honnibal
|
bfddf50081
|
Fix #1296: Incorrect lemmatization of base form verbs
|
2017-09-04 15:18:41 +02:00 |
|
Matthew Honnibal
|
b29e6bff46
|
Improve lemmatization rule for am|VBP
|
2017-09-04 15:18:10 +02:00 |
|
Matthew Honnibal
|
644d6c9e1a
|
Improve lemmatization tests, re #1296
|
2017-09-04 15:17:44 +02:00 |
|
Matthew Honnibal
|
c0eaba8b28
|
Fix low-data textcat
|
2017-09-02 15:17:32 +02:00 |
|
Matthew Honnibal
|
9e378bdac5
|
Fix textcat serialization
|
2017-09-02 15:17:20 +02:00 |
|
Matthew Honnibal
|
e3ea6ee02b
|
Increment version
|
2017-09-02 15:17:01 +02:00 |
|
Matthew Honnibal
|
a3b69bcb3d
|
Add low_data mode in textcat
|
2017-09-02 14:56:30 +02:00 |
|
Matthew Honnibal
|
ead78c7b9b
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-02 12:55:25 +02:00 |
|
Matthew Honnibal
|
5e6a9e7dcc
|
Add rule-based SBD
|
2017-09-02 12:53:38 +02:00 |
|
Matthew Honnibal
|
a824cf8f9a
|
Adjust text classification model
|
2017-09-02 11:41:00 +02:00 |
|
Matthew Honnibal
|
ac040b99bb
|
Add support for pre-trained vectors in text classifier
|
2017-09-01 16:39:55 +02:00 |
|
Matthew Honnibal
|
7742a6d559
|
Add GloVe vectors reader
|
2017-09-01 16:39:22 +02:00 |
|
Matthew Honnibal
|
789e1a3980
|
Use 13 parser features, not 8
|
2017-08-31 14:13:00 -05:00 |
|
Matthew Honnibal
|
30e35d9666
|
Fix syntax error
|
2017-08-30 17:35:39 -05:00 |
|
Matthew Honnibal
|
4ceebde523
|
Fix gradient bug in parser
|
2017-08-30 17:32:56 -05:00 |
|
ines
|
173089a45a
|
Add more validation for model meta
|
2017-08-29 11:21:46 +02:00 |
|
Matthew Honnibal
|
2e28982e28
|
Merge pull request #1288 from geovedi/indonesian
Indonesian language support
|
2017-08-26 21:31:13 +02:00 |
|
ines
|
7e04b7f89c
|
Fix info text on pipeline in package cli
|
2017-08-26 18:30:59 +02:00 |
|
ines
|
40afa13a8a
|
Increment version
|
2017-08-26 18:30:49 +02:00 |
|
Matthew Honnibal
|
876f38c548
|
Merge pull request #1279 from oroszgy/model_cli_v2
Added vector loading to model cli
|
2017-08-26 15:57:50 +02:00 |
|
Matthew Honnibal
|
cfc055734e
|
Split % in units, for compatibility with corpus
|
2017-08-25 20:03:37 -05:00 |
|
Matthew Honnibal
|
4bb6bc3f9e
|
Add support for sent_start to GoldParse
|
2017-08-25 20:03:14 -05:00 |
|
Matthew Honnibal
|
44589fb38c
|
Fix Break oracle
|
2017-08-25 19:50:55 -05:00 |
|
Matthew Honnibal
|
6d4e8e14ca
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-25 12:37:16 -05:00 |
|
Matthew Honnibal
|
4ce5531389
|
Use layer norm instead of batch norm
|
2017-08-25 12:37:10 -05:00 |
|
Matthew Honnibal
|
20dd66ddc2
|
Constrain sentence boundaries to IS_PUNCT and IS_SPACE tokens
|
2017-08-25 19:35:47 +02:00 |
|
Jim Geovedi
|
58d8078971
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-25 09:21:49 +08:00 |
|