Matthew Honnibal
|
c6395b057a
|
Improve parser feature extraction, for missing values
|
2017-09-14 16:18:02 +02:00 |
|
Matthew Honnibal
|
daf869ab3b
|
Fix add_action for NER, so labelled 'O' actions aren't added
|
2017-09-14 16:16:41 +02:00 |
|
Matthew Honnibal
|
9cb2aef587
|
Remove print statement
|
2017-09-14 13:38:28 +02:00 |
|
Matthew Honnibal
|
ba23d63c35
|
Fix minibatch function, for fixed batch size
|
2017-09-14 13:37:41 +02:00 |
|
Matthew Honnibal
|
456bb8a74c
|
Unxfail and close #1305
|
2017-09-06 19:14:17 +02:00 |
|
Matthew Honnibal
|
99e44fbdbb
|
Update regression test
|
2017-09-06 19:13:51 +02:00 |
|
Matthew Honnibal
|
5c3ff06924
|
Fix lemmatizer rules
|
2017-09-06 19:13:24 +02:00 |
|
Matthew Honnibal
|
dd9cab0faf
|
Fix type-check for int/long
|
2017-09-06 19:03:05 +02:00 |
|
Matthew Honnibal
|
497a9308a8
|
Xfail new lemmatizer test
|
2017-09-06 18:41:22 +02:00 |
|
Matthew Honnibal
|
dcbf866970
|
Merge parser changes
|
2017-09-06 18:41:05 +02:00 |
|
Matthew Honnibal
|
5384fff5ce
|
Add test for 1305: Incorrect lemmatization of VBZ for English
|
2017-09-06 18:40:18 +02:00 |
|
Matthew Honnibal
|
24ff6b0ad9
|
Fix parsing and tok2vec models
|
2017-09-06 05:50:58 -05:00 |
|
Matthew Honnibal
|
1b65115bc2
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 20:02:53 -05:00 |
|
Matthew Honnibal
|
33fa91feb7
|
Restore correctness of parser model
|
2017-09-04 21:19:30 +02:00 |
|
Matthew Honnibal
|
e88a42e460
|
Increment version
|
2017-09-04 21:14:39 +02:00 |
|
Matthew Honnibal
|
9d65d67985
|
Preserve model compatibility in parser, for now
|
2017-09-04 16:46:22 +02:00 |
|
Matthew Honnibal
|
d5fbf27335
|
Fix test
|
2017-09-04 16:45:11 +02:00 |
|
Matthew Honnibal
|
7fdafcc4c4
|
Fix config loading in tagger
|
2017-09-04 16:38:49 +02:00 |
|
Matthew Honnibal
|
058372d120
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 16:27:53 +02:00 |
|
Matthew Honnibal
|
16e25ce3b5
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 09:26:53 -05:00 |
|
Matthew Honnibal
|
9f512e657a
|
Fix drop_layer calculation
|
2017-09-04 09:26:38 -05:00 |
|
Matthew Honnibal
|
cb4839033c
|
Fix loader for EN tests
|
2017-09-04 15:19:18 +02:00 |
|
Matthew Honnibal
|
382ce566eb
|
Fix deserialization bug
|
2017-09-04 15:19:01 +02:00 |
|
Matthew Honnibal
|
bfddf50081
|
Fix #1296: Incorrect lemmatization of base form verbs
|
2017-09-04 15:18:41 +02:00 |
|
Matthew Honnibal
|
b29e6bff46
|
Improve lemmatization rule for am|VBP
|
2017-09-04 15:18:10 +02:00 |
|
Matthew Honnibal
|
644d6c9e1a
|
Improve lemmatization tests, re #1296
|
2017-09-04 15:17:44 +02:00 |
|
Matthew Honnibal
|
3cf3fa1704
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-02 12:46:11 -05:00 |
|
Matthew Honnibal
|
e920885676
|
Fix pickle during train
|
2017-09-02 12:46:01 -05:00 |
|
Matthew Honnibal
|
c0eaba8b28
|
Fix low-data textcat
|
2017-09-02 15:17:32 +02:00 |
|
Matthew Honnibal
|
9e378bdac5
|
Fix textcat serialization
|
2017-09-02 15:17:20 +02:00 |
|
Matthew Honnibal
|
e3ea6ee02b
|
Increment version
|
2017-09-02 15:17:01 +02:00 |
|
Matthew Honnibal
|
a3b69bcb3d
|
Add low_data mode in textcat
|
2017-09-02 14:56:30 +02:00 |
|
Matthew Honnibal
|
ead78c7b9b
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-02 12:55:25 +02:00 |
|
Matthew Honnibal
|
5e6a9e7dcc
|
Add rule-based SBD
|
2017-09-02 12:53:38 +02:00 |
|
Matthew Honnibal
|
a824cf8f9a
|
Adjust text classification model
|
2017-09-02 11:41:00 +02:00 |
|
Matthew Honnibal
|
ac040b99bb
|
Add support for pre-trained vectors in text classifier
|
2017-09-01 16:39:55 +02:00 |
|
Matthew Honnibal
|
7742a6d559
|
Add GloVe vectors reader
|
2017-09-01 16:39:22 +02:00 |
|
Matthew Honnibal
|
789e1a3980
|
Use 13 parser features, not 8
|
2017-08-31 14:13:00 -05:00 |
|
Matthew Honnibal
|
30e35d9666
|
Fix syntax error
|
2017-08-30 17:35:39 -05:00 |
|
Matthew Honnibal
|
4ceebde523
|
Fix gradient bug in parser
|
2017-08-30 17:32:56 -05:00 |
|
ines
|
173089a45a
|
Add more validation for model meta
|
2017-08-29 11:21:46 +02:00 |
|
Matthew Honnibal
|
2e28982e28
|
Merge pull request #1288 from geovedi/indonesian
Indonesian language support
|
2017-08-26 21:31:13 +02:00 |
|
ines
|
7e04b7f89c
|
Fix info text on pipeline in package cli
|
2017-08-26 18:30:59 +02:00 |
|
ines
|
40afa13a8a
|
Increment version
|
2017-08-26 18:30:49 +02:00 |
|
Matthew Honnibal
|
876f38c548
|
Merge pull request #1279 from oroszgy/model_cli_v2
Added vector loading to model cli
|
2017-08-26 15:57:50 +02:00 |
|
Matthew Honnibal
|
cfc055734e
|
Split % in units, for compatibility with corpus
|
2017-08-25 20:03:37 -05:00 |
|
Matthew Honnibal
|
4bb6bc3f9e
|
Add support for sent_start to GoldParse
|
2017-08-25 20:03:14 -05:00 |
|
Matthew Honnibal
|
44589fb38c
|
Fix Break oracle
|
2017-08-25 19:50:55 -05:00 |
|
Matthew Honnibal
|
6d4e8e14ca
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-25 12:37:16 -05:00 |
|
Matthew Honnibal
|
4ce5531389
|
Use layer norm instead of batch norm
|
2017-08-25 12:37:10 -05:00 |
|
Matthew Honnibal
|
20dd66ddc2
|
Constrain sentence boundaries to IS_PUNCT and IS_SPACE tokens
|
2017-08-25 19:35:47 +02:00 |
|
Jim Geovedi
|
58d8078971
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-25 09:21:49 +08:00 |
|
Matthew Honnibal
|
6ceb0f0518
|
Allow Lexeme.rank to be set
|
2017-08-24 21:43:00 +02:00 |
|
Matthew Honnibal
|
44a1fa80d3
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-23 13:02:16 +02:00 |
|
ines
|
bb1abbeba5
|
Only link model if download was successfull
|
2017-08-23 12:36:31 +02:00 |
|
Matthew Honnibal
|
bb2541ffd3
|
Fix PROB attr for OOV words
|
2017-08-23 12:11:52 +02:00 |
|
Matthew Honnibal
|
1c5c256e58
|
Fix fine_tune when optimizer is None
|
2017-08-23 10:51:33 +02:00 |
|
Matthew Honnibal
|
9c580ad28a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-22 17:02:04 -05:00 |
|
Matthew Honnibal
|
a4633fff6f
|
Restore use of batch norm in model
|
2017-08-22 17:01:58 -05:00 |
|
Matthew Honnibal
|
03b5b9727a
|
Fix Doc.vector for empty doc objects
|
2017-08-22 19:52:19 +02:00 |
|
Matthew Honnibal
|
0551b7b03a
|
Fix doc.vector
|
2017-08-22 19:46:52 +02:00 |
|
Matthew Honnibal
|
83f8e98450
|
Fix retrieval of OOV vectors
|
2017-08-22 19:46:35 +02:00 |
|
Matthew Honnibal
|
df2745eb08
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-22 19:00:43 +02:00 |
|
Matthew Honnibal
|
5b329acbf2
|
Fix vectors_length property in vocab
|
2017-08-22 19:00:27 +02:00 |
|
Matthew Honnibal
|
1fe605dfe5
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-21 19:18:31 -05:00 |
|
Matthew Honnibal
|
18b64e79ec
|
Fix fine tuning
|
2017-08-21 19:18:26 -05:00 |
|
Matthew Honnibal
|
682346dd66
|
Restore optimized hidden_depth=0 for parser
|
2017-08-21 19:18:04 -05:00 |
|
Matthew Honnibal
|
a21d8f3f0b
|
Add predict paths to _ml models
|
2017-08-21 23:23:45 +02:00 |
|
Matthew Honnibal
|
cec76801dc
|
Add profile command to CLI
|
2017-08-21 23:23:05 +02:00 |
|
Matthew Honnibal
|
7be5f30f17
|
Add profile function
|
2017-08-21 23:22:49 +02:00 |
|
ines
|
a68dc891ea
|
Port over changes from #1281
|
2017-08-21 23:19:18 +02:00 |
|
Matthew Honnibal
|
5e50a65252
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-21 14:15:46 -05:00 |
|
Matthew Honnibal
|
80acbc5f1f
|
Fix fine-tune weight mixture
|
2017-08-21 14:15:29 -05:00 |
|
ines
|
d15775c3ad
|
Fix typos and commands in alpha docs
|
2017-08-21 13:40:11 +02:00 |
|
Gyorgy Orosz
|
b3576bfc86
|
Added vector leading to model cli
|
2017-08-20 23:16:12 +02:00 |
|
Matthew Honnibal
|
c10f63bf10
|
Initialize fine tuning to 0.5
|
2017-08-20 15:59:48 -05:00 |
|
Matthew Honnibal
|
62878e50db
|
Fix misalignment caued by filtering inputs at wrong point in parser
|
2017-08-20 15:59:28 -05:00 |
|
Matthew Honnibal
|
78a5f842e9
|
Fix update when update_shared=False
|
2017-08-20 15:58:34 -05:00 |
|
Matthew Honnibal
|
7a6edeea68
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 12:55:39 -05:00 |
|
Matthew Honnibal
|
f2f9229964
|
Fix name of update_shared flag
|
2017-08-20 18:19:06 +02:00 |
|
Matthew Honnibal
|
8a59718fd6
|
Fix fine-tuning
|
2017-08-20 18:17:35 +02:00 |
|
Matthew Honnibal
|
80a5146ec2
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 11:07:08 -05:00 |
|
Matthew Honnibal
|
84bb543e4d
|
Add gold_preproc flag to cli/train
|
2017-08-20 11:07:00 -05:00 |
|
Matthew Honnibal
|
3fe0d76e6d
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 14:50:01 +02:00 |
|
Matthew Honnibal
|
c1d3ff517a
|
Track loss in tagger
|
2017-08-20 14:42:23 +02:00 |
|
Matthew Honnibal
|
8875590081
|
Add optimizer in Language.update if sgd=None
|
2017-08-20 14:42:07 +02:00 |
|
Matthew Honnibal
|
84b7ed49e4
|
Ensure updates aren't made if no gold available
|
2017-08-20 14:41:38 +02:00 |
|
Ines Montani
|
c2bbd393af
|
Merge pull request #1276 from oroszgy/model_cli_v2
Ported model cli from v1
|
2017-08-20 11:52:59 +02:00 |
|
Jim Geovedi
|
f77443ab68
|
reworked
|
2017-08-20 13:43:21 +07:00 |
|
Jim Geovedi
|
fbc62a09c7
|
added {pre,suf,in}fix tests
|
2017-08-20 13:43:00 +07:00 |
|
Jim Geovedi
|
713d7c0aa0
|
added indonesian lang test
|
2017-08-20 12:17:14 +07:00 |
|
Jim Geovedi
|
b7d83f37c8
|
indonesian abbr.
|
2017-08-20 12:16:50 +07:00 |
|
Jim Geovedi
|
7193c47f0b
|
direct lookup
|
2017-08-20 11:57:52 +07:00 |
|
Jim Geovedi
|
fdf802d505
|
added examples
|
2017-08-20 11:57:10 +07:00 |
|
Jim Geovedi
|
fa544e6c9a
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-20 11:49:40 +07:00 |
|
Matthew Honnibal
|
42fa84075f
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-19 22:42:50 +02:00 |
|
Matthew Honnibal
|
aefef6fd28
|
Prevent strings from being lost during from_disk and from_bytes
|
2017-08-19 22:42:17 +02:00 |
|
ines
|
281e7e58b3
|
Don't escape forward slashes on ujson.dumps
|
2017-08-19 22:32:16 +02:00 |
|
ines
|
2d126a00ae
|
Fix typo
|
2017-08-19 22:32:07 +02:00 |
|
Matthew Honnibal
|
41c2218c53
|
Fix test for vectors
|
2017-08-19 22:09:12 +02:00 |
|