Matthew Honnibal
|
c6395b057a
|
Improve parser feature extraction, for missing values
|
2017-09-14 16:18:02 +02:00 |
|
Matthew Honnibal
|
daf869ab3b
|
Fix add_action for NER, so labelled 'O' actions aren't added
|
2017-09-14 16:16:41 +02:00 |
|
Matthew Honnibal
|
683d81bb49
|
Update example for adding entity type
|
2017-09-14 16:15:59 +02:00 |
|
Ines Montani
|
4bee26188d
|
Merge pull request #1323 from galaxyh/master
Set the "cut_all" parameter in jieba.cut() to False, or jieba will return ALL POSSIBLE word segmentations.
|
2017-09-14 15:23:41 +02:00 |
|
Matthew Honnibal
|
9cb2aef587
|
Remove print statement
|
2017-09-14 13:38:28 +02:00 |
|
Matthew Honnibal
|
ba23d63c35
|
Fix minibatch function, for fixed batch size
|
2017-09-14 13:37:41 +02:00 |
|
Ines Montani
|
b8e81daccf
|
Fix typo (closes #1312)
|
2017-09-14 12:49:59 +02:00 |
|
Yu-chun Huang
|
7692b8c071
|
Update __init__.py
Set the "cut_all" parameter to False, or jieba will return ALL POSSIBLE word segmentations.
|
2017-09-12 16:23:47 +08:00 |
|
Jim O'Regan
|
7de709483b
|
missed adding here
|
2017-09-11 10:51:21 +01:00 |
|
Jim O'Regan
|
b1b6123867
|
add ga_tokenizer
|
2017-09-11 10:31:41 +01:00 |
|
Jim O'Regan
|
9dfd301962
|
rearrange
|
2017-09-11 10:14:18 +01:00 |
|
Jim O'Regan
|
187be6d372
|
copy/paste error
|
2017-09-11 09:33:17 +01:00 |
|
Jim O'Regan
|
c283e9edfe
|
first stab at test
|
2017-09-11 08:57:48 +01:00 |
|
Jim O'Regan
|
1ee75ae337
|
Merge remote-tracking branch 'origin/develop' into develop-irish
|
2017-09-11 08:40:11 +01:00 |
|
Matthew Honnibal
|
ddaff6ca56
|
Merge pull request #1287 from IamJeffG/feature/1226-more-complete-noun-chunks
Capture more noun chunks
|
2017-09-08 07:59:10 +02:00 |
|
Matthew Honnibal
|
456bb8a74c
|
Unxfail and close #1305
|
2017-09-06 19:14:17 +02:00 |
|
Matthew Honnibal
|
99e44fbdbb
|
Update regression test
|
2017-09-06 19:13:51 +02:00 |
|
Matthew Honnibal
|
5c3ff06924
|
Fix lemmatizer rules
|
2017-09-06 19:13:24 +02:00 |
|
Matthew Honnibal
|
dd9cab0faf
|
Fix type-check for int/long
|
2017-09-06 19:03:05 +02:00 |
|
Matthew Honnibal
|
497a9308a8
|
Xfail new lemmatizer test
|
2017-09-06 18:41:22 +02:00 |
|
Matthew Honnibal
|
dcbf866970
|
Merge parser changes
|
2017-09-06 18:41:05 +02:00 |
|
Matthew Honnibal
|
5384fff5ce
|
Add test for 1305: Incorrect lemmatization of VBZ for English
|
2017-09-06 18:40:18 +02:00 |
|
Matthew Honnibal
|
95bca20c17
|
Revert changes to spacy/cli/train.py from branch
|
2017-09-06 05:52:32 -05:00 |
|
Matthew Honnibal
|
24ff6b0ad9
|
Fix parsing and tok2vec models
|
2017-09-06 05:50:58 -05:00 |
|
Matthew Honnibal
|
c537154b21
|
Revert gold pre-processing to True
|
2017-09-06 04:59:08 -05:00 |
|
Matthew Honnibal
|
167f6a8938
|
Revert noise-level back to default 0.0
|
2017-09-06 04:58:33 -05:00 |
|
Matthew Honnibal
|
1b65115bc2
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 20:02:53 -05:00 |
|
Matthew Honnibal
|
33fa91feb7
|
Restore correctness of parser model
|
2017-09-04 21:19:30 +02:00 |
|
Matthew Honnibal
|
e88a42e460
|
Increment version
|
2017-09-04 21:14:39 +02:00 |
|
Matthew Honnibal
|
45029a550e
|
Fix customized-tokenizer tests
|
2017-09-04 20:13:13 +02:00 |
|
Matthew Honnibal
|
48f4abdcf2
|
Update travis, removing pypi build
|
2017-09-04 20:05:37 +02:00 |
|
Matthew Honnibal
|
6bd0a0df9a
|
Update travis
|
2017-09-04 19:49:35 +02:00 |
|
Matthew Honnibal
|
34c585396a
|
Merge pull request #1294 from Vimos/master
Fix issue #1292 and add test case for the Assertion Error
|
2017-09-04 19:20:40 +02:00 |
|
Matthew Honnibal
|
d9c609c0f5
|
Update travis
|
2017-09-04 19:01:38 +02:00 |
|
Matthew Honnibal
|
c68f188eb0
|
Fix error on test
|
2017-09-04 18:59:36 +02:00 |
|
Matthew Honnibal
|
33313c01ad
|
Merge pull request #1298 from ericzhao28/master
Lowest common ancestor matrix for spans and docs
|
2017-09-04 18:57:54 +02:00 |
|
Matthew Honnibal
|
3ba9994f1f
|
Update travis
|
2017-09-04 18:44:23 +02:00 |
|
Matthew Honnibal
|
d47af99561
|
Update travis.yml
|
2017-09-04 18:43:33 +02:00 |
|
Matthew Honnibal
|
66646ead26
|
Update travis
|
2017-09-04 18:14:15 +02:00 |
|
Matthew Honnibal
|
9d65d67985
|
Preserve model compatibility in parser, for now
|
2017-09-04 16:46:22 +02:00 |
|
Matthew Honnibal
|
d5fbf27335
|
Fix test
|
2017-09-04 16:45:11 +02:00 |
|
Matthew Honnibal
|
7fdafcc4c4
|
Fix config loading in tagger
|
2017-09-04 16:38:49 +02:00 |
|
Matthew Honnibal
|
058372d120
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 16:27:53 +02:00 |
|
Matthew Honnibal
|
16e25ce3b5
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-04 09:26:53 -05:00 |
|
Matthew Honnibal
|
9f512e657a
|
Fix drop_layer calculation
|
2017-09-04 09:26:38 -05:00 |
|
Matthew Honnibal
|
e8a26ebfab
|
Add efficiency note to new get_lca_matrix() method
|
2017-09-04 15:43:52 +02:00 |
|
Matthew Honnibal
|
cb4839033c
|
Fix loader for EN tests
|
2017-09-04 15:19:18 +02:00 |
|
Matthew Honnibal
|
382ce566eb
|
Fix deserialization bug
|
2017-09-04 15:19:01 +02:00 |
|
Matthew Honnibal
|
bfddf50081
|
Fix #1296: Incorrect lemmatization of base form verbs
|
2017-09-04 15:18:41 +02:00 |
|
Matthew Honnibal
|
b29e6bff46
|
Improve lemmatization rule for am|VBP
|
2017-09-04 15:18:10 +02:00 |
|