Matthew Honnibal
|
c003c561c3
|
Revert NER action loading change, for model compatibility
|
2017-09-17 05:46:03 -05:00 |
|
Matthew Honnibal
|
43210abacc
|
Resolve fine-tuning conflict
|
2017-09-17 05:30:04 -05:00 |
|
ines
|
ece30c28a8
|
Don't split hyphenated words in German
This way, the tokenizer matches the tokenization in German treebanks
|
2017-09-16 20:40:15 +02:00 |
|
ines
|
68f66aebf8
|
Use pkg_resources instead of pip for is_package (resolves #1293)
|
2017-09-16 20:27:59 +02:00 |
|
Matthew Honnibal
|
5ff2491f24
|
Pass option for pre-trained vectors in parser
|
2017-09-16 12:47:21 -05:00 |
|
Matthew Honnibal
|
8665a77f48
|
Fix feature error in NER
|
2017-09-16 12:46:57 -05:00 |
|
Matthew Honnibal
|
e37a50a436
|
Pass documents to tensorizer, not 'features'
|
2017-09-16 12:46:36 -05:00 |
|
Matthew Honnibal
|
84e637e2e6
|
Pass option for pretrained vectors in pipeline
|
2017-09-16 12:46:02 -05:00 |
|
Matthew Honnibal
|
2a93404da6
|
Support optional pre-trained vectors in tensorizer model
|
2017-09-16 12:45:37 -05:00 |
|
Matthew Honnibal
|
e0a2aa9289
|
Support having word vectors data on GPU
|
2017-09-16 12:45:09 -05:00 |
|
Matthew Honnibal
|
ebf8942564
|
Fix test for Python3
|
2017-09-16 16:22:38 +02:00 |
|
Matthew Honnibal
|
8c945310fb
|
Excuse emoji failure on narrow unicode builds
|
2017-09-16 16:21:13 +02:00 |
|
Matthew Honnibal
|
11f2a05ede
|
Fix code explosion from long enum in Python 3, Cython 0.24+
|
2017-09-16 12:20:04 +02:00 |
|
Matthew Honnibal
|
8a829eb98c
|
Fix travis.sh
|
2017-09-16 11:49:31 +02:00 |
|
Matthew Honnibal
|
3fa5b40b5c
|
Add test for hash consistency
|
2017-09-16 11:21:35 +02:00 |
|
Matthew Honnibal
|
f730d07e4e
|
Fix prange error for Windows
|
2017-09-16 00:25:33 +02:00 |
|
Matthew Honnibal
|
1ffc9a7fbf
|
Fix appveyor
|
2017-09-15 23:59:36 +02:00 |
|
Matthew Honnibal
|
2432308f3e
|
Build in separate step for appveyor
|
2017-09-15 23:55:19 +02:00 |
|
Matthew Honnibal
|
07cdbd1219
|
Require thinc 6.8.1, for Windows
|
2017-09-15 22:47:53 +02:00 |
|
Matthew Honnibal
|
02273eeca8
|
Appveyor
|
2017-09-15 12:55:33 +02:00 |
|
Matthew Honnibal
|
25ec8935ad
|
Appveyor
|
2017-09-15 12:53:07 +02:00 |
|
Matthew Honnibal
|
1f840a9211
|
Appveyor
|
2017-09-15 12:49:02 +02:00 |
|
Matthew Honnibal
|
86367ab092
|
Start work on appveyor, for Windows build
|
2017-09-15 12:41:59 +02:00 |
|
Matthew Honnibal
|
4b2065430e
|
Merge branch 'feature/parser-history' into develop
|
2017-09-15 10:42:20 +02:00 |
|
Matthew Honnibal
|
2f08489694
|
Remove AddHistory layer -- didnt work as planned
|
2017-09-15 10:41:40 +02:00 |
|
Matthew Honnibal
|
8b481e0465
|
Remove redundant brackets
|
2017-09-15 10:38:08 +02:00 |
|
Matthew Honnibal
|
027a5d8b75
|
Update train_ner_standalone example
|
2017-09-15 10:36:46 +02:00 |
|
Matthew Honnibal
|
d84607f6bb
|
Vectorize update in AddHistory
|
2017-09-14 20:34:40 +02:00 |
|
Ines Montani
|
bd3da3d6fb
|
Port over change from #1323 and tidy up
|
2017-09-14 19:23:13 +02:00 |
|
Matthew Honnibal
|
18347ab69c
|
Implement AddHistory layer wrapper
|
2017-09-14 19:07:35 +02:00 |
|
Matthew Honnibal
|
d4ca6cef9e
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-14 17:00:07 +02:00 |
|
Matthew Honnibal
|
8c503487af
|
Fix lookup of missing NER actions
|
2017-09-14 16:59:45 +02:00 |
|
Matthew Honnibal
|
664c5af745
|
Revert padding in parser
|
2017-09-14 16:59:25 +02:00 |
|
Matthew Honnibal
|
8496d76224
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-14 09:21:20 -05:00 |
|
Matthew Honnibal
|
ae3bc72b29
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-14 16:21:04 +02:00 |
|
Matthew Honnibal
|
d1518027a9
|
Increment version
|
2017-09-14 16:18:46 +02:00 |
|
Matthew Honnibal
|
70da88a3a7
|
Update comment on Language.begin_training
|
2017-09-14 16:18:30 +02:00 |
|
Matthew Honnibal
|
c6395b057a
|
Improve parser feature extraction, for missing values
|
2017-09-14 16:18:02 +02:00 |
|
Matthew Honnibal
|
daf869ab3b
|
Fix add_action for NER, so labelled 'O' actions aren't added
|
2017-09-14 16:16:41 +02:00 |
|
Matthew Honnibal
|
683d81bb49
|
Update example for adding entity type
|
2017-09-14 16:15:59 +02:00 |
|
Ines Montani
|
4bee26188d
|
Merge pull request #1323 from galaxyh/master
Set the "cut_all" parameter in jieba.cut() to False, or jieba will return ALL POSSIBLE word segmentations.
|
2017-09-14 15:23:41 +02:00 |
|
Matthew Honnibal
|
9cb2aef587
|
Remove print statement
|
2017-09-14 13:38:28 +02:00 |
|
Matthew Honnibal
|
ba23d63c35
|
Fix minibatch function, for fixed batch size
|
2017-09-14 13:37:41 +02:00 |
|
Ines Montani
|
b8e81daccf
|
Fix typo (closes #1312)
|
2017-09-14 12:49:59 +02:00 |
|
Yu-chun Huang
|
7692b8c071
|
Update __init__.py
Set the "cut_all" parameter to False, or jieba will return ALL POSSIBLE word segmentations.
|
2017-09-12 16:23:47 +08:00 |
|
Jim O'Regan
|
7de709483b
|
missed adding here
|
2017-09-11 10:51:21 +01:00 |
|
Jim O'Regan
|
b1b6123867
|
add ga_tokenizer
|
2017-09-11 10:31:41 +01:00 |
|
Jim O'Regan
|
9dfd301962
|
rearrange
|
2017-09-11 10:14:18 +01:00 |
|
Jim O'Regan
|
187be6d372
|
copy/paste error
|
2017-09-11 09:33:17 +01:00 |
|
Jim O'Regan
|
c283e9edfe
|
first stab at test
|
2017-09-11 08:57:48 +01:00 |
|