Matthew Honnibal
|
a2357cce3f
|
Set random seed in train script
|
2017-09-23 02:57:31 +02:00 |
|
Matthew Honnibal
|
05596159bf
|
Fix serialization when pre-trained vectors
|
2017-09-22 15:33:27 -05:00 |
|
Matthew Honnibal
|
980fb6e854
|
Refactor Tok2Vec
|
2017-09-22 09:38:36 -05:00 |
|
Matthew Honnibal
|
d9124f1aa3
|
Add link_vectors_to_models function
|
2017-09-22 09:38:22 -05:00 |
|
Matthew Honnibal
|
a186596307
|
Add 'reapply' combinator, for iterated CNN
|
2017-09-22 09:37:03 -05:00 |
|
Matthew Honnibal
|
9177313063
|
Merge pull request #1352 from hscspring/patch-5
Update customizing-tokenizer.jade
|
2017-09-22 16:11:49 +02:00 |
|
Matthew Honnibal
|
1dbc2285b8
|
Merge pull request #1350 from hscspring/patch-3
Update word-vectors-similarities.jade
|
2017-09-22 16:11:05 +02:00 |
|
Yam
|
54855f0eee
|
Update customizing-tokenizer.jade
|
2017-09-22 12:15:48 +08:00 |
|
Yam
|
6f450306c3
|
Update customizing-tokenizer.jade
update some codes:
- `me` -> `-PRON`
- `TAG` -> `POS`
- `create_tokenizer` function
|
2017-09-22 10:53:22 +08:00 |
|
Yam
|
923c4c2fb2
|
Update punctuation.py
add `……`
|
2017-09-22 09:50:46 +08:00 |
|
Yam
|
425c09488d
|
Update word-vectors-similarities.jade
add
```
import spacy
nlp = spacy.load('en') ```
|
2017-09-22 08:56:34 +08:00 |
|
Matthew Honnibal
|
40a4873b70
|
Fix serialization of model options
|
2017-09-21 13:07:26 -05:00 |
|
Matthew Honnibal
|
0a9016cade
|
Fix serialization during training
|
2017-09-21 13:06:45 -05:00 |
|
Matthew Honnibal
|
20193371f5
|
Don't share CNN, to reduce complexities
|
2017-09-21 14:59:48 +02:00 |
|
Wannaphong Phatthiyaphaibun
|
1abf472068
|
add th test
|
2017-09-21 12:56:58 +07:00 |
|
Matthew Honnibal
|
1d73dec8b1
|
Refactor train script
|
2017-09-20 19:17:10 -05:00 |
|
Matthew Honnibal
|
ffda38356a
|
Add util function to enable GPU
|
2017-09-20 19:16:35 -05:00 |
|
Matthew Honnibal
|
24e85c2048
|
Pass values for CNN maxout pieces option
|
2017-09-20 19:16:12 -05:00 |
|
Matthew Honnibal
|
b832f89ff8
|
Add resume_training function
|
2017-09-20 19:15:20 -05:00 |
|
Matthew Honnibal
|
f5144f04be
|
Add argument for CNN maxout pieces
|
2017-09-20 19:14:41 -05:00 |
|
Matthew Honnibal
|
ea2732469b
|
Merge pull request #1340 from hscspring/patch-1
Update punctuation.py
|
2017-09-20 23:57:00 +02:00 |
|
Matthew Honnibal
|
842e21de9f
|
Fix int type error for Python 2
|
2017-09-20 23:55:30 +02:00 |
|
Matthew Honnibal
|
f92ab03dc8
|
Rename phrase matcher example
|
2017-09-20 22:51:58 +02:00 |
|
Matthew Honnibal
|
01858e9b59
|
Fix PhraseMatcher example
|
2017-09-20 22:51:41 +02:00 |
|
Matthew Honnibal
|
0c93c73e49
|
Add __reduce__ method for PhraseMatcher
|
2017-09-20 22:26:40 +02:00 |
|
Matthew Honnibal
|
cc408fc189
|
Make PhraseMatcher API like Matcher API
|
2017-09-20 22:20:35 +02:00 |
|
Matthew Honnibal
|
43ad250dd5
|
Update matcher tests
|
2017-09-20 21:54:49 +02:00 |
|
Matthew Honnibal
|
828cc91545
|
Fix PhraseMatcher for spaCy 2
|
2017-09-20 21:54:31 +02:00 |
|
Wannaphong Phatthiyaphaibun
|
39bb5690f0
|
update th
|
2017-09-21 00:36:02 +07:00 |
|
Wannaphong Phatthiyaphaibun
|
44291f6697
|
add thai
|
2017-09-20 23:26:34 +07:00 |
|
Yam
|
978b24ccd4
|
Update punctuation.py
In Chinese, `~` and `——` is hyphens,
`·` is intermittent symbol
|
2017-09-20 23:02:22 +08:00 |
|
Matthew Honnibal
|
78301b2d29
|
Avoid comparison to None in Tok2Vec
|
2017-09-20 00:19:34 +02:00 |
|
Matthew Honnibal
|
b36a38f63d
|
Fix serialization of pretrained_dims property
|
2017-09-19 23:42:27 +02:00 |
|
Matthew Honnibal
|
2489dcaccf
|
Fix serialization of parser
|
2017-09-19 23:42:12 +02:00 |
|
Matthew Honnibal
|
aa728b33ca
|
Merge pull request #1333 from galaxyh/master
Add Chinese punctuation
|
2017-09-19 15:09:30 +02:00 |
|
Yu-chun Huang
|
188b439b25
|
Add Chinese punctuation
Add Chinese punctuation.
|
2017-09-19 16:58:42 +08:00 |
|
Yu-chun Huang
|
1f1f35dcd0
|
Add Chinese punctuation
Add Chinese punctuation.
|
2017-09-19 16:57:24 +08:00 |
|
Matthew Honnibal
|
40837b275d
|
Fix tensorizer with pretrained vectors
|
2017-09-18 18:05:38 -05:00 |
|
Matthew Honnibal
|
a0c4b33d03
|
Support resuming a model during spacy train
|
2017-09-18 18:04:47 -05:00 |
|
Matthew Honnibal
|
c858927271
|
Copy vectors to GPU on begin training
|
2017-09-18 18:04:16 -05:00 |
|
Matthew Honnibal
|
3fa76c17d1
|
Refactor Tok2Vec
|
2017-09-18 15:00:05 -05:00 |
|
Matthew Honnibal
|
217e7891cd
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-18 11:36:21 -05:00 |
|
Matthew Honnibal
|
7b3f391f80
|
Try dropping the Affine layer, conditionally
|
2017-09-18 11:35:59 -05:00 |
|
ines
|
2480f8f521
|
Add missing return in Doc.from_disk() (closes #1330)
|
2017-09-18 15:32:00 +02:00 |
|
Matthew Honnibal
|
2148ae605b
|
Dont use iterated convolutions
|
2017-09-17 17:36:04 -05:00 |
|
Matthew Honnibal
|
c013e5996f
|
Fix parser test
|
2017-09-17 13:13:20 -05:00 |
|
Matthew Honnibal
|
8f42f8d305
|
Remove unused 'preprocess' argument in Tok2Vec'
|
2017-09-17 12:30:16 -05:00 |
|
Matthew Honnibal
|
039d609362
|
Remove hard-coded default vectors width
|
2017-09-17 12:29:39 -05:00 |
|
Matthew Honnibal
|
4f38a67a89
|
Make width default to 0 in vectors.pyx
|
2017-09-17 12:29:14 -05:00 |
|
Matthew Honnibal
|
16122f566e
|
Fix cpdef enum in attrs.pyx
|
2017-09-17 12:28:53 -05:00 |
|