Matthew Honnibal
|
cc408fc189
|
Make PhraseMatcher API like Matcher API
|
2017-09-20 22:20:35 +02:00 |
|
Matthew Honnibal
|
43ad250dd5
|
Update matcher tests
|
2017-09-20 21:54:49 +02:00 |
|
Matthew Honnibal
|
828cc91545
|
Fix PhraseMatcher for spaCy 2
|
2017-09-20 21:54:31 +02:00 |
|
Wannaphong Phatthiyaphaibun
|
39bb5690f0
|
update th
|
2017-09-21 00:36:02 +07:00 |
|
Wannaphong Phatthiyaphaibun
|
44291f6697
|
add thai
|
2017-09-20 23:26:34 +07:00 |
|
Yam
|
978b24ccd4
|
Update punctuation.py
In Chinese, `~` and `——` is hyphens,
`·` is intermittent symbol
|
2017-09-20 23:02:22 +08:00 |
|
Matthew Honnibal
|
78301b2d29
|
Avoid comparison to None in Tok2Vec
|
2017-09-20 00:19:34 +02:00 |
|
Matthew Honnibal
|
b36a38f63d
|
Fix serialization of pretrained_dims property
|
2017-09-19 23:42:27 +02:00 |
|
Matthew Honnibal
|
2489dcaccf
|
Fix serialization of parser
|
2017-09-19 23:42:12 +02:00 |
|
Matthew Honnibal
|
aa728b33ca
|
Merge pull request #1333 from galaxyh/master
Add Chinese punctuation
|
2017-09-19 15:09:30 +02:00 |
|
Yu-chun Huang
|
188b439b25
|
Add Chinese punctuation
Add Chinese punctuation.
|
2017-09-19 16:58:42 +08:00 |
|
Yu-chun Huang
|
1f1f35dcd0
|
Add Chinese punctuation
Add Chinese punctuation.
|
2017-09-19 16:57:24 +08:00 |
|
Matthew Honnibal
|
40837b275d
|
Fix tensorizer with pretrained vectors
|
2017-09-18 18:05:38 -05:00 |
|
Matthew Honnibal
|
a0c4b33d03
|
Support resuming a model during spacy train
|
2017-09-18 18:04:47 -05:00 |
|
Matthew Honnibal
|
c858927271
|
Copy vectors to GPU on begin training
|
2017-09-18 18:04:16 -05:00 |
|
Matthew Honnibal
|
3fa76c17d1
|
Refactor Tok2Vec
|
2017-09-18 15:00:05 -05:00 |
|
Matthew Honnibal
|
217e7891cd
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-18 11:36:21 -05:00 |
|
Matthew Honnibal
|
7b3f391f80
|
Try dropping the Affine layer, conditionally
|
2017-09-18 11:35:59 -05:00 |
|
ines
|
2480f8f521
|
Add missing return in Doc.from_disk() (closes #1330)
|
2017-09-18 15:32:00 +02:00 |
|
Matthew Honnibal
|
2148ae605b
|
Dont use iterated convolutions
|
2017-09-17 17:36:04 -05:00 |
|
Matthew Honnibal
|
c013e5996f
|
Fix parser test
|
2017-09-17 13:13:20 -05:00 |
|
Matthew Honnibal
|
8f42f8d305
|
Remove unused 'preprocess' argument in Tok2Vec'
|
2017-09-17 12:30:16 -05:00 |
|
Matthew Honnibal
|
039d609362
|
Remove hard-coded default vectors width
|
2017-09-17 12:29:39 -05:00 |
|
Matthew Honnibal
|
4f38a67a89
|
Make width default to 0 in vectors.pyx
|
2017-09-17 12:29:14 -05:00 |
|
Matthew Honnibal
|
16122f566e
|
Fix cpdef enum in attrs.pyx
|
2017-09-17 12:28:53 -05:00 |
|
Matthew Honnibal
|
b159e0eb50
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-09-17 05:47:50 -05:00 |
|
Matthew Honnibal
|
2b0efc77ae
|
Fix wiring of pre-trained vectors in parser loading
|
2017-09-17 05:47:34 -05:00 |
|
Matthew Honnibal
|
31c2e91c35
|
Fix wiring of pre-trained vectors in parser loading
|
2017-09-17 05:46:55 -05:00 |
|
Matthew Honnibal
|
8f913a74ca
|
Fix defaults and args to build_tagger_model
|
2017-09-17 05:46:36 -05:00 |
|
Matthew Honnibal
|
c003c561c3
|
Revert NER action loading change, for model compatibility
|
2017-09-17 05:46:03 -05:00 |
|
Matthew Honnibal
|
43210abacc
|
Resolve fine-tuning conflict
|
2017-09-17 05:30:04 -05:00 |
|
ines
|
ece30c28a8
|
Don't split hyphenated words in German
This way, the tokenizer matches the tokenization in German treebanks
|
2017-09-16 20:40:15 +02:00 |
|
ines
|
68f66aebf8
|
Use pkg_resources instead of pip for is_package (resolves #1293)
|
2017-09-16 20:27:59 +02:00 |
|
Matthew Honnibal
|
5ff2491f24
|
Pass option for pre-trained vectors in parser
|
2017-09-16 12:47:21 -05:00 |
|
Matthew Honnibal
|
8665a77f48
|
Fix feature error in NER
|
2017-09-16 12:46:57 -05:00 |
|
Matthew Honnibal
|
e37a50a436
|
Pass documents to tensorizer, not 'features'
|
2017-09-16 12:46:36 -05:00 |
|
Matthew Honnibal
|
84e637e2e6
|
Pass option for pretrained vectors in pipeline
|
2017-09-16 12:46:02 -05:00 |
|
Matthew Honnibal
|
2a93404da6
|
Support optional pre-trained vectors in tensorizer model
|
2017-09-16 12:45:37 -05:00 |
|
Matthew Honnibal
|
e0a2aa9289
|
Support having word vectors data on GPU
|
2017-09-16 12:45:09 -05:00 |
|
Matthew Honnibal
|
ebf8942564
|
Fix test for Python3
|
2017-09-16 16:22:38 +02:00 |
|
Matthew Honnibal
|
8c945310fb
|
Excuse emoji failure on narrow unicode builds
|
2017-09-16 16:21:13 +02:00 |
|
Matthew Honnibal
|
11f2a05ede
|
Fix code explosion from long enum in Python 3, Cython 0.24+
|
2017-09-16 12:20:04 +02:00 |
|
Matthew Honnibal
|
8a829eb98c
|
Fix travis.sh
|
2017-09-16 11:49:31 +02:00 |
|
Matthew Honnibal
|
3fa5b40b5c
|
Add test for hash consistency
|
2017-09-16 11:21:35 +02:00 |
|
Matthew Honnibal
|
f730d07e4e
|
Fix prange error for Windows
|
2017-09-16 00:25:33 +02:00 |
|
Matthew Honnibal
|
1ffc9a7fbf
|
Fix appveyor
|
2017-09-15 23:59:36 +02:00 |
|
Matthew Honnibal
|
2432308f3e
|
Build in separate step for appveyor
|
2017-09-15 23:55:19 +02:00 |
|
Matthew Honnibal
|
07cdbd1219
|
Require thinc 6.8.1, for Windows
|
2017-09-15 22:47:53 +02:00 |
|
Matthew Honnibal
|
02273eeca8
|
Appveyor
|
2017-09-15 12:55:33 +02:00 |
|
Matthew Honnibal
|
25ec8935ad
|
Appveyor
|
2017-09-15 12:53:07 +02:00 |
|