Matthew Honnibal
|
52fc338001
|
* Set is_parsed and is_tagged attrs when loading annotations into Doc, re Issue #152
|
2015-10-28 10:43:22 +11:00 |
|
Matthew Honnibal
|
1c0356e4c2
|
* Set test file mode to w+t
|
2015-10-26 22:40:48 +11:00 |
|
Matthew Honnibal
|
0fe98f358b
|
* Fix mode on text file for Python3 in strings test
|
2015-10-26 22:25:16 +11:00 |
|
Matthew Honnibal
|
8ba9cf905e
|
* Fix mode on text file for Python3 in strings test
|
2015-10-26 21:44:34 +11:00 |
|
Matthew Honnibal
|
a0730699b1
|
* Fix mode on text file for Python3 in strings test
|
2015-10-26 21:25:56 +11:00 |
|
Matthew Honnibal
|
725344d349
|
* Fix tempfile in test
|
2015-10-26 21:08:18 +11:00 |
|
Matthew Honnibal
|
f11030aadc
|
* Remove out-dated TODO comment
|
2015-10-26 12:33:38 +11:00 |
|
Matthew Honnibal
|
a371a1071d
|
* Save and load word vectors during pickling, re Issue #125
|
2015-10-26 12:33:04 +11:00 |
|
Matthew Honnibal
|
a824a98312
|
* Add tests for pickling vectors, re: Issue #125
|
2015-10-26 12:31:05 +11:00 |
|
Matthew Honnibal
|
314090cc78
|
* Set vectors length when unpickling vocab, re Issue #125
|
2015-10-26 12:05:08 +11:00 |
|
Matthew Honnibal
|
4e16f9e435
|
* Move tests underneath spacy/
|
2015-10-26 00:07:31 +11:00 |
|
Matthew Honnibal
|
3a6e48e814
|
Merge pull request #149 from chrisdubois/pickle-patch
Add __reduce__ to Tokenizer so that English pickles.
|
2015-10-25 15:30:31 +11:00 |
|
Chris DuBois
|
dac8fe7bdb
|
Add __reduce__ to Tokenizer so that English pickles.
- Add tests to test_pickle and test_tokenizer that save to tempfiles.
|
2015-10-23 22:24:03 -07:00 |
|
Matthew Honnibal
|
ff4fe524ee
|
* Fix exception for python 2
|
2015-10-23 01:56:13 +02:00 |
|
Matthew Honnibal
|
341a3e85cd
|
* Upd downloaded data version
|
2015-10-23 00:56:57 +02:00 |
|
Matthew Honnibal
|
f18fd8c659
|
* Fix language.py for change in StringStore load API
|
2015-10-23 03:48:12 +11:00 |
|
Matthew Honnibal
|
23855db3ca
|
Merge branch 'master' of ssh://github.com/honnibal/spaCy into develop
|
2015-10-23 03:46:09 +11:00 |
|
Matthew Honnibal
|
4f13849065
|
Merge pull request #145 from henningpeters/master
better error reporting, cleanup
|
2015-10-23 03:45:47 +11:00 |
|
Matthew Honnibal
|
3be94be0c0
|
Merge pull request #148 from maxirmx/master
Utf8 encoding for lemma_rules.json
|
2015-10-22 21:46:28 +11:00 |
|
Matthew Honnibal
|
c86bda8d1a
|
* Fix import of uget
|
2015-10-22 21:13:56 +11:00 |
|
Matthew Honnibal
|
2348a08481
|
* Load/dump strings with a json file, instead of the hacky strings file we were using.
|
2015-10-22 21:13:03 +11:00 |
|
Matthew Honnibal
|
9baf0abd59
|
* Save vocab after training.
|
2015-10-22 21:09:14 +11:00 |
|
maxirmx
|
f07e4accd7
|
Fixing encoding issue #4
|
2015-10-21 20:45:56 +03:00 |
|
maxirmx
|
fcbfff043f
|
Fixing encoding issue #3
|
2015-10-21 15:52:34 +03:00 |
|
maxirmx
|
fe9d2e2c4e
|
Fixing encode issue #2
|
2015-10-21 15:36:21 +03:00 |
|
maxirmx
|
e4a1726f77
|
Fixing encoding issue
UTF-8
|
2015-10-21 14:16:37 +03:00 |
|
Andreas Grivas
|
93ada458e2
|
added __repr__ that prints text in ipython for doc, token, and span objects
|
2015-10-21 14:11:46 +03:00 |
|
Henning Peters
|
ccffd2ef53
|
fixed extract directory
|
2015-10-21 07:59:34 +02:00 |
|
Henning Peters
|
da4c9cee06
|
assert filename match
|
2015-10-20 19:33:59 +02:00 |
|
Henning Peters
|
4f703f0cb4
|
better error reporting, cleanup
|
2015-10-20 19:11:29 +02:00 |
|
Matthew Honnibal
|
9cdea6e450
|
* Import uget correctly
|
2015-10-19 08:32:41 +02:00 |
|
Matthew Honnibal
|
6727a46bb5
|
* Fix Issue #118: Matcher behaves unpredictably when matches overlap.
|
2015-10-19 16:45:32 +11:00 |
|
Matthew Honnibal
|
135062d23c
|
* Fix error with merged text when merged region did not have trailing whitespace
|
2015-10-19 15:47:04 +11:00 |
|
Henning Peters
|
bfde91fa49
|
add custom download tool (uget), replace wget with uget
|
2015-10-18 12:35:04 +02:00 |
|
Matthew Honnibal
|
9839cd2c0b
|
* Fix whitespace_ calculation in Token
|
2015-10-18 17:21:11 +11:00 |
|
Matthew Honnibal
|
c99285b8b9
|
* Clean up C++ usage in spacy/matcher.pyx
|
2015-10-18 17:20:50 +11:00 |
|
Matthew Honnibal
|
a7e6c5ac8f
|
* Fix Issue #122: Incorrect calculation of children after Doc.merge()
|
2015-10-18 17:17:27 +11:00 |
|
Matthew Honnibal
|
3ba66f2dc7
|
* Add string length cap in Tokenizer.__call__
|
2015-10-16 04:54:16 +11:00 |
|
Matthew Honnibal
|
6e0f985afc
|
* Fix token.conjuncts
|
2015-10-15 03:49:45 +11:00 |
|
Matthew Honnibal
|
2e0104ac81
|
* Fix token.conjuncts
|
2015-10-15 03:47:45 +11:00 |
|
Matthew Honnibal
|
b8f3345a82
|
* Fix token.conjuncts method
|
2015-10-15 03:36:01 +11:00 |
|
Matthew Honnibal
|
23818f89b8
|
* Fix token.conjuncts method
|
2015-10-15 03:34:57 +11:00 |
|
Matthew Honnibal
|
7a15d1b60c
|
* Add Python 2/3 compatibility fix for copy_reg
|
2015-10-13 20:04:40 +11:00 |
|
Matthew Honnibal
|
329ae57520
|
* Fix whitespace attachment thing
|
2015-10-13 09:46:38 +02:00 |
|
Matthew Honnibal
|
37919eac82
|
* Fix whitespace attachment in simpler way. Leaves problem with setting left/right children.
|
2015-10-13 18:23:24 +11:00 |
|
Matthew Honnibal
|
c70eb776ae
|
* Fix whitespace attachment, so that left/right children are consistent with head.
|
2015-10-13 15:58:22 +11:00 |
|
Matthew Honnibal
|
531182f937
|
* Fix Model.__reduce__
|
2015-10-13 15:14:38 +11:00 |
|
Matthew Honnibal
|
6c227a6c1f
|
* Fix Model.__reduce__
|
2015-10-13 15:10:04 +11:00 |
|
Matthew Honnibal
|
358c82595c
|
* Fix NAMES list in spacy/parts_of_speech.pyx
|
2015-10-13 14:18:45 +11:00 |
|
Matthew Honnibal
|
c1fdc487bc
|
Merge branch 'attrs'
|
2015-10-13 14:03:41 +11:00 |
|