Matthew Honnibal
|
2348a08481
|
* Load/dump strings with a json file, instead of the hacky strings file we were using.
|
2015-10-22 21:13:03 +11:00 |
|
Matthew Honnibal
|
9baf0abd59
|
* Save vocab after training.
|
2015-10-22 21:09:14 +11:00 |
|
maxirmx
|
64a61d52db
|
Appveyor.yml cleanup
|
2015-10-22 01:15:53 +03:00 |
|
maxirmx
|
6bb8e05fe2
|
Win32 build added
|
2015-10-21 23:13:42 +03:00 |
|
maxirmx
|
f07e4accd7
|
Fixing encoding issue #4
|
2015-10-21 20:45:56 +03:00 |
|
maxirmx
|
fcbfff043f
|
Fixing encoding issue #3
|
2015-10-21 15:52:34 +03:00 |
|
maxirmx
|
fe9d2e2c4e
|
Fixing encode issue #2
|
2015-10-21 15:36:21 +03:00 |
|
maxirmx
|
e4a1726f77
|
Fixing encoding issue
UTF-8
|
2015-10-21 14:16:37 +03:00 |
|
Andreas Grivas
|
93ada458e2
|
added __repr__ that prints text in ipython for doc, token, and span objects
|
2015-10-21 14:11:46 +03:00 |
|
Henning Peters
|
ccffd2ef53
|
fixed extract directory
|
2015-10-21 07:59:34 +02:00 |
|
maxirmx
|
b8d07f35dd
|
Push
|
2015-10-20 23:28:37 +03:00 |
|
maxirmx
|
88d7eb6d26
|
Push
|
2015-10-20 23:27:32 +03:00 |
|
maxirmx
|
14b89ff1c5
|
Merge remote-tracking branch 'refs/remotes/honnibal/master'
|
2015-10-20 23:27:20 +03:00 |
|
maxirmx
|
aefc6b37b8
|
Trash deleted
|
2015-10-20 23:20:32 +03:00 |
|
maxirmx
|
685dc89754
|
Test build #2
|
2015-10-20 22:53:14 +03:00 |
|
maxirmx
|
f43dee555b
|
Test build #1
|
2015-10-20 22:52:11 +03:00 |
|
Henning Peters
|
da4c9cee06
|
assert filename match
|
2015-10-20 19:33:59 +02:00 |
|
Henning Peters
|
4f703f0cb4
|
better error reporting, cleanup
|
2015-10-20 19:11:29 +02:00 |
|
Matthew Honnibal
|
f02a428fc7
|
Merge branch 'master' of https://github.com/honnibal/spaCy
|
2015-10-19 14:52:03 +02:00 |
|
Matthew Honnibal
|
9d95c26179
|
* Add simple deep feed-forward neural network text classification example.
|
2015-10-19 23:44:49 +11:00 |
|
Matthew Honnibal
|
9cdea6e450
|
* Import uget correctly
|
2015-10-19 08:32:41 +02:00 |
|
Matthew Honnibal
|
579670e4c7
|
* Fix uget
|
2015-10-19 17:23:33 +11:00 |
|
Matthew Honnibal
|
984775e5e2
|
* Fix setup of uget
|
2015-10-19 17:19:05 +11:00 |
|
Matthew Honnibal
|
e25adce54d
|
Merge branch 'master' of ssh://github.com/honnibal/spaCy
|
2015-10-19 17:17:33 +11:00 |
|
Matthew Honnibal
|
382cbc8cab
|
* Add uget to setup.py
|
2015-10-19 17:15:40 +11:00 |
|
Matthew Honnibal
|
941dff9141
|
Merge branch 'master' of https://github.com/honnibal/spaCy
|
2015-10-19 07:48:01 +02:00 |
|
Matthew Honnibal
|
54dbbc0bfc
|
* Don't use cache dir in prebuild
|
2015-10-19 07:47:14 +02:00 |
|
Matthew Honnibal
|
a43777cef8
|
* Inc version
|
2015-10-19 07:46:42 +02:00 |
|
Matthew Honnibal
|
6727a46bb5
|
* Fix Issue #118: Matcher behaves unpredictably when matches overlap.
|
2015-10-19 16:45:32 +11:00 |
|
Matthew Honnibal
|
135062d23c
|
* Fix error with merged text when merged region did not have trailing whitespace
|
2015-10-19 15:47:04 +11:00 |
|
Matthew Honnibal
|
0ce12e4548
|
* Import io in get_freqs
|
2015-10-19 12:56:18 +11:00 |
|
Matthew Honnibal
|
d51579ffe6
|
* Pedantic edits to website/create_code_samples. Make it use plac for interface, remove unnecessary regex, ensure unicode is handled correctly under Python 2.
|
2015-10-19 12:56:00 +11:00 |
|
Matthew Honnibal
|
726bb648da
|
* Fix non-breaking space in specials.json
|
2015-10-19 12:46:11 +11:00 |
|
Matthew Honnibal
|
7ecafdab7f
|
Merge branch 'master' of ssh://github.com/honnibal/spaCy
|
2015-10-19 12:39:27 +11:00 |
|
Matthew Honnibal
|
e39095da82
|
* Fix designation of non-breaking space in specials.json.
|
2015-10-19 12:39:03 +11:00 |
|
Matthew Honnibal
|
295f3213fe
|
Merge pull request #143 from henningpeters/master
add custom download tool (uget), replace wget with uget
|
2015-10-18 21:48:42 +11:00 |
|
Henning Peters
|
bfde91fa49
|
add custom download tool (uget), replace wget with uget
|
2015-10-18 12:35:04 +02:00 |
|
Matthew Honnibal
|
9839cd2c0b
|
* Fix whitespace_ calculation in Token
|
2015-10-18 17:21:11 +11:00 |
|
Matthew Honnibal
|
c99285b8b9
|
* Clean up C++ usage in spacy/matcher.pyx
|
2015-10-18 17:20:50 +11:00 |
|
Matthew Honnibal
|
fc261195f7
|
* Fix compilation for OSX
|
2015-10-18 17:19:07 +11:00 |
|
Matthew Honnibal
|
a7e6c5ac8f
|
* Fix Issue #122: Incorrect calculation of children after Doc.merge()
|
2015-10-18 17:17:27 +11:00 |
|
Matthew Honnibal
|
454c1996d0
|
* Add tokenizer rule to fix numeric range tokenization
|
2015-10-17 15:49:51 +11:00 |
|
maxirmx
|
6de26d312c
|
Merge remote-tracking branch 'refs/remotes/honnibal/master'
|
2015-10-16 11:59:57 +03:00 |
|
Maxim Samsonov
|
f81fa55db6
|
Added win32 build to the matrix
|
2015-10-16 00:04:55 +03:00 |
|
Maxim Samsonov
|
6cff3533b5
|
Update README.md
|
2015-10-16 00:01:03 +03:00 |
|
Maxim Samsonov
|
e6e6018356
|
Update README.md
|
2015-10-16 00:00:47 +03:00 |
|
Maxim Samsonov
|
da7426d198
|
Update README.md
|
2015-10-15 23:56:28 +03:00 |
|
Maxim Samsonov
|
6ee1c03f51
|
Update README.md
|
2015-10-15 23:53:47 +03:00 |
|
Maxim Samsonov
|
ffeb0aed82
|
Added Windows build status to README.md
|
2015-10-15 23:52:20 +03:00 |
|
Matthew Honnibal
|
3ba66f2dc7
|
* Add string length cap in Tokenizer.__call__
|
2015-10-16 04:54:16 +11:00 |
|