Commit Graph

2301 Commits

Author SHA1 Message Date
Matthew Honnibal
4f13849065 Merge pull request #145 from henningpeters/master
better error reporting, cleanup
2015-10-23 03:45:47 +11:00
Matthew Honnibal
4f5b4a88f2 Merge branch 'master' of https://github.com/honnibal/spaCy into develop 2015-10-22 13:33:15 +02:00
Matthew Honnibal
3be94be0c0 Merge pull request #148 from maxirmx/master
Utf8 encoding for lemma_rules.json
2015-10-22 21:46:28 +11:00
Matthew Honnibal
57a68b0d03 Merge pull request #147 from andreasgrv/master
add __repr__ print text default for doc, token, and span
2015-10-22 21:45:41 +11:00
Matthew Honnibal
c86bda8d1a * Fix import of uget 2015-10-22 21:13:56 +11:00
Matthew Honnibal
1c50110585 * Fix twine publishing in fabfile 2015-10-22 21:13:37 +11:00
Matthew Honnibal
2348a08481 * Load/dump strings with a json file, instead of the hacky strings file we were using. 2015-10-22 21:13:03 +11:00
Matthew Honnibal
9baf0abd59 * Save vocab after training. 2015-10-22 21:09:14 +11:00
maxirmx
64a61d52db Appveyor.yml cleanup 2015-10-22 01:15:53 +03:00
maxirmx
6bb8e05fe2 Win32 build added 2015-10-21 23:13:42 +03:00
maxirmx
f07e4accd7 Fixing encoding issue #4 2015-10-21 20:45:56 +03:00
maxirmx
fcbfff043f Fixing encoding issue #3 2015-10-21 15:52:34 +03:00
maxirmx
fe9d2e2c4e Fixing encode issue #2 2015-10-21 15:36:21 +03:00
maxirmx
e4a1726f77 Fixing encoding issue
UTF-8
2015-10-21 14:16:37 +03:00
Andreas Grivas
93ada458e2 added __repr__ that prints text in ipython for doc, token, and span objects 2015-10-21 14:11:46 +03:00
Henning Peters
ccffd2ef53 fixed extract directory 2015-10-21 07:59:34 +02:00
maxirmx
b8d07f35dd Push 2015-10-20 23:28:37 +03:00
maxirmx
88d7eb6d26 Push 2015-10-20 23:27:32 +03:00
maxirmx
14b89ff1c5 Merge remote-tracking branch 'refs/remotes/honnibal/master' 2015-10-20 23:27:20 +03:00
maxirmx
aefc6b37b8 Trash deleted 2015-10-20 23:20:32 +03:00
maxirmx
685dc89754 Test build #2 2015-10-20 22:53:14 +03:00
maxirmx
f43dee555b Test build #1 2015-10-20 22:52:11 +03:00
Henning Peters
da4c9cee06 assert filename match 2015-10-20 19:33:59 +02:00
Henning Peters
4f703f0cb4 better error reporting, cleanup 2015-10-20 19:11:29 +02:00
Matthew Honnibal
f02a428fc7 Merge branch 'master' of https://github.com/honnibal/spaCy 2015-10-19 14:52:03 +02:00
Matthew Honnibal
9d95c26179 * Add simple deep feed-forward neural network text classification example. 2015-10-19 23:44:49 +11:00
Matthew Honnibal
9cdea6e450 * Import uget correctly 2015-10-19 08:32:41 +02:00
Matthew Honnibal
579670e4c7 * Fix uget 2015-10-19 17:23:33 +11:00
Matthew Honnibal
984775e5e2 * Fix setup of uget 2015-10-19 17:19:05 +11:00
Matthew Honnibal
e25adce54d Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-10-19 17:17:33 +11:00
Matthew Honnibal
382cbc8cab * Add uget to setup.py 2015-10-19 17:15:40 +11:00
Matthew Honnibal
941dff9141 Merge branch 'master' of https://github.com/honnibal/spaCy 2015-10-19 07:48:01 +02:00
Matthew Honnibal
54dbbc0bfc * Don't use cache dir in prebuild 2015-10-19 07:47:14 +02:00
Matthew Honnibal
a43777cef8 * Inc version 2015-10-19 07:46:42 +02:00
Matthew Honnibal
6727a46bb5 * Fix Issue #118: Matcher behaves unpredictably when matches overlap. 2015-10-19 16:45:32 +11:00
Matthew Honnibal
135062d23c * Fix error with merged text when merged region did not have trailing whitespace 2015-10-19 15:47:04 +11:00
Matthew Honnibal
0ce12e4548 * Import io in get_freqs 2015-10-19 12:56:18 +11:00
Matthew Honnibal
d51579ffe6 * Pedantic edits to website/create_code_samples. Make it use plac for interface, remove unnecessary regex, ensure unicode is handled correctly under Python 2. 2015-10-19 12:56:00 +11:00
Matthew Honnibal
726bb648da * Fix non-breaking space in specials.json 2015-10-19 12:46:11 +11:00
Matthew Honnibal
7ecafdab7f Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-10-19 12:39:27 +11:00
Matthew Honnibal
e39095da82 * Fix designation of non-breaking space in specials.json. 2015-10-19 12:39:03 +11:00
Matthew Honnibal
295f3213fe Merge pull request #143 from henningpeters/master
add custom download tool (uget), replace wget with uget
2015-10-18 21:48:42 +11:00
Henning Peters
bfde91fa49 add custom download tool (uget), replace wget with uget 2015-10-18 12:35:04 +02:00
Matthew Honnibal
9839cd2c0b * Fix whitespace_ calculation in Token 2015-10-18 17:21:11 +11:00
Matthew Honnibal
c99285b8b9 * Clean up C++ usage in spacy/matcher.pyx 2015-10-18 17:20:50 +11:00
Matthew Honnibal
fc261195f7 * Fix compilation for OSX 2015-10-18 17:19:07 +11:00
Matthew Honnibal
a7e6c5ac8f * Fix Issue #122: Incorrect calculation of children after Doc.merge() 2015-10-18 17:17:27 +11:00
Matthew Honnibal
454c1996d0 * Add tokenizer rule to fix numeric range tokenization 2015-10-17 15:49:51 +11:00
maxirmx
6de26d312c Merge remote-tracking branch 'refs/remotes/honnibal/master' 2015-10-16 11:59:57 +03:00
Maxim Samsonov
f81fa55db6 Added win32 build to the matrix 2015-10-16 00:04:55 +03:00