Commit Graph

952 Commits

Author SHA1 Message Date
Matthew Honnibal
998b607f65 * Upd download script, having it download all data if there's no data/ directory, allowing easier compilation from source 2015-01-30 18:04:01 +11:00
Matthew Honnibal
0b53fd7daa * Add test for parse tree navigation 2015-01-30 18:02:58 +11:00
Matthew Honnibal
ef2493a3bd * Upd gitignore 2015-01-30 16:49:44 +11:00
Matthew Honnibal
d5d1578e44 * Add manifest file 2015-01-30 16:49:02 +11:00
Matthew Honnibal
0a1ec40f76 * Add draft work on features 2015-01-30 16:46:52 +11:00
Matthew Honnibal
7d432b7e39 * Add tests for vector-space model 2015-01-30 16:45:45 +11:00
Matthew Honnibal
48b98e3fb5 * Add test for tag names 2015-01-30 16:45:11 +11:00
Matthew Honnibal
613a195f92 * Add test for indices 2015-01-30 16:44:29 +11:00
Matthew Honnibal
03cc2ee08e * Add test for numpy array transport 2015-01-30 16:43:55 +11:00
Matthew Honnibal
d20eeac156 * Start work on lexrank tutorial 2015-01-30 16:42:43 +11:00
Matthew Honnibal
b3f9b199cf Merge branch 'punctparse' 2015-01-30 16:38:56 +11:00
Matthew Honnibal
ca7577d8a9 * Allow parsers and taggers to be trained on text without gold pre-processing. 2015-01-30 16:36:24 +11:00
Matthew Honnibal
67d6e53a69 * Ensure parser and tagger function correctly when training from missing values, indicated by -1 2015-01-30 14:08:56 +11:00
Matthew Honnibal
4ff180db74 * Fix off-by-one error in commit 0a7fceb 2015-01-30 12:49:33 +11:00
Matthew Honnibal
d0e08a5b57 * Upd index tests 2015-01-30 12:35:13 +11:00
Matthew Honnibal
0a7fcebdf7 * Fix Issue #12: Incorrect token.idx calculations for some punctuation, in the presence of token cache 2015-01-30 12:33:38 +11:00
Matthew Honnibal
b38093237e * More debug prints 2015-01-30 11:15:54 +11:00
Matthew Honnibal
35a18250cc * Upd tests, avoiding unnecessary processing to make testing faster 2015-01-30 10:41:55 +11:00
Matthew Honnibal
5458f220f8 * Fix quickstart instructions 2015-01-30 10:31:25 +11:00
Matthew Honnibal
11ed65b93c * Work on alignment, for evaluation with non-gold preprocessing 2015-01-30 10:31:03 +11:00
Matthew Honnibal
ebf7d2fab1 * Use non-joint sbd, for more simplicity and fewer classes 2015-01-29 06:22:03 +11:00
Matthew Honnibal
d05c5bf141 * Remove comment 2015-01-29 05:19:27 +11:00
Matthew Honnibal
b4348ce1c3 * Messily use unsegmented sentences to train the parser 2015-01-29 04:21:13 +11:00
Matthew Honnibal
320b045daa * Oracle now consistent over gold standard derivation 2015-01-29 03:41:58 +11:00
Matthew Honnibal
f590382134 * Work on sbd 2015-01-29 03:18:29 +11:00
Matthew Honnibal
9e78d673d5 * Fix quickstart installation docs 2015-01-28 14:28:34 +11:00
Matthew Honnibal
fe5f34c37c Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-01-28 14:01:00 +11:00
Matthew Honnibal
781dd712dc * Fix numpy commit problem 2015-01-28 14:00:20 +11:00
Matthew Honnibal
b08c0ce54e * Fix numpy install problem 2015-01-28 13:58:33 +11:00
Matthew Honnibal
9171284d62 * Fix compile-from-source instructions 2015-01-28 12:27:44 +11:00
Matthew Honnibal
1884a7a0be * Attach comment with paper 2015-01-28 03:18:43 +11:00
Matthew Honnibal
a2d6b195db * Add messy Break transitions, carefully following the scheme of Dd Zhang et al (2013) 2015-01-28 03:09:45 +11:00
Matthew Honnibal
f9ee5d9934 * Build a python list of word strings, for debugging 2015-01-28 01:06:13 +11:00
Matthew Honnibal
d819101571 * Improve error message on oracle failure 2015-01-28 00:58:03 +11:00
Matthew Honnibal
1d39562946 * Ws 2015-01-28 00:57:16 +11:00
honnibal
97d91ada3a Update README.md 2015-01-27 21:55:36 +11:00
honnibal
51ca3a285d Update README.md 2015-01-27 21:55:23 +11:00
Matthew Honnibal
e6c3d3471f * Tweak documentation for Tokens, and hide constructor as __cinit__ 2015-01-27 18:57:52 +11:00
Matthew Honnibal
e10b712d29 * Ammend warning 2015-01-27 18:56:18 +11:00
Matthew Honnibal
a5a9417d92 * Fix typo from toby 2015-01-27 18:53:29 +11:00
Matthew Honnibal
f5fab2d9d7 * Switch to using sphinx_rtd_theme locally, instead of via pip install, for better control. Specifically, need to support google analytics 2015-01-27 18:52:47 +11:00
Matthew Honnibal
7b6bf87c25 * Fix api docs 2015-01-27 17:58:29 +11:00
Matthew Honnibal
d9bd3c611d * Improve API docs 2015-01-27 17:56:36 +11:00
Matthew Honnibal
fd1bb648cc * Work on api.rst 2015-01-27 16:47:28 +11:00
Matthew Honnibal
c38c62d4a3 * Add docstring to English class 2015-01-27 02:45:21 +11:00
Matthew Honnibal
830b9358f8 * Fix links in API docs 2015-01-27 01:08:56 +11:00
Matthew Honnibal
d4c99f7dec * Add attrs.pxd 2015-01-26 22:22:09 +11:00
Matthew Honnibal
af88a70a25 * Docs edit 2015-01-26 13:31:27 +11:00
Matthew Honnibal
3096b0e0f1 * Another docs edit 2015-01-26 13:29:02 +11:00
Matthew Honnibal
00974959a9 * Fixes to examples 2015-01-26 13:26:42 +11:00