Commit Graph

667 Commits

Author SHA1 Message Date
Matthew Honnibal
845bd2e50d * Add parts_of_speech to setup 2015-01-25 16:32:48 +11:00
Matthew Honnibal
5049d4c2e6 * Add parts_of_speech.pyx 2015-01-25 16:32:26 +11:00
Matthew Honnibal
12b034e3ef * Move POS tag definitions to parts_of_speech.pxd 2015-01-25 16:31:07 +11:00
Matthew Honnibal
7431c133d8 * Add error if try to access head and not is_parsed 2015-01-25 15:33:54 +11:00
Matthew Honnibal
e2ea0fb47a * Fix clean command 2015-01-25 14:49:29 +11:00
Matthew Honnibal
7588adf5e7 * Add numpy to install requires 2015-01-25 14:49:10 +11:00
Matthew Honnibal
951d06c824 * Silently don't parse if data is not present 2015-01-25 14:47:38 +11:00
Matthew Honnibal
0d62236247 * Add numpy to requirements 2015-01-25 02:25:29 +11:00
Matthew Honnibal
0250f39741 * Inc version 2015-01-25 02:25:16 +11:00
Matthew Honnibal
72ff9c5082 * Update parser training script for tweaked parser API 2015-01-25 02:20:49 +11:00
Matthew Honnibal
4e857ab7a6 * Fix bug in POS tagger feature 2015-01-25 02:20:15 +11:00
Matthew Honnibal
dd56e298e2 * Ensure tagging is applied if parse=True 2015-01-25 02:19:44 +11:00
Matthew Honnibal
97a9bc6f61 * Add summary paragraph to howworks 2015-01-25 01:58:48 +11:00
Matthew Honnibal
94750819cd * Set parse=True by default --- i.e. parse unless told not to. 2015-01-25 01:28:28 +11:00
Matthew Honnibal
96c96696b8 * Add benchmark details. 2015-01-25 01:25:27 +11:00
Matthew Honnibal
7770057174 * Minor edits to intro 2015-01-25 01:06:14 +11:00
Matthew Honnibal
70d4a9dcc5 * Rework intro text 2015-01-25 00:58:52 +11:00
Matthew Honnibal
83a7e91f3c * Edt API docs 2015-01-24 20:49:44 +11:00
Matthew Honnibal
71b95202eb * Add docstring to StringStore 2015-01-24 20:49:15 +11:00
Matthew Honnibal
6d1c08dafd * Add docstring to Lexeme 2015-01-24 20:48:34 +11:00
Matthew Honnibal
32f58b19d1 * Edits to quickstart 2015-01-24 17:47:51 +11:00
Matthew Honnibal
a97bed9359 * Fix POS and dependency label tag names. Add parse and string navigation functions. 2015-01-24 17:29:04 +11:00
Matthew Honnibal
cb6a526fcd * Update quickstart, with work on api-at-a-glance 2015-01-24 17:27:50 +11:00
Matthew Honnibal
76cd024095 * Add whitespace property to Token 2015-01-24 07:41:21 +11:00
Matthew Honnibal
5fd72bc220 * Have 'string' refer to the whitespace-padded string 2015-01-24 07:32:38 +11:00
Matthew Honnibal
706305ee26 * Upd tests for new meaning of 'string' 2015-01-24 07:22:30 +11:00
Matthew Honnibal
fda94271af * Rename NORM1 and NORM2 attrs to lower and norm 2015-01-24 06:17:03 +11:00
Matthew Honnibal
75feb52c5d * Work on quickstart 2015-01-24 02:53:55 +11:00
Matthew Honnibal
fb6f079092 * Update main page 2015-01-23 23:11:16 +11:00
Matthew Honnibal
edd898947c * Improve example functionality, adding usage of word vectors 2015-01-23 08:22:00 +11:00
Matthew Honnibal
5ed8b2b98f * Rename sic to orth 2015-01-23 02:08:25 +11:00
Matthew Honnibal
93d4bd6c2e * Add test for ). in tokenizer 2015-01-22 22:25:18 +11:00
Matthew Honnibal
a27b23cc8f * Have SBD return start/end indices 2015-01-22 22:24:44 +11:00
Matthew Honnibal
b183dff72d * Remove stray print statement from setup 2015-01-22 02:06:42 +11:00
Matthew Honnibal
d460c28838 * Rename vec to repvec 2015-01-22 02:06:22 +11:00
Matthew Honnibal
8b9d913d97 * Rename vec to repvec 2015-01-22 02:05:58 +11:00
Matthew Honnibal
9cd0b6b3e9 * Various tweaks to Tokens class 2015-01-22 02:05:37 +11:00
Matthew Honnibal
5928d158ce * Pass the string to Tokens 2015-01-22 02:04:58 +11:00
Matthew Honnibal
45264e356b * Rename vec to repvec 2015-01-22 02:04:24 +11:00
Matthew Honnibal
5e63c606ad * Rename vec to repvec 2015-01-22 02:03:54 +11:00
Matthew Honnibal
56e6cf0672 * Add _string attr to Tokens object 2015-01-21 18:57:09 +11:00
Matthew Honnibal
d6ac60e91c * Bug fixes to sentences method, and improved vector transport for tokens 2015-01-21 18:56:32 +11:00
Matthew Honnibal
f2a229136c * Fix data_dir=None argument to English class 2015-01-21 18:27:31 +11:00
Matthew Honnibal
ef49b8c179 * Add stop-word flag 2015-01-21 18:22:31 +11:00
Matthew Honnibal
6646bfc5df * Add LOWER attr 2015-01-21 18:19:08 +11:00
Matthew Honnibal
f149259bf5 * Fix negative indices in tokens 2015-01-20 01:16:29 +11:00
Matthew Honnibal
b65b0c07bf * Messily hook up vector in tokens 2015-01-19 19:59:55 +11:00
Matthew Honnibal
06e7456c65 * Upd tests 2015-01-17 17:33:23 +11:00
Matthew Honnibal
8ff5b8bd84 * Add attribute for POS scheme 2015-01-17 17:33:16 +11:00
Matthew Honnibal
6c7e44140b * Work on word vectors, and other stuff 2015-01-17 16:21:17 +11:00