Commit Graph

593 Commits

Author SHA1 Message Date
Matthew Honnibal
5fd72bc220 * Have 'string' refer to the whitespace-padded string 2015-01-24 07:32:38 +11:00
Matthew Honnibal
706305ee26 * Upd tests for new meaning of 'string' 2015-01-24 07:22:30 +11:00
Matthew Honnibal
fda94271af * Rename NORM1 and NORM2 attrs to lower and norm 2015-01-24 06:17:03 +11:00
Matthew Honnibal
75feb52c5d * Work on quickstart 2015-01-24 02:53:55 +11:00
Matthew Honnibal
fb6f079092 * Update main page 2015-01-23 23:11:16 +11:00
Matthew Honnibal
edd898947c * Improve example functionality, adding usage of word vectors 2015-01-23 08:22:00 +11:00
Matthew Honnibal
5ed8b2b98f * Rename sic to orth 2015-01-23 02:08:25 +11:00
Matthew Honnibal
93d4bd6c2e * Add test for ). in tokenizer 2015-01-22 22:25:18 +11:00
Matthew Honnibal
a27b23cc8f * Have SBD return start/end indices 2015-01-22 22:24:44 +11:00
Matthew Honnibal
b183dff72d * Remove stray print statement from setup 2015-01-22 02:06:42 +11:00
Matthew Honnibal
d460c28838 * Rename vec to repvec 2015-01-22 02:06:22 +11:00
Matthew Honnibal
8b9d913d97 * Rename vec to repvec 2015-01-22 02:05:58 +11:00
Matthew Honnibal
9cd0b6b3e9 * Various tweaks to Tokens class 2015-01-22 02:05:37 +11:00
Matthew Honnibal
5928d158ce * Pass the string to Tokens 2015-01-22 02:04:58 +11:00
Matthew Honnibal
45264e356b * Rename vec to repvec 2015-01-22 02:04:24 +11:00
Matthew Honnibal
5e63c606ad * Rename vec to repvec 2015-01-22 02:03:54 +11:00
Matthew Honnibal
56e6cf0672 * Add _string attr to Tokens object 2015-01-21 18:57:09 +11:00
Matthew Honnibal
d6ac60e91c * Bug fixes to sentences method, and improved vector transport for tokens 2015-01-21 18:56:32 +11:00
Matthew Honnibal
f2a229136c * Fix data_dir=None argument to English class 2015-01-21 18:27:31 +11:00
Matthew Honnibal
ef49b8c179 * Add stop-word flag 2015-01-21 18:22:31 +11:00
Matthew Honnibal
6646bfc5df * Add LOWER attr 2015-01-21 18:19:08 +11:00
Matthew Honnibal
f149259bf5 * Fix negative indices in tokens 2015-01-20 01:16:29 +11:00
Matthew Honnibal
b65b0c07bf * Messily hook up vector in tokens 2015-01-19 19:59:55 +11:00
Matthew Honnibal
06e7456c65 * Upd tests 2015-01-17 17:33:23 +11:00
Matthew Honnibal
8ff5b8bd84 * Add attribute for POS scheme 2015-01-17 17:33:16 +11:00
Matthew Honnibal
6c7e44140b * Work on word vectors, and other stuff 2015-01-17 16:21:17 +11:00
Matthew Honnibal
7e69e17161 * Upd requirements.txt 2015-01-17 16:20:16 +11:00
Matthew Honnibal
1df8db51be * Upd fabfile 2015-01-17 16:20:03 +11:00
Matthew Honnibal
e579dd39ca * Load numpy headers 2015-01-17 16:19:54 +11:00
Matthew Honnibal
7d9306c3bd * Upd docs 2015-01-17 16:19:21 +11:00
Matthew Honnibal
2e14f09d2f * Add features table 2015-01-16 19:04:03 +11:00
Matthew Honnibal
1590788dd4 * Add quickstart page to docs 2015-01-16 17:09:46 +11:00
Matthew Honnibal
43b5a0f4c7 * Add How It Works page to docs 2015-01-16 17:09:14 +11:00
Matthew Honnibal
e28b224b80 * Impove index docs 2015-01-16 07:08:35 +11:00
Matthew Honnibal
e8dbac8a0c * Rename license files 2015-01-16 07:06:17 +11:00
Matthew Honnibal
12e18d9fa9 * Add license page 2015-01-16 07:05:36 +11:00
Matthew Honnibal
802867e96a * Revise interface to Token. Strings now have attribute names like norm1_ 2015-01-15 03:51:47 +11:00
Matthew Honnibal
7d3c40de7d * Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme 2015-01-15 00:33:16 +11:00
Matthew Honnibal
0930892fc1 * Tmp. Working on refactor. Compiles, must hook up lexical feats. 2015-01-14 00:03:48 +11:00
Matthew Honnibal
46da3d74d2 * Tmp. Refactoring, introducing a Lexeme PyObject. 2015-01-12 11:23:44 +11:00
Matthew Honnibal
ce2edd6312 * Tmp commit. Refactoring to create a Python Lexeme class. 2015-01-12 10:26:22 +11:00
Matthew Honnibal
61904e590f * Add parser training script 2015-01-10 04:53:26 +11:00
Matthew Honnibal
c918de68fa * Fix pos command 2015-01-09 05:14:45 +11:00
Matthew Honnibal
9818d7419e * Inc version 2015-01-09 05:14:29 +11:00
Matthew Honnibal
a0eb450e82 * Inc version 2015-01-08 01:19:57 +11:00
Matthew Honnibal
aacaf1a0f0 * Fix parser 2015-01-08 01:19:23 +11:00
Matthew Honnibal
03a10e6cf2 * Inc version --- last didn't pack the correct cpp files. 2015-01-08 01:08:17 +11:00
Matthew Honnibal
c096fe84f7 * Inc version 2015-01-08 00:10:31 +11:00
Matthew Honnibal
9a21127bf7 * Fix parser, which was importing the wrong model 2015-01-08 00:10:15 +11:00
Matthew Honnibal
33bf76db63 * Allow pypy test to fail 2015-01-06 13:13:04 +11:00