Matthew Honnibal
|
76cd024095
|
* Add whitespace property to Token
|
2015-01-24 07:41:21 +11:00 |
|
Matthew Honnibal
|
5fd72bc220
|
* Have 'string' refer to the whitespace-padded string
|
2015-01-24 07:32:38 +11:00 |
|
Matthew Honnibal
|
706305ee26
|
* Upd tests for new meaning of 'string'
|
2015-01-24 07:22:30 +11:00 |
|
Matthew Honnibal
|
fda94271af
|
* Rename NORM1 and NORM2 attrs to lower and norm
|
2015-01-24 06:17:03 +11:00 |
|
Matthew Honnibal
|
75feb52c5d
|
* Work on quickstart
|
2015-01-24 02:53:55 +11:00 |
|
Matthew Honnibal
|
fb6f079092
|
* Update main page
|
2015-01-23 23:11:16 +11:00 |
|
Matthew Honnibal
|
edd898947c
|
* Improve example functionality, adding usage of word vectors
|
2015-01-23 08:22:00 +11:00 |
|
Matthew Honnibal
|
5ed8b2b98f
|
* Rename sic to orth
|
2015-01-23 02:08:25 +11:00 |
|
Matthew Honnibal
|
93d4bd6c2e
|
* Add test for ). in tokenizer
|
2015-01-22 22:25:18 +11:00 |
|
Matthew Honnibal
|
a27b23cc8f
|
* Have SBD return start/end indices
|
2015-01-22 22:24:44 +11:00 |
|
Matthew Honnibal
|
b183dff72d
|
* Remove stray print statement from setup
|
2015-01-22 02:06:42 +11:00 |
|
Matthew Honnibal
|
d460c28838
|
* Rename vec to repvec
|
2015-01-22 02:06:22 +11:00 |
|
Matthew Honnibal
|
8b9d913d97
|
* Rename vec to repvec
|
2015-01-22 02:05:58 +11:00 |
|
Matthew Honnibal
|
9cd0b6b3e9
|
* Various tweaks to Tokens class
|
2015-01-22 02:05:37 +11:00 |
|
Matthew Honnibal
|
5928d158ce
|
* Pass the string to Tokens
|
2015-01-22 02:04:58 +11:00 |
|
Matthew Honnibal
|
45264e356b
|
* Rename vec to repvec
|
2015-01-22 02:04:24 +11:00 |
|
Matthew Honnibal
|
5e63c606ad
|
* Rename vec to repvec
|
2015-01-22 02:03:54 +11:00 |
|
Matthew Honnibal
|
56e6cf0672
|
* Add _string attr to Tokens object
|
2015-01-21 18:57:09 +11:00 |
|
Matthew Honnibal
|
d6ac60e91c
|
* Bug fixes to sentences method, and improved vector transport for tokens
|
2015-01-21 18:56:32 +11:00 |
|
Matthew Honnibal
|
f2a229136c
|
* Fix data_dir=None argument to English class
|
2015-01-21 18:27:31 +11:00 |
|
Matthew Honnibal
|
ef49b8c179
|
* Add stop-word flag
|
2015-01-21 18:22:31 +11:00 |
|
Matthew Honnibal
|
6646bfc5df
|
* Add LOWER attr
|
2015-01-21 18:19:08 +11:00 |
|
Matthew Honnibal
|
f149259bf5
|
* Fix negative indices in tokens
|
2015-01-20 01:16:29 +11:00 |
|
Matthew Honnibal
|
b65b0c07bf
|
* Messily hook up vector in tokens
|
2015-01-19 19:59:55 +11:00 |
|
Matthew Honnibal
|
06e7456c65
|
* Upd tests
|
2015-01-17 17:33:23 +11:00 |
|
Matthew Honnibal
|
8ff5b8bd84
|
* Add attribute for POS scheme
|
2015-01-17 17:33:16 +11:00 |
|
Matthew Honnibal
|
6c7e44140b
|
* Work on word vectors, and other stuff
|
2015-01-17 16:21:17 +11:00 |
|
Matthew Honnibal
|
7e69e17161
|
* Upd requirements.txt
|
2015-01-17 16:20:16 +11:00 |
|
Matthew Honnibal
|
1df8db51be
|
* Upd fabfile
|
2015-01-17 16:20:03 +11:00 |
|
Matthew Honnibal
|
e579dd39ca
|
* Load numpy headers
|
2015-01-17 16:19:54 +11:00 |
|
Matthew Honnibal
|
7d9306c3bd
|
* Upd docs
|
2015-01-17 16:19:21 +11:00 |
|
Matthew Honnibal
|
2e14f09d2f
|
* Add features table
|
2015-01-16 19:04:03 +11:00 |
|
Matthew Honnibal
|
1590788dd4
|
* Add quickstart page to docs
|
2015-01-16 17:09:46 +11:00 |
|
Matthew Honnibal
|
43b5a0f4c7
|
* Add How It Works page to docs
|
2015-01-16 17:09:14 +11:00 |
|
Matthew Honnibal
|
e28b224b80
|
* Impove index docs
|
2015-01-16 07:08:35 +11:00 |
|
Matthew Honnibal
|
e8dbac8a0c
|
* Rename license files
|
2015-01-16 07:06:17 +11:00 |
|
Matthew Honnibal
|
12e18d9fa9
|
* Add license page
|
2015-01-16 07:05:36 +11:00 |
|
Matthew Honnibal
|
802867e96a
|
* Revise interface to Token. Strings now have attribute names like norm1_
|
2015-01-15 03:51:47 +11:00 |
|
Matthew Honnibal
|
7d3c40de7d
|
* Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme
|
2015-01-15 00:33:16 +11:00 |
|
Matthew Honnibal
|
0930892fc1
|
* Tmp. Working on refactor. Compiles, must hook up lexical feats.
|
2015-01-14 00:03:48 +11:00 |
|
Matthew Honnibal
|
46da3d74d2
|
* Tmp. Refactoring, introducing a Lexeme PyObject.
|
2015-01-12 11:23:44 +11:00 |
|
Matthew Honnibal
|
ce2edd6312
|
* Tmp commit. Refactoring to create a Python Lexeme class.
|
2015-01-12 10:26:22 +11:00 |
|
Matthew Honnibal
|
61904e590f
|
* Add parser training script
|
2015-01-10 04:53:26 +11:00 |
|
Matthew Honnibal
|
c918de68fa
|
* Fix pos command
|
2015-01-09 05:14:45 +11:00 |
|
Matthew Honnibal
|
9818d7419e
|
* Inc version
|
2015-01-09 05:14:29 +11:00 |
|
Matthew Honnibal
|
a0eb450e82
|
* Inc version
|
2015-01-08 01:19:57 +11:00 |
|
Matthew Honnibal
|
aacaf1a0f0
|
* Fix parser
|
2015-01-08 01:19:23 +11:00 |
|
Matthew Honnibal
|
03a10e6cf2
|
* Inc version --- last didn't pack the correct cpp files.
|
2015-01-08 01:08:17 +11:00 |
|
Matthew Honnibal
|
c096fe84f7
|
* Inc version
|
2015-01-08 00:10:31 +11:00 |
|
Matthew Honnibal
|
9a21127bf7
|
* Fix parser, which was importing the wrong model
|
2015-01-08 00:10:15 +11:00 |
|