leofidus
|
0ae05f77ab
|
Add rokenizer test for zero length string
|
2015-02-07 03:01:44 +01:00 |
|
Matthew Honnibal
|
7de00c5a79
|
* Try not holding a reference to Pool, since that seems to confuse the GC
|
2015-01-31 22:10:22 +11:00 |
|
Matthew Honnibal
|
ce3ae8b5d9
|
* Fix platform-specific lexicon bug.
|
2015-01-31 16:38:58 +11:00 |
|
Matthew Honnibal
|
a1ed574b7b
|
* Fix default model path for English
|
2015-01-31 16:38:27 +11:00 |
|
Matthew Honnibal
|
543fe5a280
|
* Comment training function back in
|
2015-01-31 16:37:57 +11:00 |
|
Matthew Honnibal
|
018e0bfa24
|
* Bug fixes to parse navigation
|
2015-01-31 16:37:13 +11:00 |
|
Matthew Honnibal
|
e013555b25
|
* Add option to download script
|
2015-01-31 13:51:56 +11:00 |
|
Matthew Honnibal
|
a3955fd8d5
|
* Require plac
|
2015-01-31 13:50:53 +11:00 |
|
Matthew Honnibal
|
951684c4ac
|
* Upd travis.yml
|
2015-01-31 13:50:30 +11:00 |
|
Matthew Honnibal
|
0fd4a71bed
|
* Add provisional tests for sentence boundary detection
|
2015-01-31 13:46:11 +11:00 |
|
Matthew Honnibal
|
df749f257d
|
* Upd train script
|
2015-01-31 13:44:37 +11:00 |
|
Matthew Honnibal
|
08ca5c8970
|
* Add sent_end flag to TokenC struct
|
2015-01-31 13:44:16 +11:00 |
|
Matthew Honnibal
|
024cfd485c
|
* Pass tag_strings as a tuple, to support new Tokens API
|
2015-01-31 13:43:37 +11:00 |
|
Matthew Honnibal
|
77d62d0179
|
* Large refactor of Token objects, making them much thinner. This is to support fast parse-tree navigation.
|
2015-01-31 13:42:58 +11:00 |
|
Matthew Honnibal
|
88170e6295
|
* Supply dep_strings as a tuple, for the changed API on Tokens
|
2015-01-31 13:42:09 +11:00 |
|
Matthew Honnibal
|
0981d68022
|
* Set a sent_end flag during parsing, for later use
|
2015-01-31 13:41:46 +11:00 |
|
Matthew Honnibal
|
251dbf24d7
|
* Fix unintialised variable error
|
2015-01-30 20:46:34 +11:00 |
|
Matthew Honnibal
|
83a4df5a1a
|
* Fix download script
|
2015-01-30 20:40:42 +11:00 |
|
Matthew Honnibal
|
6f9ebc2f34
|
* Fix download script
|
2015-01-30 20:33:19 +11:00 |
|
Matthew Honnibal
|
a2bed49ac7
|
* Upd travis.yml
|
2015-01-30 20:27:35 +11:00 |
|
Matthew Honnibal
|
8b85d0bb8a
|
* Only download small data if no data dir exists
|
2015-01-30 20:27:14 +11:00 |
|
Matthew Honnibal
|
e03b1fea22
|
* Don't download full data during test
|
2015-01-30 20:12:33 +11:00 |
|
Matthew Honnibal
|
2da694f65e
|
* Don't load parser in test_pre_punct
|
2015-01-30 20:11:47 +11:00 |
|
Matthew Honnibal
|
e88ceda0ab
|
* Set PYTHONPATH in travis.yml
|
2015-01-30 19:48:34 +11:00 |
|
Matthew Honnibal
|
6c081dd1fc
|
* Handle failure when numpy headers are already installed correctly
|
2015-01-30 19:48:19 +11:00 |
|
Matthew Honnibal
|
1a7a1c2771
|
* Fix Issue #16: tokens recurse when printing
|
2015-01-30 19:47:50 +11:00 |
|
Matthew Honnibal
|
cb95ef6934
|
* Fix download script
|
2015-01-30 19:28:43 +11:00 |
|
Matthew Honnibal
|
e578bd37bd
|
* Fix download script
|
2015-01-30 18:59:31 +11:00 |
|
Matthew Honnibal
|
df52014d12
|
* Fix download script
|
2015-01-30 18:36:24 +11:00 |
|
Matthew Honnibal
|
f0bbffca8d
|
* Fix the way numpy headers are installed during compilation from source
|
2015-01-30 18:14:45 +11:00 |
|
Matthew Honnibal
|
1ef0e04aa0
|
* Change travis.yml to build from source, instead of from PyPi. PyPi checking will be done in a different branch.
|
2015-01-30 18:06:02 +11:00 |
|
Matthew Honnibal
|
0f95712189
|
* Improve accuracy reporting during training
|
2015-01-30 18:05:06 +11:00 |
|
Matthew Honnibal
|
b68f563c2f
|
* Fix Issue #14: Improve parsing API
|
2015-01-30 18:04:41 +11:00 |
|
Matthew Honnibal
|
998b607f65
|
* Upd download script, having it download all data if there's no data/ directory, allowing easier compilation from source
|
2015-01-30 18:04:01 +11:00 |
|
Matthew Honnibal
|
0b53fd7daa
|
* Add test for parse tree navigation
|
2015-01-30 18:02:58 +11:00 |
|
Matthew Honnibal
|
ef2493a3bd
|
* Upd gitignore
|
2015-01-30 16:49:44 +11:00 |
|
Matthew Honnibal
|
d5d1578e44
|
* Add manifest file
|
2015-01-30 16:49:02 +11:00 |
|
Matthew Honnibal
|
0a1ec40f76
|
* Add draft work on features
|
2015-01-30 16:46:52 +11:00 |
|
Matthew Honnibal
|
7d432b7e39
|
* Add tests for vector-space model
|
2015-01-30 16:45:45 +11:00 |
|
Matthew Honnibal
|
48b98e3fb5
|
* Add test for tag names
|
2015-01-30 16:45:11 +11:00 |
|
Matthew Honnibal
|
613a195f92
|
* Add test for indices
|
2015-01-30 16:44:29 +11:00 |
|
Matthew Honnibal
|
03cc2ee08e
|
* Add test for numpy array transport
|
2015-01-30 16:43:55 +11:00 |
|
Matthew Honnibal
|
d20eeac156
|
* Start work on lexrank tutorial
|
2015-01-30 16:42:43 +11:00 |
|
Matthew Honnibal
|
b3f9b199cf
|
Merge branch 'punctparse'
|
2015-01-30 16:38:56 +11:00 |
|
Matthew Honnibal
|
ca7577d8a9
|
* Allow parsers and taggers to be trained on text without gold pre-processing.
|
2015-01-30 16:36:24 +11:00 |
|
Matthew Honnibal
|
67d6e53a69
|
* Ensure parser and tagger function correctly when training from missing values, indicated by -1
|
2015-01-30 14:08:56 +11:00 |
|
Matthew Honnibal
|
4ff180db74
|
* Fix off-by-one error in commit 0a7fceb
|
2015-01-30 12:49:33 +11:00 |
|
Matthew Honnibal
|
d0e08a5b57
|
* Upd index tests
|
2015-01-30 12:35:13 +11:00 |
|
Matthew Honnibal
|
0a7fcebdf7
|
* Fix Issue #12: Incorrect token.idx calculations for some punctuation, in the presence of token cache
|
2015-01-30 12:33:38 +11:00 |
|
Matthew Honnibal
|
b38093237e
|
* More debug prints
|
2015-01-30 11:15:54 +11:00 |
|