Matthew Honnibal
|
60d26243e3
|
* Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers.
|
2015-06-18 16:35:27 +02:00 |
|
Matthew Honnibal
|
f868175e43
|
* Whitespace
|
2015-06-16 23:37:46 +02:00 |
|
Matthew Honnibal
|
ab110be125
|
* Remove debugging in parser.pyx
|
2015-06-16 23:37:25 +02:00 |
|
Matthew Honnibal
|
4dad4058c3
|
* Uncomment NER training
|
2015-06-16 23:36:54 +02:00 |
|
Matthew Honnibal
|
9b13d11ab3
|
* Fix handling of entities in StateClass
|
2015-06-16 23:35:21 +02:00 |
|
Matthew Honnibal
|
5699585278
|
* Use tree_arc_eager system as baseline in experiments
|
2015-06-15 08:23:43 +02:00 |
|
Matthew Honnibal
|
c40a2c661c
|
* Add tree_arc_eager
|
2015-06-15 08:23:24 +02:00 |
|
Matthew Honnibal
|
a5ae98a543
|
* Add tree_arc_eager to setup.py
|
2015-06-15 08:22:59 +02:00 |
|
Matthew Honnibal
|
5da5cf7084
|
* Add some more features for S1/S0
|
2015-06-15 04:07:13 +02:00 |
|
Matthew Honnibal
|
8156a01bca
|
* Fix root label for orig_arc_eager
|
2015-06-15 02:54:55 +02:00 |
|
Matthew Honnibal
|
21930ede15
|
* Switch toggle on USE_ROOT_ARC_SEGMENT
|
2015-06-15 02:54:32 +02:00 |
|
Matthew Honnibal
|
4841f8ad5e
|
* Set transition system early
|
2015-06-15 02:54:12 +02:00 |
|
Matthew Honnibal
|
38a6afa484
|
* Make possibly dubious correction to the unshift oracle
|
2015-06-15 02:50:00 +02:00 |
|
Matthew Honnibal
|
f66228f253
|
* Add some more features, esp for labels
|
2015-06-14 21:18:02 +02:00 |
|
Matthew Honnibal
|
3da8e0f317
|
* Add orig_arc_eager
|
2015-06-14 20:31:44 +02:00 |
|
Matthew Honnibal
|
bcfdf126a4
|
* Add toggle for OrigArcEager system
|
2015-06-14 20:28:14 +02:00 |
|
Matthew Honnibal
|
ea8a103007
|
* Fix import of TransitionSystem in parser.pyx
|
2015-06-14 19:01:26 +02:00 |
|
Matthew Honnibal
|
e0984ca139
|
* Fix valency features in StateClass
|
2015-06-14 17:50:26 +02:00 |
|
Matthew Honnibal
|
e50ac1a47f
|
* Add verbose printing to scorer
|
2015-06-14 17:45:50 +02:00 |
|
Matthew Honnibal
|
c500d72dc2
|
* Temporarily disable NER, and wire up the verbose flag during training
|
2015-06-14 17:45:31 +02:00 |
|
Matthew Honnibal
|
763cbd23d5
|
* Upd stateclass.print_state
|
2015-06-14 17:44:29 +02:00 |
|
Matthew Honnibal
|
bdd07bf000
|
* Fix Break oracle, but disable the Break transition for now, while we finalize the gold-standard experiments
|
2015-06-14 17:44:03 +02:00 |
|
Matthew Honnibal
|
399f15fbdf
|
* Add flag to toggle handling of multi-root inputs without the Break transition. Clear up now unused best_valid stuff.
|
2015-06-14 00:28:37 +02:00 |
|
Matthew Honnibal
|
75289b4761
|
* Don't refuse to parse single token sentences, incase some transition system needs them, e.g. single word entity. Instead fix error in _init_state.
|
2015-06-13 22:55:55 +02:00 |
|
Matthew Honnibal
|
77d7e79c7e
|
* Fix r/l and distance features.
|
2015-06-12 13:06:15 +02:00 |
|
Matthew Honnibal
|
b643cb3d5c
|
* Allow training documents to be filtered in gold.pyx
|
2015-06-12 02:42:08 +02:00 |
|
Matthew Honnibal
|
15e177d7a1
|
* Fixes to unshift/fast-forward strategy. Getting 91.55 greedy on NW dev, gold preproc
|
2015-06-12 01:50:23 +02:00 |
|
Matthew Honnibal
|
afd77a529b
|
* Prepare for break transition, with fast-forwarding. 86.5 on 1k nw gold preproc
|
2015-06-10 14:08:30 +02:00 |
|
Matthew Honnibal
|
495f528709
|
* Add support for sentence breaks in stateclass
|
2015-06-10 12:34:28 +02:00 |
|
Matthew Honnibal
|
b7b18c279d
|
* Fix Reduce oracle. Getting 86.35
|
2015-06-10 11:33:39 +02:00 |
|
Matthew Honnibal
|
bb09b5d91a
|
* Fix shifted bit vector in stateclass --- should reflect whether the word has been *unshifted*.
|
2015-06-10 11:33:09 +02:00 |
|
Matthew Honnibal
|
aa9625f688
|
* Do non-monotonic Unshift. Every word can be shifted at most 1 time. When the Reduce move is used, if S0 has no head, we put the word back on the buffer. Gets 86.4 on nw 1k with gold pre-proc. Break transition not yet implemented for this.
|
2015-06-10 10:15:56 +02:00 |
|
Matthew Honnibal
|
7bf6b7de3e
|
* Add unshift action to StateClass, and track which moves have been shifted
|
2015-06-10 10:13:03 +02:00 |
|
Matthew Honnibal
|
f7c8069e65
|
* Fix bug in distance feature
|
2015-06-10 10:12:17 +02:00 |
|
Matthew Honnibal
|
abd07c067a
|
* Inline B and S methods on stateclass
|
2015-06-10 07:22:33 +02:00 |
|
Matthew Honnibal
|
e2f9a80713
|
* Remove old _state imports
|
2015-06-10 07:09:17 +02:00 |
|
Matthew Honnibal
|
e9aaecc619
|
* Remove from_struct method from StateClass
|
2015-06-10 06:58:27 +02:00 |
|
Matthew Honnibal
|
18cc326dc0
|
* Bug fixes to ner.pyx
|
2015-06-10 06:57:41 +02:00 |
|
Matthew Honnibal
|
90a3add8d7
|
* Require thinc 2.0
|
2015-06-10 06:57:13 +02:00 |
|
Matthew Honnibal
|
e5570c9700
|
* Set nogil for oracle functions
|
2015-06-10 06:56:56 +02:00 |
|
Matthew Honnibal
|
4575e7a60f
|
* Fix beam search with new StateClass
|
2015-06-10 06:33:39 +02:00 |
|
Matthew Honnibal
|
d70304b7dd
|
* Require newer thinc
|
2015-06-10 04:20:42 +02:00 |
|
Matthew Honnibal
|
04b1cd9b8c
|
* Greedy parsing working with new StateClass. Beam parsing broken
|
2015-06-10 04:20:23 +02:00 |
|
Matthew Honnibal
|
6a94b64eca
|
* Remove State* from parser.pyx entirely, switching over to StateClass. Beam parsing still untested.
|
2015-06-10 02:03:38 +02:00 |
|
Matthew Honnibal
|
f14a1526aa
|
* Remove version of fill_context that takes State*
|
2015-06-10 01:39:07 +02:00 |
|
Matthew Honnibal
|
d68c686ec1
|
* Move StateClass into interface of transition functions
|
2015-06-10 01:35:28 +02:00 |
|
Matthew Honnibal
|
4b98b3e9c8
|
* Cost functions now take StateClass argument, instead of State*.
|
2015-06-10 00:40:43 +02:00 |
|
Matthew Honnibal
|
e0cf61f591
|
* Move StateClass into the interface for is_valid
|
2015-06-09 23:23:28 +02:00 |
|
Matthew Honnibal
|
09617a4638
|
* Whitespace
|
2015-06-09 21:20:33 +02:00 |
|
Matthew Honnibal
|
0895d454fb
|
* Prepare to switch to using state class, instead of state struct
|
2015-06-09 21:20:14 +02:00 |
|