| 
							
							
								 Matthew Honnibal | 60d26243e3 | * Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers. | 2015-06-18 16:35:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f868175e43 | * Whitespace | 2015-06-16 23:37:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ab110be125 | * Remove debugging in parser.pyx | 2015-06-16 23:37:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4dad4058c3 | * Uncomment NER training | 2015-06-16 23:36:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9b13d11ab3 | * Fix handling of entities in StateClass | 2015-06-16 23:35:21 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5699585278 | * Use tree_arc_eager system as baseline in experiments | 2015-06-15 08:23:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c40a2c661c | * Add tree_arc_eager | 2015-06-15 08:23:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a5ae98a543 | * Add tree_arc_eager to setup.py | 2015-06-15 08:22:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5da5cf7084 | * Add some more features for S1/S0 | 2015-06-15 04:07:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8156a01bca | * Fix root label for orig_arc_eager | 2015-06-15 02:54:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 21930ede15 | * Switch toggle on USE_ROOT_ARC_SEGMENT | 2015-06-15 02:54:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4841f8ad5e | * Set transition system early | 2015-06-15 02:54:12 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 38a6afa484 | * Make possibly dubious correction to the unshift oracle | 2015-06-15 02:50:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f66228f253 | * Add some more features, esp for labels | 2015-06-14 21:18:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3da8e0f317 | * Add orig_arc_eager | 2015-06-14 20:31:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bcfdf126a4 | * Add toggle for OrigArcEager system | 2015-06-14 20:28:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ea8a103007 | * Fix import of TransitionSystem in parser.pyx | 2015-06-14 19:01:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0984ca139 | * Fix valency features in StateClass | 2015-06-14 17:50:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e50ac1a47f | * Add verbose printing to scorer | 2015-06-14 17:45:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c500d72dc2 | * Temporarily disable NER, and wire up the verbose flag during training | 2015-06-14 17:45:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 763cbd23d5 | * Upd stateclass.print_state | 2015-06-14 17:44:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bdd07bf000 | * Fix Break oracle, but disable the Break transition for now, while we finalize the gold-standard experiments | 2015-06-14 17:44:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 399f15fbdf | * Add flag to toggle handling of multi-root inputs without the Break transition. Clear up now unused best_valid stuff. | 2015-06-14 00:28:37 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 75289b4761 | * Don't refuse to parse single token sentences, incase some transition system needs them, e.g. single word entity. Instead fix error in _init_state. | 2015-06-13 22:55:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 77d7e79c7e | * Fix r/l and distance features. | 2015-06-12 13:06:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b643cb3d5c | * Allow training documents to be filtered in gold.pyx | 2015-06-12 02:42:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 15e177d7a1 | * Fixes to unshift/fast-forward strategy. Getting 91.55 greedy on NW dev, gold preproc | 2015-06-12 01:50:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | afd77a529b | * Prepare for break transition, with fast-forwarding. 86.5 on 1k nw gold preproc | 2015-06-10 14:08:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 495f528709 | * Add support for sentence breaks in stateclass | 2015-06-10 12:34:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b7b18c279d | * Fix Reduce oracle. Getting 86.35 | 2015-06-10 11:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb09b5d91a | * Fix shifted bit vector in stateclass --- should reflect whether the word has been *unshifted*. | 2015-06-10 11:33:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aa9625f688 | * Do non-monotonic Unshift. Every word can be shifted at most 1 time. When the Reduce move is used, if S0 has no head, we put the word back on the buffer. Gets 86.4 on nw 1k with gold pre-proc. Break transition not yet implemented for this. | 2015-06-10 10:15:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7bf6b7de3e | * Add unshift action to StateClass, and track which moves have been shifted | 2015-06-10 10:13:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f7c8069e65 | * Fix bug in distance feature | 2015-06-10 10:12:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | abd07c067a | * Inline B and S methods on stateclass | 2015-06-10 07:22:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e2f9a80713 | * Remove old _state imports | 2015-06-10 07:09:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e9aaecc619 | * Remove from_struct method from StateClass | 2015-06-10 06:58:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 18cc326dc0 | * Bug fixes to ner.pyx | 2015-06-10 06:57:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 90a3add8d7 | * Require thinc 2.0 | 2015-06-10 06:57:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e5570c9700 | * Set nogil for oracle functions | 2015-06-10 06:56:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4575e7a60f | * Fix beam search with new StateClass | 2015-06-10 06:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d70304b7dd | * Require newer thinc | 2015-06-10 04:20:42 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04b1cd9b8c | * Greedy parsing working with new StateClass. Beam parsing broken | 2015-06-10 04:20:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6a94b64eca | * Remove State* from parser.pyx entirely, switching over to StateClass. Beam parsing still untested. | 2015-06-10 02:03:38 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f14a1526aa | * Remove version of fill_context that takes State* | 2015-06-10 01:39:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d68c686ec1 | * Move StateClass into interface of transition functions | 2015-06-10 01:35:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4b98b3e9c8 | * Cost functions now take StateClass argument, instead of State*. | 2015-06-10 00:40:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0cf61f591 | * Move StateClass into the interface for is_valid | 2015-06-09 23:23:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 09617a4638 | * Whitespace | 2015-06-09 21:20:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0895d454fb | * Prepare to switch to using state class, instead of state struct | 2015-06-09 21:20:14 +02:00 |  |