| 
							
							
								 Matthew Honnibal | 221e2e485f | * Assign 'ROOT' as label, not 'root' | 2015-06-23 15:09:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a7bf7b0626 | * Rename sent_start to sent_end, to reflect its new usage in the Break transition | 2015-06-23 05:39:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 34c0ef2ee8 | * Don't compile the orig_arc_eager and tree_arc_eager modules used for the EMNLP paper | 2015-06-23 05:38:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 59e9f9153c | * Remove projectivity constraint in train.py, but raise Exception if non-projective sentence is encountered, since we've told GoldParse to projectivize | 2015-06-23 05:04:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ee3e56f27b | * Fix bounds checking on entities | 2015-06-23 04:35:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 43ef5ddea5 | * Ensure root label is spelled ROOT, for backwards compatibility | 2015-06-23 04:14:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 065c2e1d2d | * Add some bounds checking around state arrays | 2015-06-23 04:13:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 89ae218b75 | * Add import to tokens.pyx from weird Cython compiler issue with casting from memory views | 2015-06-23 03:04:34 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f01b3d043e | * Add padding to arrays in stateclass. May be papering over a deeper bug. | 2015-06-23 03:03:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5e94b5d581 | * Have Tokens return proper numpy arrays, not Cython views. | 2015-06-23 00:07:34 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 69507bc729 | * Re-enable Break transition in arc_eager.pyx | 2015-06-23 00:03:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cc579ed429 | * Add __len__ function to StringStore | 2015-06-23 00:02:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 46fb24e9fd | * Add cycle-checking code in gold.pyx | 2015-06-23 00:02:22 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 839e5038b7 | * Raise exception on non-projective input | 2015-06-23 00:01:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fe9118a528 | * Add test for strip_bad_periods reading in read_conll.parse | 2015-06-18 16:36:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 60d26243e3 | * Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers. | 2015-06-18 16:35:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f868175e43 | * Whitespace | 2015-06-16 23:37:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ab110be125 | * Remove debugging in parser.pyx | 2015-06-16 23:37:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4dad4058c3 | * Uncomment NER training | 2015-06-16 23:36:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9b13d11ab3 | * Fix handling of entities in StateClass | 2015-06-16 23:35:21 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5699585278 | * Use tree_arc_eager system as baseline in experiments | 2015-06-15 08:23:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c40a2c661c | * Add tree_arc_eager | 2015-06-15 08:23:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a5ae98a543 | * Add tree_arc_eager to setup.py | 2015-06-15 08:22:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5da5cf7084 | * Add some more features for S1/S0 | 2015-06-15 04:07:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8156a01bca | * Fix root label for orig_arc_eager | 2015-06-15 02:54:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 21930ede15 | * Switch toggle on USE_ROOT_ARC_SEGMENT | 2015-06-15 02:54:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4841f8ad5e | * Set transition system early | 2015-06-15 02:54:12 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 38a6afa484 | * Make possibly dubious correction to the unshift oracle | 2015-06-15 02:50:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f66228f253 | * Add some more features, esp for labels | 2015-06-14 21:18:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3da8e0f317 | * Add orig_arc_eager | 2015-06-14 20:31:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bcfdf126a4 | * Add toggle for OrigArcEager system | 2015-06-14 20:28:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ea8a103007 | * Fix import of TransitionSystem in parser.pyx | 2015-06-14 19:01:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0984ca139 | * Fix valency features in StateClass | 2015-06-14 17:50:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e50ac1a47f | * Add verbose printing to scorer | 2015-06-14 17:45:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c500d72dc2 | * Temporarily disable NER, and wire up the verbose flag during training | 2015-06-14 17:45:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 763cbd23d5 | * Upd stateclass.print_state | 2015-06-14 17:44:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bdd07bf000 | * Fix Break oracle, but disable the Break transition for now, while we finalize the gold-standard experiments | 2015-06-14 17:44:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 399f15fbdf | * Add flag to toggle handling of multi-root inputs without the Break transition. Clear up now unused best_valid stuff. | 2015-06-14 00:28:37 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 75289b4761 | * Don't refuse to parse single token sentences, in case some transition system needs them, e.g. single word entity. Instead fix error in _init_state. | 2015-06-13 22:55:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 77d7e79c7e | * Fix r/l and distance features. | 2015-06-12 13:06:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b643cb3d5c | * Allow training documents to be filtered in gold.pyx | 2015-06-12 02:42:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 15e177d7a1 | * Fixes to unshift/fast-forward strategy. Getting 91.55 greedy on NW dev, gold preproc | 2015-06-12 01:50:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | afd77a529b | * Prepare for break transition, with fast-forwarding. 86.5 on 1k nw gold preproc | 2015-06-10 14:08:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 495f528709 | * Add support for sentence breaks in stateclass | 2015-06-10 12:34:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b7b18c279d | * Fix Reduce oracle. Getting 86.35 | 2015-06-10 11:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb09b5d91a | * Fix shifted bit vector in stateclass --- should reflect whether the word has been *unshifted*. | 2015-06-10 11:33:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aa9625f688 | * Do non-monotonic Unshift. Every word can be shifted at most 1 time. When the Reduce move is used, if S0 has no head, we put the word back on the buffer. Gets 86.4 on nw 1k with gold pre-proc. Break transition not yet implemented for this. | 2015-06-10 10:15:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7bf6b7de3e | * Add unshift action to StateClass, and track which moves have been shifted | 2015-06-10 10:13:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f7c8069e65 | * Fix bug in distance feature | 2015-06-10 10:12:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | abd07c067a | * Inline B and S methods on stateclass | 2015-06-10 07:22:33 +02:00 |  |