| 
							
							
								 Matthew Honnibal | 0ceb1f71c2 | * Update parse features | 2015-07-08 19:11:36 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb522496dd | * Rename Tokens to Doc | 2015-07-08 18:53:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ff885e8511 | * Add ParserFactory convenience function | 2015-07-08 12:35:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 52fd80c6c6 | * Add experimental supersense features for parsing, based on lookup into wordnet. | 2015-07-01 20:12:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e20106fdff | * Begin reorganizing neuralnet work | 2015-06-30 14:26:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3bb5876c5a | * Inline methods in StateClass | 2015-06-29 01:10:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 313a7f87b3 | * Inline methods in StateClass | 2015-06-29 01:06:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a02fd3af5d | * Check valency in L and R feature methods, to make feaure calculation faster | 2015-06-29 00:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5d870720bc | * Check valency in L and R feature methods, to make feaure calculation faster | 2015-06-29 00:17:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f4986d5d3c | * Use new Example class | 2015-06-28 22:36:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 735f1af91f | * Fix neural net stuff | 2015-06-28 11:44:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e7003f1cf3 | * Remove hard-coding of vector lengths | 2015-06-28 11:37:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 897dd0dd0b | * Merge changes, and adjust Example to use memoryview | 2015-06-28 11:36:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9282a8e72c | * Prepare for new models to be plugged in by using Example class | 2015-06-28 11:02:35 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 75aeccc064 | * Rejig parser interface to use new thinc.api.Example class, in prep of theano model. Comment out beam search | 2015-06-28 11:02:34 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bbef71f213 | * Fix min function in fill_context | 2015-06-28 10:46:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 142b6f9510 | * Revert last changes | 2015-06-28 10:44:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b06962f18b | * Pad buffers in state | 2015-06-28 10:36:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 53be72387c | * Hack at fill_context to investigate performance loss | 2015-06-28 10:34:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 71a4e876a9 | * Fix parse features | 2015-06-28 09:27:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5af500909c | * Remove unused directve from parser.pyx | 2015-06-28 06:20:21 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d5b4090705 | * Add profile directive | 2015-06-28 06:19:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2b5421e60c | * Add profile directive | 2015-06-28 06:07:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8b5de4a411 | * Add word / tag / label sets, for use in neural net | 2015-06-28 05:46:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ed40a8380e | * Remove hard-coding of vector lengths | 2015-06-27 04:18:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ebe630cc8d | * Enable more features for NN | 2015-06-27 04:17:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f8bb43475e | * Bridge to Theano working. Very disorganised. Using thinc adb60aba966ed2 | 2015-06-27 02:39:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2fe98b8a9a | * Prepare for new models to be plugged in by using Example class | 2015-06-26 13:51:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6896455884 | * Rejig parser interface to use new thinc.api.Example class, in prep of theano model. Comment out beam search | 2015-06-26 06:25:36 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 02b171ee67 | * Bug fixes to edge calculation | 2015-06-24 04:28:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7f9384f53c | * Remove deprecated _state module | 2015-06-23 17:28:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6dbe182491 | * Fix merge conflicts | 2015-06-23 17:28:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 579735a095 | * Remove import of _state module | 2015-06-23 17:25:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 88f55d136b | * Remove deprecated _state module | 2015-06-23 17:19:51 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9ab9dd2bf7 | * Clean up unused orig_arc_eager and tree_arc_eager modules, which were only added for EMNLP experiments | 2015-06-23 17:17:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7ebfe4b983 | * Fixes to edge features | 2015-06-23 16:32:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7b125f5a86 | * Fixes to edge features | 2015-06-23 16:31:01 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 35c290bee4 | * Fix edge features | 2015-06-23 15:50:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 221e2e485f | * Assign 'ROOT' as label, not 'root' | 2015-06-23 15:09:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a7bf7b0626 | * Rename sent_start to sent_end, to reflect its new usage in the Break transition | 2015-06-23 05:39:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ee3e56f27b | * Fix bounds checking on entities | 2015-06-23 04:35:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 43ef5ddea5 | * Ensure root albel is spelled ROOT, for backwards compatibility | 2015-06-23 04:14:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 065c2e1d2d | * Add some bounds checking around state arrays | 2015-06-23 04:13:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f01b3d043e | * Add padding to arrays in stateclass. May be papering over a deeper bug. | 2015-06-23 03:03:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 69507bc729 | * Re-enable Break transition in arc_eager.pyx | 2015-06-23 00:03:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ab110be125 | * Remove debugging in parser.pyx | 2015-06-16 23:37:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9b13d11ab3 | * Fix handling of entities in StateClass | 2015-06-16 23:35:21 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c40a2c661c | * Add tree_arc_eager | 2015-06-15 08:23:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5da5cf7084 | * Add some more features for S1/S0 | 2015-06-15 04:07:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8156a01bca | * Fix root label for orig_arc_eager | 2015-06-15 02:54:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 21930ede15 | * Switch toggle on USE_ROOT_ARC_SEGMENT | 2015-06-15 02:54:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 38a6afa484 | * Make possibly dubious correction to the unshift oracle | 2015-06-15 02:50:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f66228f253 | * Add some more features, esp for labels | 2015-06-14 21:18:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3da8e0f317 | * Add orig_arc_eager | 2015-06-14 20:31:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ea8a103007 | * Fix import of TransitionSystem in parser.pyx | 2015-06-14 19:01:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0984ca139 | * Fix valency features in StateClass | 2015-06-14 17:50:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 763cbd23d5 | * Upd stateclass.print_state | 2015-06-14 17:44:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bdd07bf000 | * Fix Break oracle, but disable the Break transition for now, while we finalize the gold-standard experiments | 2015-06-14 17:44:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 399f15fbdf | * Add flag to toggle handling of multi-root inputs without the Break transition. Clear up now unused best_valid stuff. | 2015-06-14 00:28:37 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 75289b4761 | * Don't refuse to parse single token sentences, incase some transition system needs them, e.g. single word entity. Instead fix error in _init_state. | 2015-06-13 22:55:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 77d7e79c7e | * Fix r/l and distance features. | 2015-06-12 13:06:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 15e177d7a1 | * Fixes to unshift/fast-forward strategy. Getting 91.55 greedy on NW dev, gold preproc | 2015-06-12 01:50:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | afd77a529b | * Prepare for break transition, with fast-forwarding. 86.5 on 1k nw gold preproc | 2015-06-10 14:08:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 495f528709 | * Add support for sentence breaks in stateclass | 2015-06-10 12:34:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b7b18c279d | * Fix Reduce oracle. Getting 86.35 | 2015-06-10 11:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb09b5d91a | * Fix shifted bit vector in stateclass --- should reflect whether the word has been *unshifted*. | 2015-06-10 11:33:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aa9625f688 | * Do non-monotonic Unshift. Every word can be shifted at most 1 time. When the Reduce move is used, if S0 has no head, we put the word back on the buffer. Gets 86.4 on nw 1k with gold pre-proc. Break transition not yet implemented for this. | 2015-06-10 10:15:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7bf6b7de3e | * Add unshift action to StateClass, and track which moves have been shifted | 2015-06-10 10:13:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f7c8069e65 | * Fix bug in distance feature | 2015-06-10 10:12:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | abd07c067a | * Inline B and S methods on stateclass | 2015-06-10 07:22:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e2f9a80713 | * Remove old _state imports | 2015-06-10 07:09:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e9aaecc619 | * Remove from_struct method from StateClass | 2015-06-10 06:58:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 18cc326dc0 | * Bug fixes to ner.pyx | 2015-06-10 06:57:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e5570c9700 | * Set nogil for oracle functions | 2015-06-10 06:56:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4575e7a60f | * Fix beam search with new StateClass | 2015-06-10 06:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04b1cd9b8c | * Greedy parsing working with new StateClass. Beam parsing broken | 2015-06-10 04:20:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6a94b64eca | * Remove State* from parser.pyx entirely, switching over to StateClass. Beam parsing still untested. | 2015-06-10 02:03:38 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f14a1526aa | * Remove version of fill_context that takes State* | 2015-06-10 01:39:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d68c686ec1 | * Move StateClass into interface of transition functions | 2015-06-10 01:35:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4b98b3e9c8 | * Cost functions now take StateClass argument, instead of State*. | 2015-06-10 00:40:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0cf61f591 | * Move StateClass into the interface for is_valid | 2015-06-09 23:23:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0895d454fb | * Prepare to switch to using state class, instead of state struct | 2015-06-09 21:20:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2b9629ed62 | * Begin adding stateclass to ArcEager | 2015-06-09 01:41:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ba10fd8af5 | * Add StateClass, to replace/refactor the mess in _state | 2015-06-09 01:39:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c7e3dfc1dc | * Don't automatically push words when stack is empty, as it messes up beam parsing. Add hash method to beam state. | 2015-06-08 14:49:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6e2564239d | * Bug fixes to beam parser. Search still broken on non-gold sentences | 2015-06-07 19:12:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 731e5f1e46 | * Add get() function in spacy/syntax/Config | 2015-06-07 19:09:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8f142c1838 | * Refactor transition system oracles, to split out move and label cost. Preparing to add Unshift move. Will exclude non-monotonic. | 2015-06-07 03:21:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1fee7ade61 | * Tweak to ner | 2015-06-05 23:48:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 33e70b167f | * Remove dead code from ner.pyx | 2015-06-05 17:12:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 88ac5c6e98 | * Send beam_width < 0 to greedy parser | 2015-06-05 17:12:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0114e7600d | * Fix NER oracle | 2015-06-05 17:11:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6bf35cecc3 | * Refactor transition system to use classes with staticmethods. | 2015-06-05 02:27:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 36a34d544b | * Refactoring arc_eager, grouping oracle functions into transitions | 2015-06-04 22:43:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4433396005 | * Impove efficiency of dynamic oracle, making beam training faster | 2015-06-04 21:15:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 079dad28a7 | * Update for faster beam training | 2015-06-04 19:32:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a2627b6102 | * Fix bug in refactored init_transition | 2015-06-03 06:01:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dd0867645d | * Remove stray const from State header | 2015-06-03 00:10:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6c47b10a6e | * Make optimization to children_in_buffer: stop searching when we would cross a bracket. | 2015-06-02 21:05:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a513ec500f | * Have oracle functions take a struct instead of a Python object | 2015-06-02 20:01:06 +02:00 |  |