| 
							
							
								 Matthew Honnibal | 99c9ecfc18 | * Fix bug in prefix, suffix and word shape features in parser and NER | 2015-04-10 03:53:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5a075ea3fc | * Ensure NER moves are available for single-word tokens | 2015-04-05 22:30:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a60a366b2c | * Support 'punct' dep label in conll.pyx | 2015-04-05 22:30:19 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a3af6b7c3d | * Left-Arc from Root, to allow non-monotonic reduce to compete with left-arc when the stack is not empty. | 2015-03-27 17:39:16 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | db5a43318c | * Improve print_state debug printer | 2015-03-27 17:29:58 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1705eccbbe | * Remove whitespace | 2015-03-27 15:22:39 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3feb52374c | * Break apart a condition, for ease of debug printing | 2015-03-27 15:21:38 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b32f581acb | * Fix bug in ArcEager.get_labels | 2015-03-27 15:21:06 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1320bd19db | * Move Span class to own file | 2015-03-26 16:45:38 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e854ba0a13 | * Remove support for force_gold flag from GreedyParser, since it's not so useful, and it's clutter | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6a6085f8b9 | * Clean up GreedyParser.train function a bit | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b3157927e6 | * Clean up unused feature templates | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 411bf377d4 | * Remove dependency on ner_util module | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 01c892f583 | * Add comment to fill_context | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2741179aff | * Important bug fix: Fill token N2w, which was being unfilled, after a bad edit while writing the NER features. | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 71648205d9 | * Add support for debug feature set. Just use unigrams for this. | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3b70b304b2 | * Add words to gold_tuples from gold conll file | 2015-03-26 16:44:47 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 05d6065e2e | * Add assertion | 2015-03-26 16:44:46 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 377e9b29b1 | * Whitespace | 2015-03-26 16:44:46 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9f4ad8fdfb | * Assign root words the ROOT label via the Break transition. Something is still wrong here... | 2015-03-26 16:44:46 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f729164c01 | * Fix bug in label assignment: ensure null-label transitions receive the label 0 | 2015-03-26 16:44:46 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 31fad99518 | * Use StringStore to encode label names, instead of label_ids | 2015-03-26 16:44:45 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b9b695fb1b | * Remove debug word list | 2015-03-26 16:44:45 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1c843934be | * Fix oracle bug in NER. Now getting 77% F on ontonotes | 2015-03-26 16:44:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e181c051d5 | * Improve features for NER | 2015-03-26 16:44:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8057a95f20 | * NER seems to be working, scoring 69 F. Need to add decision-history features --- currently only use current word, 2 words context. Need refactoring. | 2015-03-26 16:44:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ae235e07b9 | * Refactoring working for parser, but now need to rig up features for NER, and then debug oracle etc. | 2015-03-26 16:44:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b3eda03c9c | * Tmp | 2015-03-26 16:44:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6b6bce9e7a | * Fix label loading for transition system | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5278c7504b | * Hacks to conll.pyx. Should clean these up. | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f321b2b2eb | * Remove TODO comment | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fdabd93bfb | * Ensure high loss for invalid moves, and fix label reading for arc-eager | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 10ed738df2 | * Tmp commit | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4f83c9b3d5 | * Make costs label-sensitive | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8c883cef58 | * Refactored transition system code now compiling. Still need to hook up label oracle, and test | 2015-03-26 16:44:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f0159ab4b6 | * Add file to hold GoldParse class | 2015-03-26 16:44:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8eadb984cb | * Refactor arc_eager to use new TransitionSystem base class. Need to fix oracle | 2015-03-26 16:44:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b063001596 | * Add base TransitionSystem class. Still need to rethink how non-monotonic labelling will work for best_valid | 2015-03-26 16:44:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dc986dbc0b | * Work on refactored parser, where TransitionSystem can be easily subclassed | 2015-03-26 16:44:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 135756ac3d | * Tmp commit of NER refactoring | 2015-03-26 16:44:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0ff078876a | * Commit some work on ner.yx done on the plane | 2015-03-26 16:44:41 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d81b7be6a2 | * Merge train.py | 2015-03-26 16:44:41 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3d0570685c | * Add NER transition system | 2015-03-26 16:44:41 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ea90d136e8 | * Fix bug in labelled parsing, that caused an 8% drop in labelled accuracy. | 2015-02-27 03:56:10 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 312b3a45f3 | * Fix issue #19: Allow parsing/pos tagging of empty strings | 2015-02-10 10:15:58 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5c3513583d | * Clear buffered python tokens when modifying the Tokens object. Need to clean this up, and modify via a method on Tokens. | 2015-02-09 03:57:10 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c55a33d045 | * Catch oracle errors | 2015-02-02 23:02:04 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d68678a93e | * Add Exception class, OracleError | 2015-02-02 11:57:32 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 88170e6295 | * Supply dep_strings as a tuple, for the changed API on Tokens | 2015-01-31 13:42:09 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0981d68022 | * Set a sent_end flag during parsing, for later use | 2015-01-31 13:41:46 +11:00 |  |