Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a3af6b7c3d
							
						
					 | 
					
						
						
							
							* Left-Arc from Root, to allow non-monotonic reduce to compete with left-arc when the stack is not empty.
						
						
						
						
						
					 | 
					
						2015-03-27 17:39:16 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							db5a43318c
							
						
					 | 
					
						
						
							
							* Improve print_state debug printer
						
						
						
						
						
					 | 
					
						2015-03-27 17:29:58 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1705eccbbe
							
						
					 | 
					
						
						
							
							* Remove whitespace
						
						
						
						
						
					 | 
					
						2015-03-27 15:22:39 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							3feb52374c
							
						
					 | 
					
						
						
							
							* Break apart a condition, for ease of debug printing
						
						
						
						
						
					 | 
					
						2015-03-27 15:21:38 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b32f581acb
							
						
					 | 
					
						
						
							
							* Fix bug in ArcEager.get_labels
						
						
						
						
						
					 | 
					
						2015-03-27 15:21:06 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1320bd19db
							
						
					 | 
					
						
						
							
							* Move Span class to own file
						
						
						
						
						
					 | 
					
						2015-03-26 16:45:38 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e854ba0a13
							
						
					 | 
					
						
						
							
							* Remove support for force_gold flag from GreedyParser, since it's not so useful, and it's clutter
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6a6085f8b9
							
						
					 | 
					
						
						
							
							* Clean up GreedyParser.train function a bit
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b3157927e6
							
						
					 | 
					
						
						
							
							* Clean up unused feature templates
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							411bf377d4
							
						
					 | 
					
						
						
							
							* Remove dependency on ner_util module
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							01c892f583
							
						
					 | 
					
						
						
							
							* Add comment to fill_context
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							2741179aff
							
						
					 | 
					
						
						
							
							* Important bug fix: Fill token N2w, which was being unfilled, after a bad edit while writing the NER features.
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							71648205d9
							
						
					 | 
					
						
						
							
							* Add support for debug feature set. Just use unigrams for this.
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							3b70b304b2
							
						
					 | 
					
						
						
							
							* Add words to gold_tuples from gold conll file
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:47 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							05d6065e2e
							
						
					 | 
					
						
						
							
							* Add assertion
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:46 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							377e9b29b1
							
						
					 | 
					
						
						
							
							* Whitespace
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:46 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							9f4ad8fdfb
							
						
					 | 
					
						
						
							
							* Assign root words the ROOT label via the Break transition. Something is still wrong here...
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:46 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							f729164c01
							
						
					 | 
					
						
						
							
							* Fix bug in label assignment: ensure null-label transitions receive the label 0
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:46 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							31fad99518
							
						
					 | 
					
						
						
							
							* Use StringStore to encode label names, instead of label_ids
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:45 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b9b695fb1b
							
						
					 | 
					
						
						
							
							* Remove debug word list
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:45 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1c843934be
							
						
					 | 
					
						
						
							
							* Fix oracle bug in NER. Now getting 77% F on ontonotes
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:44 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e181c051d5
							
						
					 | 
					
						
						
							
							* Improve features for NER
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:44 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8057a95f20
							
						
					 | 
					
						
						
							
							* NER seems to be working, scoring 69 F. Need to add decision-history features --- currently only use current word, 2 words context. Need refactoring.
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:44 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ae235e07b9
							
						
					 | 
					
						
						
							
							* Refactoring working for parser, but now need to rig up features for NER, and then debug oracle etc.
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:44 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b3eda03c9c
							
						
					 | 
					
						
						
							
							* Tmp
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:44 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6b6bce9e7a
							
						
					 | 
					
						
						
							
							* Fix label loading for transition system
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5278c7504b
							
						
					 | 
					
						
						
							
							* Hacks to conll.pyx. Should clean these up.
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							f321b2b2eb
							
						
					 | 
					
						
						
							
							* Remove TODO comment
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							fdabd93bfb
							
						
					 | 
					
						
						
							
							* Ensure high loss for invalid moves, and fix label reading for arc-eager
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							10ed738df2
							
						
					 | 
					
						
						
							
							* Tmp commit
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							4f83c9b3d5
							
						
					 | 
					
						
						
							
							* Make costs label-sensitive
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8c883cef58
							
						
					 | 
					
						
						
							
							* Refactored transition system code now compiling. Still need to hook up label oracle, and test
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:43 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							f0159ab4b6
							
						
					 | 
					
						
						
							
							* Add file to hold GoldParse class
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:42 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8eadb984cb
							
						
					 | 
					
						
						
							
							* Refactor arc_eager to use new TransitionSystem base class. Need to fix oracle
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:42 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b063001596
							
						
					 | 
					
						
						
							
							* Add base TransitionSystem class. Still need to rethink how non-monotonic labelling will work for best_valid
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:42 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							dc986dbc0b
							
						
					 | 
					
						
						
							
							* Work on refactored parser, where TransitionSystem can be easily subclassed
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:42 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							135756ac3d
							
						
					 | 
					
						
						
							
							* Tmp commit of NER refactoring
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:42 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0ff078876a
							
						
					 | 
					
						
						
							
							* Commit some work on ner.yx done on the plane
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:41 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d81b7be6a2
							
						
					 | 
					
						
						
							
							* Merge train.py
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:41 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							3d0570685c
							
						
					 | 
					
						
						
							
							* Add NER transition system
						
						
						
						
						
					 | 
					
						2015-03-26 16:44:41 +01:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ea90d136e8
							
						
					 | 
					
						
						
							
							* Fix bug in labelled parsing, that caused an 8% drop in labelled accuracy.
						
						
						
						
						
					 | 
					
						2015-02-27 03:56:10 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							312b3a45f3
							
						
					 | 
					
						
						
							
							* Fix issue #19: Allow parsing/pos tagging of empty strings
						
						
						
						
						
					 | 
					
						2015-02-10 10:15:58 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5c3513583d
							
						
					 | 
					
						
						
							
							* Clear buffered python tokens when modifying the Tokens object. Need to clean this up, and modify via a method on Tokens.
						
						
						
						
						
					 | 
					
						2015-02-09 03:57:10 -05:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							c55a33d045
							
						
					 | 
					
						
						
							
							* Catch oracle errors
						
						
						
						
						
					 | 
					
						2015-02-02 23:02:04 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d68678a93e
							
						
					 | 
					
						
						
							
							* Add Exception class, OracleError
						
						
						
						
						
					 | 
					
						2015-02-02 11:57:32 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							88170e6295
							
						
					 | 
					
						
						
							
							* Supply dep_strings as a tuple, for the changed API on Tokens
						
						
						
						
						
					 | 
					
						2015-01-31 13:42:09 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0981d68022
							
						
					 | 
					
						
						
							
							* Set a sent_end flag during parsing, for later use
						
						
						
						
						
					 | 
					
						2015-01-31 13:41:46 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0f95712189
							
						
					 | 
					
						
						
							
							* Improve accuracy reporting during training
						
						
						
						
						
					 | 
					
						2015-01-30 18:05:06 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							67d6e53a69
							
						
					 | 
					
						
						
							
							* Ensure parser and tagger function correctly when training from missing values, indicated by -1
						
						
						
						
						
					 | 
					
						2015-01-30 14:08:56 +11:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ebf7d2fab1
							
						
					 | 
					
						
						
							
							* Use non-joint sbd, for more simplicity and fewer classes
						
						
						
						
						
					 | 
					
						2015-01-29 06:22:03 +11:00 | 
					
					
						
						
							
							
							
						
					 |