Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							097ab9c6e4 
							
						 
					 
					
						
						
							
							Fix transition system to/from disk  
						
						 
						
						
						
					 
					
						2017-05-31 13:44:00 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							ff26aa6c37 
							
						 
					 
					
						
						
							
							Work on to/from bytes/disk serialization methods  
						
						 
						
						
						
					 
					
						2017-05-29 11:45:45 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							7996d21717 
							
						 
					 
					
						
						
							
							Fixes for new StringStore  
						
						 
						
						
						
					 
					
						2017-05-28 11:09:27 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							84e66ca6d4 
							
						 
					 
					
						
						
							
							WIP on stringstore change. 27 failures  
						
						 
						
						
						
					 
					
						2017-05-28 14:06:40 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							7ebd26b8aa 
							
						 
					 
					
						
						
							
							Use ordered dict to specify transitions  
						
						 
						
						
						
					 
					
						2017-05-27 15:52:20 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3d5a536eaa 
							
						 
					 
					
						
						
							
							Improve efficiency of parser batching  
						
						 
						
						
						
					 
					
						2017-05-26 11:31:23 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e2136232f9 
							
						 
					 
					
						
						
							
							Exclude states with no matching gold annotations from parsing  
						
						 
						
						
						
					 
					
						2017-05-22 10:30:12 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							8b04b0af9f 
							
						 
					 
					
						
						
							
							Remove freqs from transition_system  
						
						 
						
						
						
					 
					
						2017-05-20 02:20:48 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a9edb3aa1d 
							
						 
					 
					
						
						
							
							Improve integration of NN parser, to support unified training API  
						
						 
						
						
						
					 
					
						2017-05-15 21:53:27 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							45464d065e 
							
						 
					 
					
						
						
							
							Remove print statement  
						
						 
						
						
						
					 
					
						2017-04-15 16:11:43 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								ines 
							
						 
					 
					
						
						
						
						
							
						
						
							0739ae7b76 
							
						 
					 
					
						
						
							
							Tidy up and fix formatting and imports  
						
						 
						
						
						
					 
					
						2017-04-15 13:05:15 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							354458484c 
							
						 
					 
					
						
						
							
							WIP on add_label bug during NER training  
						
						 
						
						... 
						
						
						
						Currently when a new label is introduced to NER during training,
it causes the labels to be read in in an unexpected order. This
invalidates the model. 
						
					 
					
						2017-04-14 23:52:17 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c90dc7ac29 
							
						 
					 
					
						
						
							
							Clean up state initiatisation in transition system  
						
						 
						
						
						
					 
					
						2017-03-16 11:59:11 -05:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							931feb3360 
							
						 
					 
					
						
						
							
							Allow beam parsing for NER  
						
						 
						
						
						
					 
					
						2017-03-11 11:12:01 -06:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							708ea22208 
							
						 
					 
					
						
						
							
							Infer types in transition_system.pyx  
						
						 
						
						
						
					 
					
						2016-10-27 18:08:13 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							508fd1f6dc 
							
						 
					 
					
						
						
							
							* Refactor noun chunk iterators, so that they're simple functions. Install the iterator when the Doc is created, but allow users to write to the noun_chunk_iterator attribute. The iterator functions accept an object and yield (int start, int end, int label) triples.  
						
						 
						
						
						
					 
					
						2016-05-02 14:25:10 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							bcf8f7ba40 
							
						 
					 
					
						
						
							
							* Add a parse_batch method to Parser, that releases the GIL around a batch of documents.  
						
						 
						
						
						
					 
					
						2016-02-01 08:34:55 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							28e5ad62bc 
							
						 
					 
					
						
						
							
							* Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents  
						
						 
						
						
						
					 
					
						2016-02-01 03:00:15 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a47f00901b 
							
						 
					 
					
						
						
							
							* Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents  
						
						 
						
						
						
					 
					
						2016-02-01 02:58:14 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							10877a7791 
							
						 
					 
					
						
						
							
							* Update for thinc 5.0, including changing cost from int to weight_t, and updating the tagger and parser  
						
						 
						
						
						
					 
					
						2016-01-30 14:31:36 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							04d0686b26 
							
						 
					 
					
						
						
							
							* Make TransitionSystem.add_action idempotent, i.e. ignore duplicate added actions.  
						
						 
						
						
						
					 
					
						2016-01-19 20:10:04 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							151aa0b0e2 
							
						 
					 
					
						
						
							
							* Allow users to add_label, in order to extend the entity recogniser to new classes. Does not by itself add a class to the model  
						
						 
						
						
						
					 
					
						2016-01-19 19:09:33 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							20fd36a0f7 
							
						 
					 
					
						
						
							
							* Very scrappy, likely buggy first-cut pickle implementation, to work on Issue  #125 : allow pickle for Apache Spark. The current implementation sends stuff to temp files, and does almost nothing to ensure all modifiable state is actually preserved. The Language() instance is a deep tree of extension objects, and if pickling during training, some of the C-data state is hard to preserve.  
						
						 
						
						
						
					 
					
						2015-10-13 13:44:41 +11:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							cc9deae960 
							
						 
					 
					
						
						
							
							* Add is_valid method to transition_system  
						
						 
						
						
						
					 
					
						2015-08-08 23:36:18 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a8bbd7312c 
							
						 
					 
					
						
						
							
							* Hackishly patch long dependencies problem  
						
						 
						
						
						
					 
					
						2015-07-28 00:14:29 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							bb583f7f09 
							
						 
					 
					
						
						
							
							* Hackishly patch long dependencies problem  
						
						 
						
						
						
					 
					
						2015-07-27 23:14:33 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							12699a1152 
							
						 
					 
					
						
						
							
							* Set initial freqs, to avoid missing values in serializer  
						
						 
						
						
						
					 
					
						2015-07-23 01:16:27 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							317cbbc015 
							
						 
					 
					
						
						
							
							* Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time.  
						
						 
						
						
						
					 
					
						2015-07-19 15:18:17 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e29daea85f 
							
						 
					 
					
						
						
							
							* Fix bint/int typing problem in TransitionSystem. In C++ bint* means bool*, but in C it means int*. So, type-casting to bint* is unsafe.  
						
						 
						
						
						
					 
					
						2015-07-17 22:37:24 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							9a8db9743c 
							
						 
					 
					
						
						
							
							* Remove gil from parser.call  
						
						 
						
						
						
					 
					
						2015-07-14 23:47:33 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							75aeccc064 
							
						 
					 
					
						
						
							
							* Rejig parser interface to use new thinc.api.Example class, in prep of theano model. Comment out beam search  
						
						 
						
						
						
					 
					
						2015-06-28 11:02:34 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							04b1cd9b8c 
							
						 
					 
					
						
						
							
							* Greedy parsing working with new StateClass. Beam parsing broken  
						
						 
						
						
						
					 
					
						2015-06-10 04:20:23 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							d68c686ec1 
							
						 
					 
					
						
						
							
							* Move StateClass into interface of transition functions  
						
						 
						
						
						
					 
					
						2015-06-10 01:35:28 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							4b98b3e9c8 
							
						 
					 
					
						
						
							
							* Cost functions now take StateClass argument, instead of State*.  
						
						 
						
						
						
					 
					
						2015-06-10 00:40:43 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e0cf61f591 
							
						 
					 
					
						
						
							
							* Move StateClass into the interface for is_valid  
						
						 
						
						
						
					 
					
						2015-06-09 23:23:28 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							0895d454fb 
							
						 
					 
					
						
						
							
							* Prepare to switch to using state class, instead of state struct  
						
						 
						
						
						
					 
					
						2015-06-09 21:20:14 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							6bf35cecc3 
							
						 
					 
					
						
						
							
							* Refactor transition system to use classes with staticmethods.  
						
						 
						
						
						
					 
					
						2015-06-05 02:27:17 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							079dad28a7 
							
						 
					 
					
						
						
							
							* Update for faster beam training  
						
						 
						
						
						
					 
					
						2015-06-04 19:32:32 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a513ec500f 
							
						 
					 
					
						
						
							
							* Have oracle functions take a struct instead of a Python object  
						
						 
						
						
						
					 
					
						2015-06-02 20:01:06 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							0786d9b3c7 
							
						 
					 
					
						
						
							
							* Refactor TransitionSystem, adding set_valid method  
						
						 
						
						
						
					 
					
						2015-06-02 18:38:07 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c7876aa8b6 
							
						 
					 
					
						
						
							
							* Add get_valid method  
						
						 
						
						
						
					 
					
						2015-06-01 23:06:00 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b3fd48c97b 
							
						 
					 
					
						
						
							
							* Fix missing root labels bug identified in Issue  #57  
						
						 
						
						
						
					 
					
						2015-04-28 20:45:51 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3feb52374c 
							
						 
					 
					
						
						
							
							* Break apart a condition, for ease of debug printing  
						
						 
						
						
						
					 
					
						2015-03-27 15:21:38 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f729164c01 
							
						 
					 
					
						
						
							
							* Fix bug in label assignment: ensure null-label transitions receive the label 0  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:46 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							31fad99518 
							
						 
					 
					
						
						
							
							* Use StringStore to encode label names, instead of label_ids  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:45 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							8057a95f20 
							
						 
					 
					
						
						
							
							* NER seems to be working, scoring 69 F. Need to add decision-history features --- currently only use current word, 2 words context. Need refactoring.  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:44 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							ae235e07b9 
							
						 
					 
					
						
						
							
							* Refactoring working for parser, but now need to rig up features for NER, and then debug oracle etc.  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:44 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b3eda03c9c 
							
						 
					 
					
						
						
							
							* Tmp  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:44 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							6b6bce9e7a 
							
						 
					 
					
						
						
							
							* Fix label loading for transition system  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:43 +01:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							10ed738df2 
							
						 
					 
					
						
						
							
							* Tmp commit  
						
						 
						
						
						
					 
					
						2015-03-26 16:44:43 +01:00