Matthew Honnibal | 20fd36a0f7 | * Very scrappy, likely buggy first-cut pickle implementation, to work on Issue #125: allow pickle for Apache Spark. The current implementation sends stuff to temp files, and does almost nothing to ensure all modifiable state is actually preserved. The Language() instance is a deep tree of extension objects, and if pickling during training, some of the C-data state is hard to preserve. | 2015-10-13 13:44:41 +11:00 |
Matthew Honnibal | 534e3dda3c | * More work on language independent parsing | 2015-08-28 03:44:54 +02:00 |
Matthew Honnibal | c2307fa9ee | * More work on language-generic parsing | 2015-08-28 02:02:33 +02:00 |
Matthew Honnibal | 5b89e2454c | * Improve error-reporting in tagger | 2015-08-27 10:26:36 +02:00 |
Matthew Honnibal | 0af139e183 | * Tagger training now working. Still need to test load/save of model. Morphology still broken. | 2015-08-27 09:16:11 +02:00 |
Matthew Honnibal | b4faf551f5 | * Refactor language-independent tagger class | 2015-08-26 19:19:21 +02:00 |
Matthew Honnibal | 5dd76be446 | * Split EnPosTagger up into base class and subclass | 2015-08-24 05:25:55 +02:00 |
Matthew Honnibal | aac5028b6e | * Move tagger to _ml | 2014-12-30 21:21:38 +11:00 |
Matthew Honnibal | 73f200436f | * Tests passing except for morphology/lemmatization stuff | 2014-12-23 11:40:32 +11:00 |
Matthew Honnibal | cf8d26c3d2 | * POS tagger training working after reorg | 2014-12-22 08:54:47 +11:00 |
Matthew Honnibal | 1879abd16a | * Set const-correctness for tagger | 2014-12-18 20:41:52 +11:00 |
Matthew Honnibal | a432862fde | * Add exception type to _arg_max_among in tagger | 2014-12-16 09:44:19 +11:00 |
Matthew Honnibal | 42973c4b37 | * Improve efficiency of tagger, and improve morphological processing | 2014-12-10 01:02:04 +11:00 |
Matthew Honnibal | 6b34a2f34b | * Move morphological analysis into its own module, morphology.pyx | 2014-12-09 21:16:17 +11:00 |
Matthew Honnibal | 99bbbb6feb | * Work on morphological processing | 2014-12-08 21:12:15 +11:00 |
Matthew Honnibal | c20dd79748 | * Fiddle with const correctness and comments | 2014-12-08 00:03:55 +11:00 |
Matthew Honnibal | ef4398b204 | * Rearrange POS stuff, so that language-specific stuff can live in language-specific modules | 2014-12-07 23:52:41 +11:00 |
Matthew Honnibal | 327383e38a | * Remove unused code in tagger.pyx | 2014-12-07 22:16:17 +11:00 |
Matthew Honnibal | 3819a88e1b | * Add support for tag dictionary, and fix error-code for predict method | 2014-12-07 22:07:16 +11:00 |
Matthew Honnibal | 0c7aeb9de7 | * Begin revising tagger, focussing on POS tagging | 2014-12-07 15:29:04 +11:00 |
Matthew Honnibal | f307eb2e36 | * Refactor context extraction, and start breaking out gold standards into their own functions | 2014-11-09 15:43:07 +11:00 |
Matthew Honnibal | 602f993af9 | * Moving tagger to accept multiple correct answers | 2014-11-09 15:18:33 +11:00 |
Matthew Honnibal | 949a6245f9 | * Increase default number of iterations from 5 to 10 | 2014-11-07 04:42:04 +11:00 |
Matthew Honnibal | 4ecbe8c893 | * Complete refactor of Tagger features, to use a generic list of context names. | 2014-11-05 20:45:29 +11:00 |
Matthew Honnibal | 3733444101 | * Generalize tagger code, in preparation for NER and supersense tagging. | 2014-11-05 03:42:14 +11:00 |
Matthew Honnibal | abbe3e44b0 | * Move spacy.pos tagger to spacy.tagger, and generalize it so that it can take on other tagging tasks, given a different set of feature templates. | 2014-11-05 00:37:59 +11:00 |