Matthew Honnibal | ab61673edd | * Fix api of array method | 2014-12-23 15:18:48 +11:00
Matthew Honnibal | ed0ff63c09 | * Compile attrs and parser in setup | 2014-12-23 15:18:20 +11:00
Matthew Honnibal | 9dda8b4500 | * Play with examples in index.rst | 2014-12-23 15:17:56 +11:00
Matthew Honnibal | 7708d0e24a | * Move lemmatizer to en dir | 2014-12-23 15:16:57 +11:00
Matthew Honnibal | 98eb4c0426 | * Fix path to parser model | 2014-12-23 15:09:09 +11:00
Matthew Honnibal | b00bc01d8c | * All tests now passing for reorg | 2014-12-23 13:18:59 +11:00
Matthew Honnibal | 73f200436f | * Tests passing except for morphology/lemmatization stuff | 2014-12-23 11:40:32 +11:00
Matthew Honnibal | cf8d26c3d2 | * POS tagger training working after reorg | 2014-12-22 08:54:47 +11:00
Matthew Honnibal | 4c4aa2c5c9 | * Work on train | 2014-12-22 07:25:43 +11:00
Matthew Honnibal | 4d4d2c0db4 | * Upd test | 2014-12-21 21:05:28 +11:00
Matthew Honnibal | d047dc0d0f | Upd lemmatizer test | 2014-12-21 21:02:44 +11:00
Matthew Honnibal | b864f0e539 | * Upd iteration test | 2014-12-21 21:01:46 +11:00
Matthew Honnibal | 61df50b598 | * Add English-subclass POS tagger | 2014-12-21 20:59:07 +11:00
Matthew Honnibal | c1ab134159 | * Upd lemmas test | 2014-12-21 20:58:21 +11:00
Matthew Honnibal | 82bd57c76f | * Upd intern test | 2014-12-21 20:44:21 +11:00
Matthew Honnibal | 734d1da55c | * Upd emoticons test | 2014-12-21 20:43:27 +11:00
Matthew Honnibal | 199025609f | * Upd contractions test | 2014-12-21 20:41:13 +11:00
Matthew Honnibal | 0d9972f4b0 | * Upd tokenizer test | 2014-12-21 20:38:27 +11:00
Matthew Honnibal | 69e3a07fa1 | * More index.rst fiddling | 2014-12-21 17:40:12 +11:00
Matthew Honnibal | 9f3f07cab6 | * Add attrs file for English | 2014-12-21 11:29:11 +11:00
Matthew Honnibal | 2a89d70429 | * Add vocab.pyx to setup, and ensure we can import spacy.en.lang | 2014-12-21 06:03:53 +11:00
Matthew Honnibal | b34a1325d3 | * Everything compiling after reorg. About to start testing. | 2014-12-21 05:42:23 +11:00
Matthew Honnibal | e1c1a4b868 | * Tmp | 2014-12-21 05:36:29 +11:00
Matthew Honnibal | d11c1edf8c | * Import slice_unicode from strings.pyx | 2014-12-20 07:56:26 +11:00
Matthew Honnibal | be1bdcbd85 | * Move lang.pyx to tokenizer.pyx | 2014-12-20 07:55:40 +11:00
Matthew Honnibal | 89a1cc1a48 | * Move murmurhash to .pxd in strings file | 2014-12-20 07:41:08 +11:00
Matthew Honnibal | d5a942c4a4 | * Rename lang.pyx to tokenizer.pyx | 2014-12-20 07:30:39 +11:00
Matthew Honnibal | a60ae261ae | * Move tokenizer to its own file, and refactor | 2014-12-20 07:29:16 +11:00
Matthew Honnibal | 867a4a000c | * Export set_morph_from_dict function | 2014-12-20 07:28:27 +11:00
Matthew Honnibal | 4e30195c6d | * Refactor morphology.pyx | 2014-12-20 07:27:28 +11:00
Matthew Honnibal | 4c6ce7ee84 | * Update tokens.pyx as part of reorg | 2014-12-20 07:03:26 +11:00
Matthew Honnibal | 116f7f3bc1 | * Rename Lexicon to Vocab, and move it to its own file | 2014-12-20 06:54:03 +11:00
Matthew Honnibal | 780cbd68b1 | * Move all struct definitions to structs.pxd, to avoid circular dependencies | 2014-12-20 06:51:33 +11:00
Matthew Honnibal | f6556d8e5d | * Refactor, move Lexeme struct to structs.pxd | 2014-12-20 06:51:03 +11:00
Matthew Honnibal | 7d48bba6c4 | * Move StringStore class to its own file | 2014-12-20 06:42:01 +11:00
Matthew Honnibal | e15b9da7db | * Pin preshed to a particular version | 2014-12-20 04:01:32 +11:00
Matthew Honnibal | ed2fff6128 | * Add tests | 2014-12-20 03:51:25 +11:00
Matthew Honnibal | b066102d2d | * Remove POS cache for now | 2014-12-20 03:49:58 +11:00
Matthew Honnibal | ff252dd535 | * Clean up 'guess_cache' idea, which didnt work well enough | 2014-12-20 03:49:11 +11:00
Matthew Honnibal | 9d3ca13909 | * Start work on parse-tree iteration classes | 2014-12-20 03:48:10 +11:00
Matthew Honnibal | bed680c632 | * Remove commented-out features | 2014-12-20 03:47:32 +11:00
Matthew Honnibal | 3d178c03ae | * Prune the features a bit | 2014-12-20 02:46:14 +11:00
Matthew Honnibal | a0408e1758 | * Working DecisionMemory class | 2014-12-20 01:43:26 +11:00
Matthew Honnibal | 7920ea72b4 | * Working parser with the decision memory idea. Disabling that for now, for simplicity | 2014-12-20 01:43:15 +11:00
Matthew Honnibal | a2f2a48da9 | * Add some extra features | 2014-12-20 01:42:24 +11:00
Matthew Honnibal | 8fd9762d91 | * Start laying out parse tree iteration methods | 2014-12-20 01:42:09 +11:00
Matthew Honnibal | 53b8bc1f3c | * Work on implementing a trainable cache for the parser. So far, doesn't improve efficiency | 2014-12-19 09:30:50 +11:00
Matthew Honnibal | 033d6c9ac2 | * Adapt POS tagger decision-memory for use in parser | 2014-12-19 07:23:04 +11:00
Matthew Honnibal | 809ddf7887 | * Add index.pxd | 2014-12-19 07:23:00 +11:00
Matthew Honnibal | 1879abd16a | * Set const-correctness for tagger | 2014-12-18 20:41:52 +11:00