| 
							
							
								 Matthew Honnibal | 2a0615104b | * Upd download script | 2015-02-09 10:22:59 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5c3513583d | * Clear buffered python tokens when modifying the Tokens object. Need to clean this up, and modify via a method on Tokens. | 2015-02-09 03:57:10 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | be5536d239 | * Fix Issue #22: PRP and PRP$ were mapped to NOUN. Should be PRON. | 2015-02-08 18:36:18 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 44c7eafe44 | * Fix download.py | 2015-02-07 12:00:36 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6ca7f2eedc | * Upd download script | 2015-02-07 11:32:33 -05:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 56c2ef2982 | * Tweak POS features for web text | 2015-02-02 11:59:36 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a20fdbd8ee | * Upd download script | 2015-02-01 13:22:23 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 63abdf154c | * Hastily hack download file | 2015-01-31 22:48:32 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a1ed574b7b | * Fix default model path for English | 2015-01-31 16:38:27 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e013555b25 | * Add option to download script | 2015-01-31 13:51:56 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 024cfd485c | * Pass tag_strings as a tuple, to support new Tokens API | 2015-01-31 13:43:37 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 83a4df5a1a | * Fix download script | 2015-01-30 20:40:42 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6f9ebc2f34 | * Fix download script | 2015-01-30 20:33:19 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8b85d0bb8a | * Only download small data if no data dir exists | 2015-01-30 20:27:14 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cb95ef6934 | * Fix download script | 2015-01-30 19:28:43 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e578bd37bd | * Fix download script | 2015-01-30 18:59:31 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | df52014d12 | * Fix download script | 2015-01-30 18:36:24 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 998b607f65 | * Upd download script, having it download all data if there's no data/ directory, allowing easier compilation from source | 2015-01-30 18:04:01 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 67d6e53a69 | * Ensure parser and tagger function correctly when training from missing values, indicated by -1 | 2015-01-30 14:08:56 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c38c62d4a3 | * Add docstring to English class | 2015-01-27 02:45:21 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7f87716cf7 | * Fix download script | 2015-01-25 23:01:10 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 12b034e3ef | * Move POS tag definitions to parts_of_speech.pxd | 2015-01-25 16:31:07 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7431c133d8 | * Add error if try to access head and not is_parsed | 2015-01-25 15:33:54 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 951d06c824 | * Silently don't parse if data is not present | 2015-01-25 14:47:38 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4e857ab7a6 | * Fix bug in POS tagger feature | 2015-01-25 02:20:15 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dd56e298e2 | * Ensure tagging is applied if parse=True | 2015-01-25 02:19:44 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 94750819cd | * Set parse=True by default --- i.e. parse unless told not to. | 2015-01-25 01:28:28 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a97bed9359 | * Fix POS and dependency label tag names.  Add parse and string navigation functions. | 2015-01-24 17:29:04 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fda94271af | * Rename NORM1 and NORM2 attrs to lower and norm | 2015-01-24 06:17:03 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5ed8b2b98f | * Rename sic to orth | 2015-01-23 02:08:25 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f2a229136c | * Fix data_dir=None argument to English class | 2015-01-21 18:27:31 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ef49b8c179 | * Add stop-word flag | 2015-01-21 18:22:31 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6646bfc5df | * Add LOWER attr | 2015-01-21 18:19:08 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6c7e44140b | * Work on word vectors, and other stuff | 2015-01-17 16:21:17 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7d3c40de7d | * Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme | 2015-01-15 00:33:16 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0930892fc1 | * Tmp. Working on refactor. Compiles, must hook up lexical feats. | 2015-01-14 00:03:48 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 46da3d74d2 | * Tmp. Refactoring, introducing a Lexeme PyObject. | 2015-01-12 11:23:44 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ce2edd6312 | * Tmp commit. Refactoring to create a Python Lexeme class. | 2015-01-12 10:26:22 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7689dccd0f | * Remove unused import | 2015-01-05 18:48:48 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3f1944d688 | * Make PyPy work | 2015-01-05 17:54:38 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a510d9f677 | * Another assertion removed | 2015-01-05 13:01:40 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2856946a66 | * Remove assertion that doesn't work on Python 3 | 2015-01-05 12:51:16 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 94034f1112 | * Fix encoding in lemmatization | 2015-01-05 11:54:29 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b132b3caa6 | * Fix unicode error in lemmatizer | 2015-01-05 11:53:54 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 477e7fbffe | * Fix data reading for lemmatizer | 2015-01-05 06:01:32 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4e085d5166 | * Fix lemmatizer for Python3 | 2015-01-05 05:51:26 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0e4c2ba036 | * Fix loading of special morph words | 2015-01-03 23:13:00 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f5d41028b5 | * Move around data files for test release | 2015-01-03 01:59:22 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a24321b63a | * Add downloader | 2015-01-02 21:44:41 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5d9a096e2f | * Some minor clean-up after HastyModel | 2014-12-31 19:46:04 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aafaf58cbe | * Refactor _ml.Model, and finish implementing HastyModel so far not worthwhile. | 2014-12-31 19:40:59 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1a075f77ff | * Don't over-ride pre-loaded POS tags, if set by special-cases | 2014-12-30 23:26:32 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 785c7ba76a | * Embed signature on attrs | 2014-12-30 23:25:31 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 30e5805656 | * Lazy-load tagger and parser | 2014-12-30 23:25:09 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb0b00f819 | * Repurporse the Tagger class as a generic Model, wrapping thinc's interface | 2014-12-30 21:20:15 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb80937544 | * Upd docstrings | 2014-12-27 18:45:16 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b8b65903fc | * Tmp | 2014-12-24 17:42:00 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7708d0e24a | * Move lemmatizer to en dir | 2014-12-23 15:16:57 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 98eb4c0426 | * Fix path to parser model | 2014-12-23 15:09:09 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b00bc01d8c | * All tests now passing for reorg | 2014-12-23 13:18:59 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 73f200436f | * Tests passing except for morphology/lemmatization stuff | 2014-12-23 11:40:32 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cf8d26c3d2 | * POS tagger training working after reorg | 2014-12-22 08:54:47 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4c4aa2c5c9 | * Work on train | 2014-12-22 07:25:43 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 61df50b598 | * Add English-subclass POS tagger | 2014-12-21 20:59:07 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9f3f07cab6 | * Add attrs file for English | 2014-12-21 11:29:11 +11:00 |  |