| Name | Last commit | Date |
| --- | --- | --- |
| data | add data dir | 2015-11-18 11:48:55 +01:00 |
| de | access model via sputnik | 2015-12-07 06:01:28 +01:00 |
| en | fix py2/3 issue | 2016-01-16 12:44:53 +01:00 |
| fi | access model via sputnik | 2015-12-07 06:01:28 +01:00 |
| it | access model via sputnik | 2015-12-07 06:01:28 +01:00 |
| munge | * Fix Python3 problem in align_raw | 2015-07-28 16:06:53 +02:00 |
| serialize | * Rename Doc.data to Doc.c | 2015-11-04 00:17:13 +11:00 |
| syntax | * Fix merge conflict in requirements.txt | 2016-01-16 16:20:49 +01:00 |
| tests | * Set heads for test_merge_tokens, to make the test run without models | 2016-01-18 17:00:11 +01:00 |
| tokens | * Bug fix to _count_words_to_root | 2016-01-18 16:59:38 +01:00 |
| __init__.pxd | * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. | 2014-10-24 02:23:42 +11:00 |
| __init__.py | distinct load() and from_package() methods | 2016-01-16 10:00:57 +01:00 |
| about.py | add about.py, adapt setup.py | 2016-01-15 18:57:01 +01:00 |
| attrs.pxd | * Refactor symbols, so that frequency rank can be derived from the orth id of a word. | 2015-10-13 13:44:39 +11:00 |
| attrs.pyx | * Map empty string to NULL_ATTR in attrs | 2015-10-13 13:44:40 +11:00 |
| cfile.pxd | * Add cfile.pyx | 2015-07-23 01:10:36 +02:00 |
| cfile.pyx | * Fix CFile for Python2 | 2015-07-25 22:55:53 +02:00 |
| gold.pxd | * Remove unused import | 2015-07-25 18:11:16 +02:00 |
| gold.pyx | * Use io module insteads of deprecated codecs module | 2015-10-10 14:13:01 +11:00 |
| language.py | fix pickling | 2016-01-16 13:23:11 +01:00 |
| lemmatizer.py | distinct load() and from_package() methods | 2016-01-16 10:00:57 +01:00 |
| lexeme.pxd | * Fix ugly py_check_flag and py_set_flag functions in Lexeme | 2015-09-15 13:06:18 +10:00 |
| lexeme.pyx | * Add .rank property to Token and Lexeme, for frequency rank | 2015-11-08 16:18:25 +01:00 |
| matcher.pyx | untangle data_path/via | 2016-01-16 12:23:45 +01:00 |
| morphology.pxd | * Ensure Morphology can be pickled, to address Issue #125. | 2015-10-13 13:44:41 +11:00 |
| morphology.pyx | * Fix capitalization in lemmatizer | 2015-11-06 05:44:35 +11:00 |
| multi_words.py | * Fix Issue #50: Python 3 compatibility of v0.80 | 2015-04-13 05:59:43 +02:00 |
| orth.pxd | * Host IS_ flags in attrs.pxd, and add properties for them on Token and Lexeme objects | 2015-07-26 16:37:16 +02:00 |
| orth.pyx | * Fix type declaration in asciied function | 2015-10-09 13:46:57 +11:00 |
| parts_of_speech.pxd | * Fix parts_of_speech now that symbols list has been reformed | 2015-10-13 13:44:40 +11:00 |
| parts_of_speech.pyx | * Fix NAMES list in spacy/parts_of_speech.pyx | 2015-10-13 14:18:45 +11:00 |
| scorer.py | * Fix training under python3 | 2015-07-28 14:09:30 +02:00 |
| strings.pxd | * Use unicode in StringStore.intern, instead of unreliably casting to bytes. | 2015-11-05 11:32:19 +00:00 |
| strings.pyx | * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. | 2015-11-07 03:24:30 +11:00 |
| structs.pxd | * Clean up unused Constituent struct. | 2015-11-03 23:48:21 +11:00 |
| symbols.pxd | * Use lower case strings for dependency label names in symbols enum | 2015-10-13 13:44:40 +11:00 |
| symbols.pyx | * Use lower case strings for dependency label names in symbols enum | 2015-10-13 13:44:40 +11:00 |
| tagger.pxd | * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. | 2015-11-07 03:24:30 +11:00 |
| tagger.pyx | untangle data_path/via | 2016-01-16 12:23:45 +01:00 |
| tokenizer.pxd | Add __reduce__ to Tokenizer so that English pickles. | 2015-10-23 22:24:03 -07:00 |
| tokenizer.pyx | * If final token is whitespace, don't mark it as owning a trailing space. Fixes Issue #154 | 2016-01-16 17:08:59 +01:00 |
| typedefs.pxd | * Fix type declarations for attr_t. Remove unused id_t. | 2015-07-18 22:39:57 +02:00 |
| typedefs.pyx | * Move POS tag definitions to parts_of_speech.pxd | 2015-01-25 16:31:07 +11:00 |
| util.py | untangle data_path/via | 2016-01-16 12:23:45 +01:00 |
| vocab.pxd | * Start trying to pickle Vocab | 2015-10-13 13:44:41 +11:00 |
| vocab.pyx | untangle data_path/via | 2016-01-16 12:23:45 +01:00 |