Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b24e8be2b9
							
						
					 | 
					
						
						
							
							* Whitespace in docstring
						
						
						
						
						
					 | 
					
						2015-07-08 12:37:03 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							abc43b852d
							
						
					 | 
					
						
						
							
							* Add pos_tags attr to Vocab.
						
						
						
						
						
					 | 
					
						2015-07-08 12:36:38 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							935bcdf3e5
							
						
					 | 
					
						
						
							
							* Remove redundant tag_names argument to Tokenizer
						
						
						
						
						
					 | 
					
						2015-07-08 12:36:04 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ff885e8511
							
						
					 | 
					
						
						
							
							* Add ParserFactory convenience function
						
						
						
						
						
					 | 
					
						2015-07-08 12:35:46 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							4e4fac452b
							
						
					 | 
					
						
						
							
							* Refactor __init__ for simplicity. Allow parse=True, tag=True etc flags to be passed at top-level. Do not lazy-load parser.
						
						
						
						
						
					 | 
					
						2015-07-08 12:35:29 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1d2deb4616
							
						
					 | 
					
						
						
							
							* Work on refactoring default arguments to English.__init__
						
						
						
						
						
					 | 
					
						2015-07-07 15:53:25 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							2d0e99a096
							
						
					 | 
					
						
						
							
							* Pass pos_tags into Tokenizer.from_dir
						
						
						
						
						
					 | 
					
						2015-07-07 14:23:08 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6788c86b2f
							
						
					 | 
					
						
						
							
							* Begin refactor
						
						
						
						
						
					 | 
					
						2015-07-07 14:00:07 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							52fd80c6c6
							
						
					 | 
					
						
						
							
							* Add experimental supersense features for parsing, based on lookup into wordnet.
						
						
						
						
						
					 | 
					
						2015-07-01 20:12:44 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e6d828a9af
							
						
					 | 
					
						
						
							
							* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.
						
						
						
						
						
					 | 
					
						2015-07-01 20:12:13 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							2b8459d9a8
							
						
					 | 
					
						
						
							
							* Add senses flag to Lexeme
						
						
						
						
						
					 | 
					
						2015-07-01 20:10:41 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e23d1582a2
							
						
					 | 
					
						
						
							
							* Add supersense data to Lexeme objects. Add simple has_sense method to check the flag.
						
						
						
						
						
					 | 
					
						2015-07-01 18:50:37 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							64fafa98be
							
						
					 | 
					
						
						
							
							* Add senses.pyx and senses.pxd
						
						
						
						
						
					 | 
					
						2015-07-01 18:49:44 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							94dab94e5f
							
						
					 | 
					
						
						
							
							uerge branch 'master' of https://github.com/honnibal/spaCy
						
						
						
						
						
					 | 
					
						2015-06-30 18:16:26 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							9af86b0b0b
							
						
					 | 
					
						
						
							
							* Fix attrs.pxd
						
						
						
						
						
					 | 
					
						2015-06-30 18:16:30 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							af9c82f7a6
							
						
					 | 
					
						
						
							
							Merge branch 'master' of https://github.com/honnibal/spaCy
						
						
						
						
						
					 | 
					
						2015-06-30 18:11:37 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5d595b5a8c
							
						
					 | 
					
						
						
							
							* Inc versions
						
						
						
						
						
					 | 
					
						2015-06-30 18:11:06 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d2eeba6667
							
						
					 | 
					
						
						
							
							* Start wiring up color and emotion lexicons. Hopefully we get to use them.
						
						
						
						
						
					 | 
					
						2015-06-30 16:22:23 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							3bb5876c5a
							
						
					 | 
					
						
						
							
							* Inline methods in StateClass
						
						
						
						
						
					 | 
					
						2015-06-29 01:10:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a02fd3af5d
							
						
					 | 
					
						
						
							
							* Check valency in L and R feature methods, to make feaure calculation faster
						
						
						
						
						
					 | 
					
						2015-06-29 00:27:56 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							bbef71f213
							
						
					 | 
					
						
						
							
							* Fix min function in fill_context
						
						
						
						
						
					 | 
					
						2015-06-28 10:46:39 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							142b6f9510
							
						
					 | 
					
						
						
							
							* Revert last changes
						
						
						
						
						
					 | 
					
						2015-06-28 10:44:28 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b06962f18b
							
						
					 | 
					
						
						
							
							* Pad buffers in state
						
						
						
						
						
					 | 
					
						2015-06-28 10:36:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							53be72387c
							
						
					 | 
					
						
						
							
							* Hack at fill_context to investigate performance loss
						
						
						
						
						
					 | 
					
						2015-06-28 10:34:28 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							71a4e876a9
							
						
					 | 
					
						
						
							
							* Fix parse features
						
						
						
						
						
					 | 
					
						2015-06-28 09:27:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0c4b5a2bb0
							
						
					 | 
					
						
						
							
							* Start scoring tokens
						
						
						
						
						
					 | 
					
						2015-06-28 06:21:38 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5af500909c
							
						
					 | 
					
						
						
							
							* Remove unused directve from parser.pyx
						
						
						
						
						
					 | 
					
						2015-06-28 06:20:21 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d5b4090705
							
						
					 | 
					
						
						
							
							* Add profile directive
						
						
						
						
						
					 | 
					
						2015-06-28 06:19:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							2b5421e60c
							
						
					 | 
					
						
						
							
							* Add profile directive
						
						
						
						
						
					 | 
					
						2015-06-28 06:07:04 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8b5de4a411
							
						
					 | 
					
						
						
							
							* Add word / tag / label sets, for use in neural net
						
						
						
						
						
					 | 
					
						2015-06-28 05:46:53 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							cfcbd8d256
							
						
					 | 
					
						
						
							
							* Fix punctuation eval in scorer.py
						
						
						
						
						
					 | 
					
						2015-06-28 01:31:39 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							b266a63f2c
							
						
					 | 
					
						
						
							
							* Inc version of downloadble data
						
						
						
						
						
					 | 
					
						2015-06-24 04:53:08 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							02b171ee67
							
						
					 | 
					
						
						
							
							* Bug fixes to edge calculation
						
						
						
						
						
					 | 
					
						2015-06-24 04:28:02 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							7f9384f53c
							
						
					 | 
					
						
						
							
							* Remove deprecated _state module
						
						
						
						
						
					 | 
					
						2015-06-23 17:28:24 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6dbe182491
							
						
					 | 
					
						
						
							
							* Fix merge conflicts
						
						
						
						
						
					 | 
					
						2015-06-23 17:28:00 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							579735a095
							
						
					 | 
					
						
						
							
							* Remove import of _state module
						
						
						
						
						
					 | 
					
						2015-06-23 17:25:08 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							88f55d136b
							
						
					 | 
					
						
						
							
							* Remove deprecated _state module
						
						
						
						
						
					 | 
					
						2015-06-23 17:19:51 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							9ab9dd2bf7
							
						
					 | 
					
						
						
							
							* Clean up unused orig_arc_eager and tree_arc_eager modules, which were only added for EMNLP experiments
						
						
						
						
						
					 | 
					
						2015-06-23 17:17:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							7ebfe4b983
							
						
					 | 
					
						
						
							
							* Fixes to edge features
						
						
						
						
						
					 | 
					
						2015-06-23 16:32:54 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							7b125f5a86
							
						
					 | 
					
						
						
							
							* Fixes to edge features
						
						
						
						
						
					 | 
					
						2015-06-23 16:31:01 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8d4bbacfc5
							
						
					 | 
					
						
						
							
							* Fix edge navigation in Token objects
						
						
						
						
						
					 | 
					
						2015-06-23 16:07:34 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							35c290bee4
							
						
					 | 
					
						
						
							
							* Fix edge features
						
						
						
						
						
					 | 
					
						2015-06-23 15:50:56 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							221e2e485f
							
						
					 | 
					
						
						
							
							* Assign 'ROOT' as label, not 'root'
						
						
						
						
						
					 | 
					
						2015-06-23 15:09:54 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a7bf7b0626
							
						
					 | 
					
						
						
							
							* Rename sent_start to sent_end, to reflect its new usage in the Break transition
						
						
						
						
						
					 | 
					
						2015-06-23 05:39:43 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ee3e56f27b
							
						
					 | 
					
						
						
							
							* Fix bounds checking on entities
						
						
						
						
						
					 | 
					
						2015-06-23 04:35:08 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							43ef5ddea5
							
						
					 | 
					
						
						
							
							* Ensure root albel is spelled ROOT, for backwards compatibility
						
						
						
						
						
					 | 
					
						2015-06-23 04:14:03 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							065c2e1d2d
							
						
					 | 
					
						
						
							
							* Add some bounds checking around state arrays
						
						
						
						
						
					 | 
					
						2015-06-23 04:13:09 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							89ae218b75
							
						
					 | 
					
						
						
							
							* Add import to tokens.pyx from weird Cython compiler issue with casting from memory views
						
						
						
						
						
					 | 
					
						2015-06-23 03:04:34 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							f01b3d043e
							
						
					 | 
					
						
						
							
							* Add padding to arrays in stateclass. May be papering over a deeper bug.
						
						
						
						
						
					 | 
					
						2015-06-23 03:03:41 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5e94b5d581
							
						
					 | 
					
						
						
							
							* Have Tokens return proper numpy arrays, not Cython views.
						
						
						
						
						
					 | 
					
						2015-06-23 00:07:34 +02:00 | 
					
					
						
						
							
							
							
						
					 |