Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							81a47c01d8 
							
						 
					 
					
						
						
							
							Fix test for empty sentence string.  
						
						
						
					 
					
						2016-09-27 19:21:22 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							fc4a7ad794 
							
						 
					 
					
						
						
							
							Test and fix Issue  #411 : IndexError when .sents property is used on empty string.  
						
						
						
					 
					
						2016-09-27 18:49:14 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3d370b7d45 
							
						 
					 
					
						
						
							
							Add test for Issue  #445 , fixed in  3cb4d455d, with improved lemmatizer logic  
						
						
						
					 
					
						2016-09-27 18:39:46 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							9c8ac91d72 
							
						 
					 
					
						
						
							
							Add test for Issue  #435  
						
						
						
					 
					
						2016-09-27 13:52:38 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e233328d38 
							
						 
					 
					
						
						
							
							Fix Issue  #371 : Lexeme objects were unhashable.  
						
						
						
					 
					
						2016-09-27 13:22:30 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							2debc4e0a2 
							
						 
					 
					
						
						
							
							Add .blank() method to Parser. Start housing default dep labels and entity types within the Defaults class.  
						
						
						
					 
					
						2016-09-26 11:57:54 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							95aaea0d3f 
							
						 
					 
					
						
						
							
							Refactor so that the tokenizer data is read from Python data, rather than from disk  
						
						
						
					 
					
						2016-09-25 14:49:53 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							fd65cf6cbb 
							
						 
					 
					
						
						
							
							Finish refactoring data loading  
						
						
						
					 
					
						2016-09-24 20:26:17 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							83e364188c 
							
						 
					 
					
						
						
							
							Mostly finished loading refactoring. Design is in place, but doesn't work yet.  
						
						
						
					 
					
						2016-09-24 15:42:01 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b00f683a0c 
							
						 
					 
					
						
						
							
							Fix matcher test  
						
						
						
					 
					
						2016-09-24 11:20:58 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							939a791a52 
							
						 
					 
					
						
						
							
							Update tests  
						
						
						
					 
					
						2016-09-24 01:17:03 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f6e587b1c7 
							
						 
					 
					
						
						
							
							Fix matcher tests  
						
						
						
					 
					
						2016-09-21 20:45:20 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							58e83fe34b 
							
						 
					 
					
						
						
							
							Initial, limited support for quantified patterns in Matcher, and tracking of ent_id attribute in Token and Span. The quantifiers need a lot more testing, and there are some known problems. The main known problem is that the zero-plus and one-plus quantifiers won't work if a token can match both the quantified pattern expression AND the tail of the match.  
						
						
						
					 
					
						2016-09-21 14:54:55 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							cc8bf62208 
							
						 
					 
					
						
						
							
							* Fix Issue  #360 : Tokenizer failed when the infix regex matched the start of the string while trying to tokenize multi-infix tokens.  
						
						
						
					 
					
						2016-05-09 13:23:47 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							5d86c30f0b 
							
						 
					 
					
						
						
							
							* Fix Issue  #367 : Missing has_vector property on Doc and Span objects  
						
						
						
					 
					
						2016-05-09 12:36:14 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							26095f9722 
							
						 
					 
					
						
						
							
							* Add span.sent property, re Issue  #366  
						
						
						
					 
					
						2016-05-06 00:17:38 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a6a25166ba 
							
						 
					 
					
						
						
							
							* Remove print from test  
						
						
						
					 
					
						2016-05-05 11:10:59 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							7441ca30ee 
							
						 
					 
					
						
						
							
							* Add tests for Issue  #361 : Lexeme rich comparison  
						
						
						
					 
					
						2016-05-05 01:31:58 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							72564213e3 
							
						 
					 
					
						
						
							
							* Add test for Issue  #309  
						
						
						
					 
					
						2016-05-04 16:00:28 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							76f1d871da 
							
						 
					 
					
						
						
							
							Merge branch 'master' of ssh://github.com/spacy-io/spaCy  
						
						
						
					 
					
						2016-05-04 15:54:00 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b4bfc6ae55 
							
						 
					 
					
						
						
							
							* Add test for Issue  #351 : Indices off when leading whitespace  
						
						
						
					 
					
						2016-05-04 15:53:17 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							a06fca9fdf 
							
						 
					 
					
						
						
							
							German noun chunk iterator now doesn't return tokens more than once  
						
						
						
					 
					
						2016-05-03 16:58:59 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							7825b75548 
							
						 
					 
					
						
						
							
							add tests for German noun chunker  
						
						
						
					 
					
						2016-05-03 15:01:28 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							7b246c13cb 
							
						 
					 
					
						
						
							
							reformulate noun chunk tests for English  
						
						
						
					 
					
						2016-05-03 14:24:35 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							1786331cd8 
							
						 
					 
					
						
						
							
							add model sanity test  
						
						
						
					 
					
						2016-05-03 12:51:47 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							308a28c26c 
							
						 
					 
					
						
						
							
							* Whitespace  
						
						
						
					 
					
						2016-05-02 16:08:11 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c1c11a8ae0 
							
						 
					 
					
						
						
							
							* Fix formatting on serializer tests  
						
						
						
					 
					
						2016-05-02 16:07:21 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							902a389d85 
							
						 
					 
					
						
						
							
							* Fix merge conflict in test_parse  
						
						
						
					 
					
						2016-05-02 15:28:07 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							02c23cc1d0 
							
						 
					 
					
						
						
							
							* Fix sentence boundary test  
						
						
						
					 
					
						2016-05-02 15:26:07 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							d2f469b809 
							
						 
					 
					
						
						
							
							* Fix parsing tests, so that labels are added if they're missing, and so that the branching test values are correct  
						
						
						
					 
					
						2016-05-02 15:25:27 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							b11cbb06c6 
							
						 
					 
					
						
						
							
							remove old tests for sentence boundary detection  
						
						
						
					 
					
						2016-05-02 14:36:35 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							508fd1f6dc 
							
						 
					 
					
						
						
							
							* Refactor noun chunk iterators, so that they're simple functions. Install the iterator when the Doc is created, but allow users to write to the noun_chunk_iterator attribute. The iterator functions accept an object and yield (int start, int end, int label) triples.  
						
						
						
					 
					
						2016-05-02 14:25:10 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							fa961ea694 
							
						 
					 
					
						
						
							
							add tests for serialization bug  
						
						
						
					 
					
						2016-05-02 11:01:56 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							1003e7ccec 
							
						 
					 
					
						
						
							
							remove debug output from tests  
						
						
						
					 
					
						2016-04-25 12:12:40 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							f57f843e85 
							
						 
					 
					
						
						
							
							fix bug in updating tree structure when introducing additional roots  
						
						
						
					 
					
						2016-04-25 12:01:19 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							b6477fc4f4 
							
						 
					 
					
						
						
							
							adjusted tests to Travis Setup  
						
						
						
					 
					
						2016-04-21 17:15:10 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							736ffcb9a2 
							
						 
					 
					
						
						
							
							remove whitespace  
						
						
						
					 
					
						2016-04-21 16:55:55 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							6c7301cc6d 
							
						 
					 
					
						
						
							
							the parser now introduces sentence boundaries properly when predicting dependents with root labels  
						
						
						
					 
					
						2016-04-21 16:50:53 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							12024b0b0a 
							
						 
					 
					
						
						
							
							bugfix: introducing multiple roots now updates original head's properties  
						
						... 
						
						
						
						adjust tests to rely less on statistical model 
						
					 
					
						2016-04-20 16:42:41 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							2add5206aa 
							
						 
					 
					
						
						
							
							* Fix description of matcher test  
						
						
						
					 
					
						2016-04-17 15:40:21 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							2b419d5b8c 
							
						 
					 
					
						
						
							
							* Update test for Issue  #242  
						
						
						
					 
					
						2016-04-17 15:34:23 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f12b043308 
							
						 
					 
					
						
						
							
							* Add test for Issue  #242 : Overlapping matches not well recognised.  
						
						
						
					 
					
						2016-04-17 15:19:17 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c0909afe22 
							
						 
					 
					
						
						
							
							Merge pull request  #312  from wbwseeker/space_head_bug  
						
						... 
						
						
						
						add restrictions to L-arc and R-arc to prevent space heads 
						
					 
					
						2016-04-15 20:36:03 +10:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							6f82065761 
							
						 
					 
					
						
						
							
							* Fix infixed commas in tokenizer, re Issue  #326 . Need to benchmark on empirical data, to make sure this doesn't break other cases.  
						
						
						
					 
					
						2016-04-14 11:36:03 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							0f957dd586 
							
						 
					 
					
						
						
							
							Merge branch 'master' of ssh://github.com/honnibal/spaCy  
						
						
						
					 
					
						2016-04-14 10:37:56 +02:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							d99a9cbce9 
							
						 
					 
					
						
						
							
							different handling of space tokens  
						
						... 
						
						
						
						space tokens are now always attached to the previous non-space token
there are two exceptions:
leading space tokens are attached to the first following non-space token
in input that consists exclusively of space tokens, the last space token
is the head of all others. 
						
					 
					
						2016-04-13 15:28:28 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							04d0209be9 
							
						 
					 
					
						
						
							
							* Recognise multiple infixes in a token.  
						
						
						
					 
					
						2016-04-13 18:38:26 +10:00 
						 
				 
			
				
					
						
							
							
								Henning Peters 
							
						 
					 
					
						
						
						
						
							
						
						
							a473d6e937 
							
						 
					 
					
						
						
							
							fix tests (use english model)  
						
						
						
					 
					
						2016-04-12 16:41:57 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							6df3858dbc 
							
						 
					 
					
						
						
							
							* Fix Issue  #323 : Incorrect semantics of Token.__str__ built-in. Add flag to allow users to switch the old semantics back on, to ease transition.  
						
						
						
					 
					
						2016-04-12 13:17:59 +10:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							80bea62842 
							
						 
					 
					
						
						
							
							bugfix in unit test  
						
						
						
					 
					
						2016-04-08 16:46:44 +02:00