Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							7adbd7a785 
							
						 
					 
					
						
						
							
							replace Counter with normal dict  
						
						
						
					 
					
						2016-03-03 21:36:27 +01:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							1ae487a4f6 
							
						 
					 
					
						
						
							
							add backwards compatibility with python 2.6  
						
						
						
					 
					
						2016-03-03 21:18:12 +01:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							72b8df0684 
							
						 
					 
					
						
						
							
							turned PseudoProjectivity into a normal python class  
						
						
						
					 
					
						2016-03-03 19:05:08 +01:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							690c5acabf 
							
						 
					 
					
						
						
							
							adjust train.py to train both english and german models  
						
						
						
					 
					
						2016-03-03 15:21:00 +01:00 
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							3448cb40a4 
							
						 
					 
					
						
						
							
							integrated pseudo-projective parsing into parser  
						
						
						
						
						- nonproj.pyx holds a class PseudoProjectivity, which currently holds
						  all functionality needed to implement Nivre & Nilsson 2005's
						  pseudo-projective parsing using the HEAD decoration scheme
						- changed lefts/rights in Token to account for possible non-projective
						  structures
						
					 
					
						2016-03-01 10:09:08 +01:00 
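
As a rough illustration of the property that pseudo-projective parsing works around: an arc is non-projective when some token lying between the head and the dependent is not a descendant of the head. The sketch below only detects such arcs; it does not perform the lifting or the HEAD label decoration. It assumes the tree is given as a list of absolute head indices with the root pointing to itself, and the function names are hypothetical, not taken from nonproj.pyx.

```python
def ancestors(heads, i):
    """Yield the ancestors of token i, walking up until the root (or a cycle)."""
    seen = set()
    while heads[i] != i and i not in seen:
        seen.add(i)
        i = heads[i]
        yield i


def is_nonproj_arc(heads, dep):
    """Return True if the arc from heads[dep] to dep is non-projective."""
    head = heads[dep]
    if head == dep:
        return False  # the root has no incoming arc in this encoding
    lo, hi = sorted((head, dep))
    # Every token strictly between head and dependent must be dominated by head.
    for k in range(lo + 1, hi):
        if head not in ancestors(heads, k):
            return True
    return False


def is_nonproj_tree(heads):
    """Return True if any arc in the tree is non-projective."""
    return any(is_nonproj_arc(heads, dep) for dep in range(len(heads)))
```

For example, with `heads = [2, 3, 2, 2]` the arc from token 3 to token 1 spans the root (token 2), which token 3 does not dominate, so the tree counts as non-projective under this definition.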
						 
				 
			
				
					
						
							
							
								Wolfgang Seeker 
							
						 
					 
					
						
						
						
						
							
						
						
							56b7210e82 
							
						 
					 
					
						
						
							
							moved nonproj.py to syntax/nonproj.pyx  
						
						
						
					 
					
						2016-02-25 15:08:49 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							1b83cb9dfa 
							
						 
					 
					
						
						
							
							* Fix Issue #251: Incorrect right edge calculation on left-clobber low in the tree
						
						
						
					 
					
						2016-02-07 00:00:42 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							4412a70dc5 
							
						 
					 
					
						
						
							
							* Initialize StateC._empty_token to 0, to avoid undefined behaviour.  
						
						
						
					 
					
						2016-02-06 13:34:38 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							1b41f868d2 
							
						 
					 
					
						
						
							
							* Check for errors in parser, and parallelise the left-over batch  
						
						
						
					 
					
						2016-02-06 10:06:30 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							165ca28b80 
							
						 
					 
					
						
						
							
							* Set is_parsed flag in Parser.pipe  
						
						
						
					 
					
						2016-02-05 19:51:44 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							bdd579db0a 
							
						 
					 
					
						
						
							
							* Set is_parsed flag in Parser.pipe  
						
						
						
					 
					
						2016-02-05 19:50:11 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b04c9aad71 
							
						 
					 
					
						
						
							
							* Fix off-by-one in Parser.pipe  
						
						
						
					 
					
						2016-02-05 19:37:50 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							048dfe35aa 
							
						 
					 
					
						
						
							
							* cimport cython.parallel  
						
						
						
					 
					
						2016-02-05 12:20:42 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							8a13cebdcc 
							
						 
					 
					
						
						
							
							* Update for modified thinc interface  
						
						
						
					 
					
						2016-02-05 11:44:39 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							84b247ef83 
							
						 
					 
					
						
						
							
							* Add a .pipe method, that takes a stream of input, operates on it, and streams the output. Internally, the stream may be buffered, to allow multi-threading.  
						
						
						
					 
					
						2016-02-03 02:10:58 +01:00 
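
The buffered streaming pattern described in this commit message can be sketched as a generator that collects the input into batches, processes each batch, and yields results in order. This is a minimal illustration under stated assumptions, not the Parser.pipe implementation: `process_batch` is a hypothetical callable standing in for whatever work is done per batch (possibly multi-threaded), and the `batch_size` default is arbitrary.

```python
from itertools import islice


def pipe(stream, process_batch, batch_size=1000):
    """Consume `stream` lazily, buffering up to `batch_size` items at a time.

    `process_batch` (hypothetical) takes a list of items and returns an
    iterable of results in the same order.
    """
    iterator = iter(stream)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            break  # input exhausted
        # The final batch may be shorter than batch_size; it is still processed.
        for result in process_batch(batch):
            yield result
```

Because the function is a generator, nothing is read from the input until the caller starts iterating, and at most one batch is held in the buffer at a time.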
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e3db39dd21 
							
						 
					 
					
						
						
							
							* Fix compiler warning about signed/unsigned comparison  
						
						
						
					 
					
						2016-02-01 09:08:07 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b3802562d6 
							
						 
					 
					
						
						
							
							Merge branch 'rethinc2' of https://github.com/honnibal/spaCy into rethinc2
						
						
						
					 
					
						2016-02-01 08:59:24 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							4b08a3fafd 
							
						 
					 
					
						
						
							
							* Fix merge conflict  
						
						
						
					 
					
						2016-02-01 08:58:18 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							5188f6d9d8 
							
						 
					 
					
						
						
							
							* Fix parseC function  
						
						
						
					 
					
						2016-02-01 08:48:48 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							bcf8f7ba40 
							
						 
					 
					
						
						
							
							* Add a parse_batch method to Parser, that releases the GIL around a batch of documents.  
						
						
						
					 
					
						2016-02-01 08:34:55 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							d5579cd0d8 
							
						 
					 
					
						
						
							
							Merge branch 'rethinc2' of https://github.com/honnibal/spaCy into rethinc2
						
						
						
					 
					
						2016-02-01 03:08:49 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							490ba65398 
							
						 
					 
					
						
						
							
							* Use openmp in parser  
						
						
						
					 
					
						2016-02-01 03:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							cb78d91ec5 
							
						 
					 
					
						
						
							
							* Fix ArcEager.set_valid  
						
						
						
					 
					
						2016-02-01 03:07:37 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							28e5ad62bc 
							
						 
					 
					
						
						
							
							* Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents  
						
						
						
					 
					
						2016-02-01 03:00:15 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a47f00901b 
							
						 
					 
					
						
						
							
							* Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents  
						
						
						
					 
					
						2016-02-01 02:58:14 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							daaad66448 
							
						 
					 
					
						
						
							
							* Now fully proxied  
						
						
						
					 
					
						2016-02-01 02:37:08 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							7a0e3bb9c1 
							
						 
					 
					
						
						
							
							* Continue proxying. Some problem currently  
						
						
						
					 
					
						2016-02-01 02:22:21 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							2169bbb7ea 
							
						 
					 
					
						
						
							
							* Shadow StateClass with StateC, to start proxying  
						
						
						
					 
					
						2016-02-01 01:16:14 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							2fa228458e 
							
						 
					 
					
						
						
							
							* Add _state file, which StateClass will proxy to  
						
						
						
					 
					
						2016-02-01 01:09:21 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							9410e74c92 
							
						 
					 
					
						
						
							
							* Switch parser to use nogil functions  
						
						
						
					 
					
						2016-01-30 20:27:07 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							10877a7791 
							
						 
					 
					
						
						
							
							* Update for thinc 5.0, including changing cost from int to weight_t, and updating the tagger and parser  
						
						
						
					 
					
						2016-01-30 14:31:36 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							84c5dfbfc3 
							
						 
					 
					
						
						
							
							* Clean up debugging python list  
						
						
						
					 
					
						2016-01-19 20:10:32 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							04d0686b26 
							
						 
					 
					
						
						
							
							* Make TransitionSystem.add_action idempotent, i.e. ignore duplicate added actions.  
						
						
						
					 
					
						2016-01-19 20:10:04 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							65c5bc4988 
							
						 
					 
					
						
						
							
							* Add add_label method, to allow users to register new entity types and dependency labels.  
						
						
						
					 
					
						2016-01-19 19:11:02 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							151aa0b0e2 
							
						 
					 
					
						
						
							
							* Allow users to add_label, in order to extend the entity recogniser to new classes. Does not by itself add a class to the model  
						
						
						
					 
					
						2016-01-19 19:09:33 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c8e0011ebc 
							
						 
					 
					
						
						
							
							* Add iterators to the NER and parser transition systems, to get the action types  
						
						
						
					 
					
						2016-01-19 19:07:43 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							04177debd0 
							
						 
					 
					
						
						
							
							* Unwind limit to sentence boundary detection that prevents it from inserting boundaries on whitespace. Replace it with a check for whitespace in StateClass.fast_forward, so that whitespace is LeftArced when it's on the stack. This should prevent the previous problem of whitespace-only sentences. Should fix Issue #184, but may cause further problems. Needs testing.
						
						
						
					 
					
						2016-01-19 02:54:15 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3dc398b727 
							
						 
					 
					
						
						
							
							* Fix merge conflict in requirements.txt  
						
						
						
					 
					
						2016-01-16 16:20:49 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c025a0c64b 
							
						 
					 
					
						
						
							
							* Check for KeyboardInterrupt in parser.__call__
						
						
						
					 
					
						2016-01-16 16:18:44 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							aec130af56 
							
						 
					 
					
						
						
							
							Use util.Package class for io  
						
						
						
						
						The previous Sputnik integration caused an API change: Vocab, Tagger, etc.
were loaded via a from_package classmethod that required a
sputnik.Package instance. This forced users to first create a
sputnik.Sputnik() instance in order to acquire a Package via
sp.pool().

Instead I've created a small file-system shim, util.Package, which
allows classes to have a .load() classmethod that accepts either
util.Package objects or strings. We can later gut the internals
of this and make it a proxy for Sputnik if we need more functionality
that should live in the Sputnik library.

Sputnik is now only used to download and install the data, in
spacy.en.download
						
					 
					
						2015-12-29 18:00:48 +01:00 
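
The design described in this commit message, a `.load()` classmethod that accepts either a package object or a plain string, might look roughly like the sketch below. Everything here is an assumption made for illustration: the `Package` class, its `file_path` method, the `get_package` helper, and `ExampleComponent` are hypothetical stand-ins, not the actual util.Package API.

```python
import os


class Package(object):
    """Hypothetical file-system shim: wraps a data directory."""

    def __init__(self, data_path):
        self.data_path = data_path

    def file_path(self, *parts):
        # Resolve a path relative to the package's data directory.
        return os.path.join(self.data_path, *parts)


def get_package(via):
    """Accept either a Package or a string path, and return a Package."""
    if isinstance(via, Package):
        return via
    return Package(via)


class ExampleComponent(object):
    """Hypothetical stand-in for a class such as Vocab or Tagger."""

    def __init__(self, path):
        self.path = path

    @classmethod
    def load(cls, via):
        package = get_package(via)
        return cls(package.file_path('vocab'))
```

With this shape, `ExampleComponent.load('data')` and `ExampleComponent.load(Package('data'))` resolve to the same path, so callers need not construct a package object themselves.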
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							5623242b3e 
							
						 
					 
					
						
						
							
							* Adjust NER rules, so that U entries in gazetteer don't become B moves to the model  
						
						
						
					 
					
						2015-11-12 04:48:23 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							44fbdc7260 
							
						 
					 
					
						
						
							
							* Fix bug in NER transition system, that sometimes left no valid moves  
						
						
						
					 
					
						2015-11-08 16:19:12 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e92371bb54 
							
						 
					 
					
						
						
							
							* Fix rule that made Last action invalid if there was a preset of O, since if the entity is already open, that ship has sailed.  
						
						
						
					 
					
						2015-11-08 22:17:51 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							6f47074214 
							
						 
					 
					
						
						
							
							* Make constructor of ParserModel and TaggerModel the same as AveragedPerceptron, for easy pickling.
						
						
						
					 
					
						2015-11-07 18:25:17 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							1cfa20fb17 
							
						 
					 
					
						
						
							
							* Fix sentence-final whitespace issue  
						
						
						
					 
					
						2015-11-07 17:34:46 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							888c05a7fa 
							
						 
					 
					
						
						
							
							* Fix variable naming in StepwiseState, for thinc 4.0  
						
						
						
					 
					
						2015-11-07 11:02:44 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							fc2185bfe3 
							
						 
					 
					
						
						
							
							* Fix variable naming in StepwiseState, for thinc 4.0  
						
						
						
					 
					
						2015-11-07 10:48:31 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							954442a807 
							
						 
					 
					
						
						
							
							* Fix variable naming in StepwiseState, for thinc 4.0  
						
						
						
					 
					
						2015-11-07 10:30:45 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							af70dc166a 
							
						 
					 
					
						
						
							
							* Fix Last restriction, that was supposed to prevent conflicts with presets, but was incorrect.  
						
						
						
					 
					
						2015-11-07 09:52:00 +11:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a06e3c8963 
							
						 
					 
					
						
						
							
							* Fix bone-headed mistake in StateClass.E  
						
						
						
					 
					
						2015-11-07 07:35:28 +11:00