Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							44160cd52f 
							
						 
					 
					
						
						
							
							Tidy up [ci skip]  
						
						
						
					 
					
						2020-10-01 10:41:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							381258b75b 
							
						 
					 
					
						
						
							
							Merge pull request  #6165  from explosion/feature/update-tokenizers-initialize  
						
						
						
					 
					
						2020-10-01 09:49:47 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							6787e56315 
							
						 
					 
					
						
						
							
							print debugging warning before raising error if model not properly initialized  
						
						
						
					 
					
						2020-10-01 09:21:00 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							5121972930 
							
						 
					 
					
						
						
							
							add types of Tok2Vec embedding layers  
						
						
						
					 
					
						2020-10-01 09:20:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4b6afd3611 
							
						 
					 
					
						
						
							
							Remove English [initialize] default block for now to get tests to pass  
						
						
						
					 
					
						2020-09-30 23:49:29 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6f29f68f69 
							
						 
					 
					
						
						
							
							Update errors and make Tokenizer.initialize args less strict  
						
						
						
					 
					
						2020-09-30 23:48:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a103ab5f1a 
							
						 
					 
					
						
						
							
							Update augmenter lookups and docs  
						
						
						
					 
					
						2020-09-30 23:03:47 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							5128298964 
							
						 
					 
					
						
						
							
							Add missing augmenter  
						
						
						
					 
					
						2020-09-30 20:18:45 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							59294e91aa 
							
						 
					 
					
						
						
							
							Restore the 'jsonl' arg for init vectors  
						
						... 
						
						
						
						The lexemes.jsonl file is still used in our English vectors, and it may
be required by users as well. I think it's worth supporting the option. 
						
					 
					
						2020-09-30 19:06:50 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							c379a4274a 
							
						 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/explosion/spaCy  into develop  
						
						
						
					 
					
						2020-09-30 16:52:42 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							e58dca3028 
							
						 
					 
					
						
						
							
							Add read_labels  
						
						
						
					 
					
						2020-09-30 16:52:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							115481aca7 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-30 15:16:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							23c63eefaf 
							
						 
					 
					
						
						
							
							Tidy up env vars [ci skip]  
						
						
						
					 
					
						2020-09-30 15:15:11 +02:00 
						 
				 
			
				
					
						
							
							
								Elijah Rippeth 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							4cbb954281 
							
						 
					 
					
						
						
							
							reorder so tagmap is replaced only if a custom file is provided. ( #6164 )  
						
						... 
						
						
						
						* reorder so tagmap is replaced only if a custom file is provided.
* Remove unneeded variable initialization
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com> 
						
					 
					
						2020-09-30 13:26:06 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							6b7bb32834 
							
						 
					 
					
						
						
							
							Refactor Chinese initialization  
						
						
						
					 
					
						2020-09-30 11:46:45 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							1c65b3b2c0 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						A few more small things in Usage. 
						
					 
					
						2020-09-30 11:33:40 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							469f0e539c 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-09-30 10:24:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							34f9c26c62 
							
						 
					 
					
						
						
							
							Add lexeme norm defaults  
						
						
						
					 
					
						2020-09-30 10:20:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a5debb356d 
							
						 
					 
					
						
						
							
							Tidy up and adjust logging [ci skip]  
						
						
						
					 
					
						2020-09-30 01:22:08 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							56a2f778c4 
							
						 
					 
					
						
						
							
							Add logging [ci skip]  
						
						
						
					 
					
						2020-09-30 01:08:55 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							95b2a448cf 
							
						 
					 
					
						
						
							
							Update lookups data pin [ci skip]  
						
						
						
					 
					
						2020-09-30 00:24:42 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							fe3f111c37 
							
						 
					 
					
						
						
							
							Merge pull request  #6168  from explosion/fix/default-corpus-values  
						
						
						
					 
					
						2020-09-30 00:24:02 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b799af16de 
							
						 
					 
					
						
						
							
							Don't raise in Pipe.initialize if not implemented  
						
						
						
					 
					
						2020-09-30 00:05:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							7d04ba20c0 
							
						 
					 
					
						
						
							
							Update Thinc  
						
						
						
					 
					
						2020-09-30 00:05:17 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							bc61691f6f 
							
						 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/explosion/spaCy  into develop  
						
						
						
					 
					
						2020-09-29 23:41:04 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f52249fe2e 
							
						 
					 
					
						
						
							
							Fix data augmentation  
						
						
						
					 
					
						2020-09-29 23:40:54 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							14c4da547f 
							
						 
					 
					
						
						
							
							Try to fix augmentation  
						
						
						
					 
					
						2020-09-29 23:08:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ae51843468 
							
						 
					 
					
						
						
							
							Remove augmenter from jinja template [ci skip]  
						
						
						
					 
					
						2020-09-29 23:08:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9bb958fd0a 
							
						 
					 
					
						
						
							
							Fix debug data [ci skip]  
						
						
						
					 
					
						2020-09-29 23:07:11 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a2aa1f6882 
							
						 
					 
					
						
						
							
							Disable the OVL augmentation by default  
						
						
						
					 
					
						2020-09-29 23:02:40 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							df8dd91b6f 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into fix/default-corpus-values  
						
						
						
					 
					
						2020-09-29 22:55:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0a1ee109db 
							
						 
					 
					
						
						
							
							Remove init form path  
						
						
						
					 
					
						2020-09-29 22:53:18 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ad6d40d028 
							
						 
					 
					
						
						
							
							Add logging  
						
						
						
					 
					
						2020-09-29 22:53:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c334a7d45f 
							
						 
					 
					
						
						
							
							Remove  
						
						
						
					 
					
						2020-09-29 22:38:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1aeef3bfbb 
							
						 
					 
					
						
						
							
							Make corpus paths default to None and improve errors  
						
						
						
					 
					
						2020-09-29 22:33:46 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0250bcf6a3 
							
						 
					 
					
						
						
							
							Show validation error during init  
						
						
						
					 
					
						2020-09-29 22:29:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							da30bae8a6 
							
						 
					 
					
						
						
							
							Use __pyx_vtable__ instead of __reduce_cython__  
						
						
						
					 
					
						2020-09-29 22:04:17 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							43c92ec8c9 
							
						 
					 
					
						
						
							
							Resolve dir for better output [ci skip]  
						
						
						
					 
					
						2020-09-29 22:01:04 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							fa47f87924 
							
						 
					 
					
						
						
							
							Tidy up and auto-format  
						
						
						
					 
					
						2020-09-29 21:39:28 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							604be54a5c 
							
						 
					 
					
						
						
							
							Support --code in evaluate CLI [ci skip]  
						
						
						
					 
					
						2020-09-29 21:20:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6467a560e3 
							
						 
					 
					
						
						
							
							WIP: Test updating Chinese tokenizer  
						
						
						
					 
					
						2020-09-29 21:10:22 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4f3102d09c 
							
						 
					 
					
						
						
							
							Auto-format  
						
						
						
					 
					
						2020-09-29 21:09:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							798040bc1d 
							
						 
					 
					
						
						
							
							Fix language detection  
						
						
						
					 
					
						2020-09-29 21:08:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							78021089f9 
							
						 
					 
					
						
						
							
							Merge pull request  #6160  from explosion/feature/prepare  
						
						
						
					 
					
						2020-09-29 20:55:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c3f8c09d7d 
							
						 
					 
					
						
						
							
							Merge pull request  #6154  from adrianeboyd/bugfix/chinese-tokenizer-pickle  
						
						
						
					 
					
						2020-09-29 20:54:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d3c63b7965 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/prepare  
						
						
						
					 
					
						2020-09-29 20:53:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							361f91e286 
							
						 
					 
					
						
						
							
							Merge pull request  #6135  from walterhenry/develop-proof  
						
						
						
					 
					
						2020-09-29 20:49:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b486389eec 
							
						 
					 
					
						
						
							
							Update website/docs/api/doc.md  
						
						
						
					 
					
						2020-09-29 20:48:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2be80379ec 
							
						 
					 
					
						
						
							
							Fix small issues, resolve_dot_names and debug model  
						
						
						
					 
					
						2020-09-29 20:38:35 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a4da3120b4 
							
						 
					 
					
						
						
							
							Fix multitasks  
						
						
						
					 
					
						2020-09-29 18:33:16 +02:00