Andrew Ongko 
							
						 
					 
					
						
						
						
						
							
						
						
							81564cc4e8 
							
						 
					 
					
						
						
							
							Update Indonesian model ( #2752 )  
						
						... 
						
						
						
						* adding e-KTP in tokenizer exceptions list
* add exception token
* removing lines with containing space as it won't matter since we use .split() method in the end, added new tokens in exception
* add tokenizer exceptions list
* combining base_norms with norm_exceptions
* adding norm_exception
* fix double key in lemmatizer
* remove unused import on punctuation.py
* reformat stop_words to reduce number of lines, improve readibility
* updating tokenizer exception
* implement is_currency for lang/id
* adding orth_first_upper in tokenizer_exceptions
* update the norm_exception list
* remove bunch of abbreviations
* adding contributors file 
						
					 
					
						2018-09-14 12:30:32 +02:00 
						 
				 
			
				
					
						
							
							
								Filipe Caixeta 
							
						 
					 
					
						
						
						
						
							
						
						
							fe515085f3 
							
						 
					 
					
						
						
							
							Add words to portuguese language _num_words ( #2759 )  
						
						... 
						
						
						
						* Add words to portuguese language _num_words
* Add words to portuguese language _num_words 
						
					 
					
						2018-09-14 12:30:16 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f32b52e611 
							
						 
					 
					
						
						
							
							Fix bug that caused deprojectivisation to run multiple times  
						
						
						
					 
					
						2018-09-14 12:12:54 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							8f2a6367e9 
							
						 
					 
					
						
						
							
							Fix usage of PyTorch BiLSTM in ud_train  
						
						
						
					 
					
						2018-09-13 22:54:59 +00:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							afeddfff26 
							
						 
					 
					
						
						
							
							Fix PyTorch BiLSTM  
						
						
						
					 
					
						2018-09-13 22:54:34 +00:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a26fe8e7bb 
							
						 
					 
					
						
						
							
							Small hack in Language.update to make torch work  
						
						
						
					 
					
						2018-09-13 22:51:52 +00:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							445b81ce3f 
							
						 
					 
					
						
						
							
							Support bilstm_depth argument in ud-train  
						
						
						
					 
					
						2018-09-13 19:30:22 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b43643a953 
							
						 
					 
					
						
						
							
							Support bilstm_depth option in parser  
						
						
						
					 
					
						2018-09-13 19:29:49 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							45032fe9e1 
							
						 
					 
					
						
						
							
							Support option of BiLSTM in Tok2Vec (requires pytorch)  
						
						
						
					 
					
						2018-09-13 19:28:35 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3eb9f3e2b8 
							
						 
					 
					
						
						
							
							Fix defaults for ud-train  
						
						
						
					 
					
						2018-09-13 18:05:48 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							59cf533879 
							
						 
					 
					
						
						
							
							Improve ud-train script. Make config optional  
						
						
						
					 
					
						2018-09-13 14:24:08 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3e3a309764 
							
						 
					 
					
						
						
							
							Fix tagger  
						
						
						
					 
					
						2018-09-13 14:14:38 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							da7650e84b 
							
						 
					 
					
						
						
							
							Fix maximum doc length in ud_train script  
						
						
						
					 
					
						2018-09-13 14:10:25 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							a95eea4c06 
							
						 
					 
					
						
						
							
							Fix multi-task objective for parser  
						
						
						
					 
					
						2018-09-13 14:08:55 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							21321cd6cf 
							
						 
					 
					
						
						
							
							Add tok2vec property to parser model  
						
						
						
					 
					
						2018-09-13 14:08:43 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							d6aa60139d 
							
						 
					 
					
						
						
							
							Fix tagger training on GPU  
						
						
						
					 
					
						2018-09-13 14:05:37 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							5001d31be6 
							
						 
					 
					
						
						
							
							Don't set stop word in example ( closes   #2657 ) [ci skip]  
						
						
						
					 
					
						2018-09-12 15:36:51 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4e89cfaae1 
							
						 
					 
					
						
						
							
							Fix dependency scheme docs ( closes   #2705 ) [ci skip]  
						
						
						
					 
					
						2018-09-12 15:32:26 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0729d1edca 
							
						 
					 
					
						
						
							
							Fix formatting  
						
						
						
					 
					
						2018-09-12 15:32:08 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							907df53904 
							
						 
					 
					
						
						
							
							Add multi-threading note to Language.pipe ( resolves   #2582 ) [ci skip]  
						
						
						
					 
					
						2018-09-12 15:03:30 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							885691a7ab 
							
						 
					 
					
						
						
							
							Describe converters more explicitly (see  #2643 )  
						
						
						
					 
					
						2018-09-12 14:53:03 +02:00 
						 
				 
			
				
					
						
							
							
								Grivaz 
							
						 
					 
					
						
						
						
						
							
						
						
							aeba99ab0d 
							
						 
					 
					
						
						
							
							Introduces a bulk merge function, in order to solve issue  #653  ( #2696 )  
						
						... 
						
						
						
						* Fix comment
* Introduce bulk merge to increase performance on many span merges
* Sign contributor agreement
* Implement pull request suggestions 
						
					 
					
						2018-09-10 16:41:42 +02:00 
						 
				 
			
				
					
						
							
							
								tyburam 
							
						 
					 
					
						
						
						
						
							
						
						
							476472d181 
							
						 
					 
					
						
						
							
							Lex _attrs for polish language ( #2750 )  
						
						... 
						
						
						
						* Signed spaCy contributor agreement
* Added polish version of english lex_attrs 
						
					 
					
						2018-09-10 11:53:57 +02:00 
						 
				 
			
				
					
						
							
							
								Sainath Adapa 
							
						 
					 
					
						
						
						
						
							
						
						
							77139bc03c 
							
						 
					 
					
						
						
							
							Basic support for Telugu language ( #2751 )  
						
						
						
					 
					
						2018-09-10 11:53:18 +02:00 
						 
				 
			
				
					
						
							
							
								Maxim Kupfer 
							
						 
					 
					
						
						
						
						
							
						
						
							97e2874225 
							
						 
					 
					
						
						
							
							added contributor agreement for mbkupfer ( #2738 )  
						
						
						
					 
					
						2018-09-10 11:32:03 +02:00 
						 
				 
			
				
					
						
							
							
								Maxim Kupfer 
							
						 
					 
					
						
						
						
						
							
						
						
							cebe50b5b8 
							
						 
					 
					
						
						
							
							Remove ')' for clarity ( #2737 )  
						
						... 
						
						
						
						Sorry, don't mean to be nitpicky, I just noticed this when going through the CLI and thought it was a quick fix. That said, if this was intention than please let me know. 
						
					 
					
						2018-09-10 11:31:49 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							b2cb1fc67d 
							
						 
					 
					
						
						
							
							Merge matcher tests  
						
						
						
					 
					
						2018-09-06 01:39:53 +02:00 
						 
				 
			
				
					
						
							
							
								Suraj Krishnan Rajan 
							
						 
					 
					
						
						
						
						
							
						
						
							356af7b0a1 
							
						 
					 
					
						
						
							
							Fix tests  
						
						
						
					 
					
						2018-09-06 01:39:36 +02:00 
						 
				 
			
				
					
						
							
							
								Piotr Żelasko 
							
						 
					 
					
						
						
						
						
							
						
						
							bdb2165bd1 
							
						 
					 
					
						
						
							
							Less norm computations in token similarity ( #2730 )  
						
						... 
						
						
						
						* Less norm computations in token similarity
* Contributor agreement 
						
					 
					
						2018-09-05 21:50:23 +02:00 
						 
				 
			
				
					
						
							
							
								Aniruddha Adhikary 
							
						 
					 
					
						
						
						
						
							
						
						
							4530ddcc51 
							
						 
					 
					
						
						
							
							update bengali token rules for hyphen and digits ( #2731 )  
						
						
						
					 
					
						2018-09-05 21:49:00 +02:00 
						 
				 
			
				
					
						
							
							
								Nathaniel J. Smith 
							
						 
					 
					
						
						
						
						
							
						
						
							26849874ad 
							
						 
					 
					
						
						
							
							When calling getoption() in conftest.py, pass a default option ( #2709 )  
						
						... 
						
						
						
						* When calling getoption() in conftest.py, pass a default option
This is necessary to allow testing an installed spacy by running:
  pytest --pyargs spacy
* Add contributor agreement 
						
					 
					
						2018-09-03 09:57:52 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							4d2d7d5866 
							
						 
					 
					
						
						
							
							Fix new feature flags  
						
						
						
					 
					
						2018-08-27 02:12:39 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							598dbf1ce0 
							
						 
					 
					
						
						
							
							Fix character-based tokenization for Japanese  
						
						
						
					 
					
						2018-08-27 01:51:38 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							3763e20afc 
							
						 
					 
					
						
						
							
							Pass subword_features and conv_depth params  
						
						
						
					 
					
						2018-08-27 01:51:15 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							8051136d70 
							
						 
					 
					
						
						
							
							Support subword_features and conv_depth params in Tok2Vec  
						
						
						
					 
					
						2018-08-27 01:50:48 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							9c33d4d1df 
							
						 
					 
					
						
						
							
							Add more hyper-parameters to spacy ud-train  
						
						... 
						
						
						
						* subword_features: Controls whether subword features are used in the
word embeddings. True by default (specifically, prefix, suffix and word
shape). Should be set to False for languages like Chinese and Japanese.
* conv_depth: Depth of the convolutional layers. Defaults to 4. 
						
					 
					
						2018-08-27 01:48:46 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							e9022f7b33 
							
						 
					 
					
						
						
							
							Remove docstrings for deprecated arguments (see  #2703 )  
						
						
						
					 
					
						2018-08-26 14:23:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							559f4139e3 
							
						 
					 
					
						
						
							
							Add FAC to spacy.explain ( resolves   #2706 )  
						
						
						
					 
					
						2018-08-26 14:13:50 +02:00 
						 
				 
			
				
					
						
							
							
								Steve Sharp 
							
						 
					 
					
						
						
						
						
							
						
						
							ca747f58a4 
							
						 
					 
					
						
						
							
							Update _install.jade ( #2688 )  
						
						... 
						
						
						
						Typo fix: "models" -> "model" 
						
					 
					
						2018-08-22 13:16:04 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							51a9efbf3b 
							
						 
					 
					
						
						
							
							Add draft Binder class  
						
						
						
					 
					
						2018-08-22 13:12:51 +02:00 
						 
				 
			
				
					
						
							
							
								Arya Prabhudesai 
							
						 
					 
					
						
						
						
						
							
						
						
							db2c2b286c 
							
						 
					 
					
						
						
							
							Create aryaprabhudesai.md ( #2681 )  
						
						
						
					 
					
						2018-08-20 18:56:14 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f0e6be689a 
							
						 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/explosion/spaCy  into develop  
						
						
						
					 
					
						2018-08-16 17:18:19 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							5ce459d2ee 
							
						 
					 
					
						
						
							
							Fix error in vocab  
						
						
						
					 
					
						2018-08-16 17:18:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							aeb49eb625 
							
						 
					 
					
						
						
							
							Update version [ci skip]  
						
						
						
					 
					
						2018-08-16 16:56:02 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a0eacd3293 
							
						 
					 
					
						
						
							
							Merge branch 'master' into develop  
						
						
						
					 
					
						2018-08-16 16:55:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c0fa9903f4 
							
						 
					 
					
						
						
							
							Update model directory JS [ci skip]  
						
						... 
						
						
						
						Prevent the default release URL from being overwritten and add license type 
						
					 
					
						2018-08-16 16:54:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							03f661fefb 
							
						 
					 
					
						
						
							
							Add Greek to models directory [ci skip]  
						
						
						
					 
					
						2018-08-16 16:51:56 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							00febda2e3 
							
						 
					 
					
						
						
							
							Improve alignment around quotes  
						
						
						
					 
					
						2018-08-16 01:04:34 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							66a3f2ba21 
							
						 
					 
					
						
						
							
							Lower-case text before alignment  
						
						
						
					 
					
						2018-08-16 00:42:36 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							595c893791 
							
						 
					 
					
						
						
							
							Expose noise_level option in train CLI  
						
						
						
					 
					
						2018-08-16 00:41:44 +02:00