Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d093d6343b 
							
						 
					 
					
						
						
							
							TrainablePipe ( #6213 )  
						
						... 
						
						
						
						* rename Pipe to TrainablePipe
* split functionality between Pipe and TrainablePipe
* remove unnecessary methods from certain components
* cleanup
* hasattr(component, "pipe") should be sufficient again
* remove serialization and vocab/cfg from Pipe
* unify _ensure_examples and validate_examples
* small fixes
* hasattr checks for self.cfg and self.vocab
* make is_resizable and is_trainable properties
* serialize strings.json instead of vocab
* fix KB IO + tests
* fix typos
* more typos
* _added_strings as a set
* few more tests specifically for _added_strings field
* bump to 3.0.0a36 
						
					 
					
						2020-10-08 21:33:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							064575d79d 
							
						 
					 
					
						
						
							
							Merge pull request  #6216  from svlandeg/feature/nel-initialize  
						
						
						
					 
					
						2020-10-08 11:14:12 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							43e59bb22a 
							
						 
					 
					
						
						
							
							Update docs and install extras [ci skip]  
						
						
						
					 
					
						2020-10-08 10:58:50 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							eaf5c265cb 
							
						 
					 
					
						
						
							
							set_kb method for entity_linker  
						
						
						
					 
					
						2020-10-08 10:34:01 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2fd7122074 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-06 10:31:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							568e12215d 
							
						 
					 
					
						
						
							
							Merge pull request  #6206  from svlandeg/fix/patterns-init  
						
						
						
					 
					
						2020-10-06 10:27:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							9b4cf7b0b6 
							
						 
					 
					
						
						
							
							update output of debug config command  
						
						
						
					 
					
						2020-10-06 09:47:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fd0f60e2bc 
							
						 
					 
					
						
						
							
							updates to data format for training and pretraining  
						
						
						
					 
					
						2020-10-06 09:28:53 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9ac39c88 
							
						 
					 
					
						
						
							
							read entity_ruler patterns with srsly.read_jsonl.v1  
						
						
						
					 
					
						2020-10-05 22:50:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1a554bdcb1 
							
						 
					 
					
						
						
							
							Update docs and docstring [ci skip]  
						
						
						
					 
					
						2020-10-05 21:55:27 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							919790cb47 
							
						 
					 
					
						
						
							
							Upd MultiHashEmbed docs  
						
						
						
					 
					
						2020-10-05 20:28:21 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							193e0d5a98 
							
						 
					 
					
						
						
							
							add docs for entity_ruler.initialize  
						
						
						
					 
					
						2020-10-05 18:04:08 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							65abd77779 
							
						 
					 
					
						
						
							
							add finish_update to Pipe  
						
						
						
					 
					
						2020-10-05 16:23:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							0f64556c04 
							
						 
					 
					
						
						
							
							Merge pull request  #6197  from svlandeg/feature/pipe-docs [ci skip]  
						
						
						
					 
					
						2020-10-05 11:55:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							52b660e9dc 
							
						 
					 
					
						
						
							
							initialize and update explanation  
						
						
						
					 
					
						2020-10-05 00:39:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c36a57e84 
							
						 
					 
					
						
						
							
							Update data augmenters ( #6196 )  
						
						... 
						
						
						
						* Draft lower-case augmenter
* Make warning a debug log
* Update lowercase augmenter, docs and tests
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-10-04 17:46:29 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							11347f34da 
							
						 
					 
					
						
						
							
							Tidy up, tests and docs  
						
						
						
					 
					
						2020-10-04 13:54:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							989c59918c 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-03 18:53:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							7c4ab7e82c 
							
						 
					 
					
						
						
							
							Fix Lemmatizer.get_lookups_config  
						
						
						
					 
					
						2020-10-03 17:16:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							dd542ec6a4 
							
						 
					 
					
						
						
							
							Fix label initialization of textcat component ( #6190 )  
						
						
						
					 
					
						2020-10-03 17:07:38 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							35d695a031 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-10-03 16:08:24 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							02247cccaf 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/small-fixes  
						
						
						
					 
					
						2020-10-02 20:48:11 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							09dcb75076 
							
						 
					 
					
						
						
							
							small UX fix for DocBin ( #6167 )  
						
						... 
						
						
						
						* add informative warning when messing up store_user_data DocBin flags
* add informative warning when messing up store_user_data DocBin flags
* cleanup test
* rename to patterns_path 
						
					 
					
						2020-10-02 15:43:32 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f0b30aedad 
							
						 
					 
					
						
						
							
							Make lemmatizers use initialize logic ( #6182 )  
						
						... 
						
						
						
						* Make lemmatizer use initialize logic and tidy up
* Fix typo
* Raise for uninitialized tables 
						
					 
					
						2020-10-02 15:42:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							df06f7a792 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 13:24:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d2aa662ab2 
							
						 
					 
					
						
						
							
							Merge pull request  #6179  from adrianeboyd/feature/token-morph-refactor-2 [ci skip]  
						
						
						
					 
					
						2020-10-02 12:10:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							32cdc1c4f4 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 11:38:03 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							fd09e6b140 
							
						 
					 
					
						
						
							
							Update docs for Token.morph / Token.set_morph  
						
						
						
					 
					
						2020-10-02 09:05:15 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							01c1538c72 
							
						 
					 
					
						
						
							
							Integrate file readers  
						
						
						
					 
					
						2020-10-02 01:36:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6b94cee468 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-10-02 01:11:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f2627157c8 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 17:38:17 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1328c9fd14 
							
						 
					 
					
						
						
							
							consistently use --code instead of --code-path  
						
						
						
					 
					
						2020-10-01 16:59:22 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a22215f427 
							
						 
					 
					
						
						
							
							Add FeatureExtractor from Thinc ( #6170 )  
						
						... 
						
						
						
						* move featureextractor from Thinc
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-10-01 16:22:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0a8a124a6e 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 12:15:53 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a103ab5f1a 
							
						 
					 
					
						
						
							
							Update augmenter lookups and docs  
						
						
						
					 
					
						2020-09-30 23:03:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							115481aca7 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-30 15:16:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9bb958fd0a 
							
						 
					 
					
						
						
							
							Fix debug data [ci skip]  
						
						
						
					 
					
						2020-09-29 23:07:11 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							604be54a5c 
							
						 
					 
					
						
						
							
							Support --code in evaluate CLI [ci skip]  
						
						
						
					 
					
						2020-09-29 21:20:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d3c63b7965 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/prepare  
						
						
						
					 
					
						2020-09-29 20:53:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							361f91e286 
							
						 
					 
					
						
						
							
							Merge pull request  #6135  from walterhenry/develop-proof  
						
						
						
					 
					
						2020-09-29 20:49:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b486389eec 
							
						 
					 
					
						
						
							
							Update website/docs/api/doc.md  
						
						
						
					 
					
						2020-09-29 20:48:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d7469283c5 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-29 16:59:21 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							c1c841940c 
							
						 
					 
					
						
						
							
							Merge branch 'develop-proof' of  https://github.com/walterhenry/spaCy  into develop-proof  
						
						
						
					 
					
						2020-09-29 11:47:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9a63bfbd 
							
						 
					 
					
						
						
							
							begin_training -> initialize  
						
						
						
					 
					
						2020-09-28 21:35:09 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							3360825e00 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						Another round of proofreading. All the API docs have been read through and I've grazed the Usage docs. 
						
					 
					
						2020-09-28 16:50:15 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a976da168c 
							
						 
					 
					
						
						
							
							Support data augmentation in Corpus ( #6155 )  
						
						... 
						
						
						
						* Support data augmentation in Corpus
* Note initial docs for data augmentation
* Add augmenter to quickstart
* Fix flake8
* Format
* Fix test
* Update spacy/tests/training/test_training.py
* Improve data augmentation arguments
* Update templates
* Move randomization out into caller
* Refactor
* Update spacy/training/augment.py
* Update spacy/tests/training/test_training.py
* Fix augment
* Fix test 
						
					 
					
						2020-09-28 03:03:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f29d5b9b89 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-27 18:39:38 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							009ba14aaf 
							
						 
					 
					
						
						
							
							Fix pretraining in train script ( #6143 )  
						
						... 
						
						
						
						* update pretraining API in train CLI
* bump thinc to 8.0.0a35
* bump to 3.0.0a26
* doc fixes
* small doc fix 
						
					 
					
						2020-09-25 15:47:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2aa4d65734 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-24 20:41:09 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c062b3911 
							
						 
					 
					
						
						
							
							Add MORPH handling to Matcher ( #6107 )  
						
						... 
						
						
						
						* Add MORPH handling to Matcher
* Add `MORPH` to `Matcher` schema
* Rename `_SetMemberPredicate` to `_SetPredicate`
* Add `ISSUBSET` and `ISSUPERSET` operators to `_SetPredicate`
  * Add special handling for normalization and conversion of morph
    values into sets
  * For other attrs, `ISSUBSET` acts like `IN` and `ISSUPERSET` only
    matches for 0 or 1 values
* Update test
* Rename to IS_SUBSET and IS_SUPERSET 
						
					 
					
						2020-09-24 16:55:09 +02:00