Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							75d9019343 
							
						 
					 
					
						
						
							
							Fix types of Tok2Vec encoding architectures ( #6442 )  
						
						... 
						
						
						
						* fix TorchBiLSTMEncoder documentation
* ensure the types of the encoding Tok2vec layers are correct
* update references from v1 to v2 for the new architectures 
						
					 
					
						2021-01-07 16:39:27 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							82ae95267a 
							
						 
					 
					
						
						
							
							Docs for pretrain architectures ( #6605 )  
						
						... 
						
						
						
						* document pretraining architectures
* formatting
* bit more info
* small fixes 
						
					 
					
						2021-01-06 16:12:30 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							afc5714d32 
							
						 
					 
					
						
						
							
							multi-label textcat component ( #6474 )  
						
						... 
						
						
						
						* multi-label textcat component
* formatting
* fix comment
* cleanup
* fix from #6481 
* random edit to push the tests
* add explicit error when textcat is called with multi-label gold data
* fix error nr
* small fix 
						
					 
					
						2021-01-06 13:07:14 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							6f83abb971 
							
						 
					 
					
						
						
							
							Merge pull request  #6647  from svlandeg/feature/init_config_overwrite  
						
						
						
					 
					
						2021-01-05 14:59:04 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3614472e29 
							
						 
					 
					
						
						
							
							Merge pull request  #6646  from svlandeg/feature/cli-docs [ci skip]  
						
						
						
					 
					
						2021-01-05 13:52:49 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9c078a5885 
							
						 
					 
					
						
						
							
							Update formatting for consistency [ci skip]  
						
						
						
					 
					
						2021-01-05 13:52:28 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a9e845426f 
							
						 
					 
					
						
						
							
							Use --force for consistency and add docs  
						
						
						
					 
					
						2021-01-05 13:49:59 +11:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							d5ff0fecf8 
							
						 
					 
					
						
						
							
							add docs  
						
						
						
					 
					
						2020-12-30 14:01:13 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2fa23b0304 
							
						 
					 
					
						
						
							
							fix capitalization for link  
						
						
						
					 
					
						2020-12-29 15:01:22 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							43cc6aea93 
							
						 
					 
					
						
						
							
							remove non-existing link  
						
						
						
					 
					
						2020-12-29 14:59:39 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							543073bf9d 
							
						 
					 
					
						
						
							
							add pretrain example  
						
						
						
					 
					
						2020-12-29 14:51:23 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1d0ef98873 
							
						 
					 
					
						
						
							
							move example  
						
						
						
					 
					
						2020-12-29 14:46:03 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							20113b8063 
							
						 
					 
					
						
						
							
							add train CLI example  
						
						
						
					 
					
						2020-12-29 14:44:56 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							87562e470d 
							
						 
					 
					
						
						
							
							fix backticks in docs ( #6635 )  
						
						
						
					 
					
						2020-12-27 22:12:37 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8df5b7f513 
							
						 
					 
					
						
						
							
							fix documentation of 'path' in tokenizer.to_disk ( #6634 )  
						
						
						
					 
					
						2020-12-27 22:01:06 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							282a3b49ea 
							
						 
					 
					
						
						
							
							Fix  parser resizing when there is no upper layer ( #6460 )  
						
						... 
						
						
						
						* allow resizing of the parser model even when upper=False
* update from spacy.TransitionBasedParser.v1 to v2
* bugfix 
						
					 
					
						2020-12-18 18:56:57 +08:00 
						 
				 
			
				
					
						
							
							
								Gareth Sparks 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							efc229c3f4 
							
						 
					 
					
						
						
							
							Doc.char_span arg: alignment_mode ( #6591 )  
						
						... 
						
						
						
						Currently labeled "mode", actually "alignment_mode" 
						
					 
					
						2020-12-18 09:54:56 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							513c4e332a 
							
						 
					 
					
						
						
							
							Include custom code via spacy package command ( #6531 )  
						
						
						
					 
					
						2020-12-10 20:36:46 +08:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2a6043fabb 
							
						 
					 
					
						
						
							
							Merge pull request  #6530  from explosion/feature/init-config-cpu-gpu  
						
						
						
					 
					
						2020-12-10 09:38:46 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9d32e839d3 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/init-config-cpu-gpu  
						
						
						
					 
					
						2020-12-10 08:50:53 +11:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							972820e2b3 
							
						 
					 
					
						
						
							
							Add batch_size to data formats docs  
						
						
						
					 
					
						2020-12-09 12:44:04 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							80ac8af1bf 
							
						 
					 
					
						
						
							
							Format  
						
						
						
					 
					
						2020-12-09 12:44:01 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							795b5bd049 
							
						 
					 
					
						
						
							
							Update website/docs/api/language.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-12-09 12:23:32 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							fa8fa474a3 
							
						 
					 
					
						
						
							
							Add nlp.batch_size setting  
						
						... 
						
						
						
						Add a default `batch_size` setting for `Language.pipe` and
`Language.evaluate` as `nlp.batch_size`. 
						
					 
					
						2020-12-09 09:13:26 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							34449b66fd 
							
						 
					 
					
						
						
							
							Update matcher.md  
						
						
						
					 
					
						2020-12-09 11:09:45 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							758ad6c3cd 
							
						 
					 
					
						
						
							
							Make CPU the default for init config  
						
						
						
					 
					
						2020-12-09 11:00:51 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							94a5a9814f 
							
						 
					 
					
						
						
							
							Update argument handling and documentation  
						
						
						
					 
					
						2020-12-08 20:41:18 +11:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							5ceac425ee 
							
						 
					 
					
						
						
							
							Remove non-working --use-chars from train CLI  
						
						... 
						
						
						
						Remove the non-working `--use-chars` option from the train CLI. The
implementation of the option across component types and the CLI settings
could be fixed, but the `CharacterEmbed` model does not work on GPU in
v2 so it's better to remove it. 
						
					 
					
						2020-12-08 08:30:00 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2c27093c5f 
							
						 
					 
					
						
						
							
							require_cpu functionality ( #6336 )  
						
						... 
						
						
						
						* add require_cpu from Thinc 8.0.0rc2
* add docs
* fix test if cupy is not installed 
						
					 
					
						2020-12-08 14:42:40 +08:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ee2ec52f48 
							
						 
					 
					
						
						
							
							Merge pull request  #6409  from svlandeg/feature/trf-docs  
						
						
						
					 
					
						2020-12-08 06:32:10 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							82e88f0e3b 
							
						 
					 
					
						
						
							
							Merge pull request  #6379  from svlandeg/fix/labels-constructor  
						
						
						
					 
					
						2020-12-08 06:29:56 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							636be3c791 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/trf-docs  
						
						
						
					 
					
						2020-11-19 14:15:35 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							165993d8e5 
							
						 
					 
					
						
						
							
							fix typo in transformer docs ( #6404 )  
						
						
						
					 
					
						2020-11-19 14:11:38 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							73fc1ed963 
							
						 
					 
					
						
						
							
							remove labels from morphologizer constructor  
						
						
						
					 
					
						2020-11-11 21:48:50 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fcd79e0655 
							
						 
					 
					
						
						
							
							remove set_morphology from docs  
						
						
						
					 
					
						2020-11-11 21:32:34 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							789fb3d124 
							
						 
					 
					
						
						
							
							add docs for upstream argument of TransformerListener  
						
						
						
					 
					
						2020-11-09 21:42:58 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							363ac73c72 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-11-09 12:43:26 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8644ee3e3f 
							
						 
					 
					
						
						
							
							Update TIGER link and tag description ( #6344 )  
						
						
						
					 
					
						2020-11-05 09:33:00 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8ef056cf98 
							
						 
					 
					
						
						
							
							fix embed_size in Entity Linker architecture ( #6343 )  
						
						
						
					 
					
						2020-11-04 22:20:13 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a4b32b9552 
							
						 
					 
					
						
						
							
							Handle missing reference values in scorer ( #6286 )  
						
						... 
						
						
						
						* Handle missing reference values in scorer
Handle missing values in reference doc during scoring where it is
possible to detect an unset state for the attribute. If no reference
docs contain annotation, `None` is returned instead of a score. `spacy
evaluate` displays `-` for missing scores and the missing scores are
saved as `None`/`null` in the metrics.
Attributes without unset states:
* `token.head`: relies on `token.dep` to recognize unset values
* `doc.cats`: unable to handle missing annotation
Additional changes:
* add optional `has_annotation` check to `score_scans` to replace
`doc.sents` hack
* update `score_token_attr_per_feat` to handle missing and empty morph
representations
* fix bug in `Doc.has_annotation` for normalization of `IS_SENT_START`
vs. `SENT_START`
* Fix import
* Update return types 
						
					 
					
						2020-11-03 15:47:18 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							75a202ce65 
							
						 
					 
					
						
						
							
							TextCat updates and fixes ( #6263 )  
						
						... 
						
						
						
						* small fix in example imports
* throw error when train_corpus or dev_corpus is not a string
* small fix in custom logger example
* limit macro_auc to labels with 2 annotations
* fix typo
* also create parents of output_dir if need be
* update documentation of textcat scores
* refactor TextCatEnsemble
* fix tests for new AUC definition
* bump to 3.0.0a42
* update docs
* rename to spacy.TextCatEnsemble.v2
* spacy.TextCatEnsemble.v1 in legacy
* cleanup
* small fix
* update to 3.0.0rc2
* fix import that got lost in merge
* cursed IDE
* fix two typos 
						
					 
					
						2020-10-18 14:50:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4d99d2b94a 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-13 11:38:52 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							40276fd3be 
							
						 
					 
					
						
						
							
							update NEL docs after latest refactor  
						
						
						
					 
					
						2020-10-12 11:41:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							e50dc2c1c9 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-09 12:04:52 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							329b61ee7b 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-09 10:36:06 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d093d6343b 
							
						 
					 
					
						
						
							
							TrainablePipe ( #6213 )  
						
						... 
						
						
						
						* rename Pipe to TrainablePipe
* split functionality between Pipe and TrainablePipe
* remove unnecessary methods from certain components
* cleanup
* hasattr(component, "pipe") should be sufficient again
* remove serialization and vocab/cfg from Pipe
* unify _ensure_examples and validate_examples
* small fixes
* hasattr checks for self.cfg and self.vocab
* make is_resizable and is_trainable properties
* serialize strings.json instead of vocab
* fix KB IO + tests
* fix typos
* more typos
* _added_strings as a set
* few more tests specifically for _added_strings field
* bump to 3.0.0a36 
						
					 
					
						2020-10-08 21:33:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							064575d79d 
							
						 
					 
					
						
						
							
							Merge pull request  #6216  from svlandeg/feature/nel-initialize  
						
						
						
					 
					
						2020-10-08 11:14:12 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							43e59bb22a 
							
						 
					 
					
						
						
							
							Update docs and install extras [ci skip]  
						
						
						
					 
					
						2020-10-08 10:58:50 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							eaf5c265cb 
							
						 
					 
					
						
						
							
							set_kb method for entity_linker  
						
						
						
					 
					
						2020-10-08 10:34:01 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2fd7122074 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-06 10:31:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							568e12215d 
							
						 
					 
					
						
						
							
							Merge pull request  #6206  from svlandeg/fix/patterns-init  
						
						
						
					 
					
						2020-10-06 10:27:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							9b4cf7b0b6 
							
						 
					 
					
						
						
							
							update output of debug config command  
						
						
						
					 
					
						2020-10-06 09:47:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fd0f60e2bc 
							
						 
					 
					
						
						
							
							updates to data format for training and pretraining  
						
						
						
					 
					
						2020-10-06 09:28:53 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9ac39c88 
							
						 
					 
					
						
						
							
							read entity_ruler patterns with srsly.read_jsonl.v1  
						
						
						
					 
					
						2020-10-05 22:50:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1a554bdcb1 
							
						 
					 
					
						
						
							
							Update docs and docstring [ci skip]  
						
						
						
					 
					
						2020-10-05 21:55:27 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							919790cb47 
							
						 
					 
					
						
						
							
							Upd MultiHashEmbed docs  
						
						
						
					 
					
						2020-10-05 20:28:21 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							193e0d5a98 
							
						 
					 
					
						
						
							
							add docs for entity_ruler.initialize  
						
						
						
					 
					
						2020-10-05 18:04:08 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							65abd77779 
							
						 
					 
					
						
						
							
							add finish_update to Pipe  
						
						
						
					 
					
						2020-10-05 16:23:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							0f64556c04 
							
						 
					 
					
						
						
							
							Merge pull request  #6197  from svlandeg/feature/pipe-docs [ci skip]  
						
						
						
					 
					
						2020-10-05 11:55:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							52b660e9dc 
							
						 
					 
					
						
						
							
							initialize and update explanation  
						
						
						
					 
					
						2020-10-05 00:39:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c36a57e84 
							
						 
					 
					
						
						
							
							Update data augmenters ( #6196 )  
						
						... 
						
						
						
						* Draft lower-case augmenter
* Make warning a debug log
* Update lowercase augmenter, docs and tests
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-10-04 17:46:29 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							11347f34da 
							
						 
					 
					
						
						
							
							Tidy up, tests and docs  
						
						
						
					 
					
						2020-10-04 13:54:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							989c59918c 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-03 18:53:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							7c4ab7e82c 
							
						 
					 
					
						
						
							
							Fix Lemmatizer.get_lookups_config  
						
						
						
					 
					
						2020-10-03 17:16:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							dd542ec6a4 
							
						 
					 
					
						
						
							
							Fix label initialization of textcat component ( #6190 )  
						
						
						
					 
					
						2020-10-03 17:07:38 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							35d695a031 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-10-03 16:08:24 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							02247cccaf 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/small-fixes  
						
						
						
					 
					
						2020-10-02 20:48:11 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							09dcb75076 
							
						 
					 
					
						
						
							
							small UX fix for DocBin ( #6167 )  
						
						... 
						
						
						
						* add informative warning when messing up store_user_data DocBin flags
* add informative warning when messing up store_user_data DocBin flags
* cleanup test
* rename to patterns_path 
						
					 
					
						2020-10-02 15:43:32 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f0b30aedad 
							
						 
					 
					
						
						
							
							Make lemmatizers use initialize logic ( #6182 )  
						
						... 
						
						
						
						* Make lemmatizer use initialize logic and tidy up
* Fix typo
* Raise for uninitialized tables 
						
					 
					
						2020-10-02 15:42:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							df06f7a792 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 13:24:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d2aa662ab2 
							
						 
					 
					
						
						
							
							Merge pull request  #6179  from adrianeboyd/feature/token-morph-refactor-2 [ci skip]  
						
						
						
					 
					
						2020-10-02 12:10:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							32cdc1c4f4 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 11:38:03 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							fd09e6b140 
							
						 
					 
					
						
						
							
							Update docs for Token.morph / Token.set_morph  
						
						
						
					 
					
						2020-10-02 09:05:15 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							01c1538c72 
							
						 
					 
					
						
						
							
							Integrate file readers  
						
						
						
					 
					
						2020-10-02 01:36:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6b94cee468 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-10-02 01:11:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f2627157c8 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 17:38:17 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1328c9fd14 
							
						 
					 
					
						
						
							
							consistently use --code instead of --code-path  
						
						
						
					 
					
						2020-10-01 16:59:22 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a22215f427 
							
						 
					 
					
						
						
							
							Add FeatureExtractor from Thinc ( #6170 )  
						
						... 
						
						
						
						* move featureextractor from Thinc
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-10-01 16:22:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0a8a124a6e 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 12:15:53 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a103ab5f1a 
							
						 
					 
					
						
						
							
							Update augmenter lookups and docs  
						
						
						
					 
					
						2020-09-30 23:03:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							115481aca7 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-30 15:16:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9bb958fd0a 
							
						 
					 
					
						
						
							
							Fix debug data [ci skip]  
						
						
						
					 
					
						2020-09-29 23:07:11 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							604be54a5c 
							
						 
					 
					
						
						
							
							Support --code in evaluate CLI [ci skip]  
						
						
						
					 
					
						2020-09-29 21:20:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d3c63b7965 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/prepare  
						
						
						
					 
					
						2020-09-29 20:53:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							361f91e286 
							
						 
					 
					
						
						
							
							Merge pull request  #6135  from walterhenry/develop-proof  
						
						
						
					 
					
						2020-09-29 20:49:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b486389eec 
							
						 
					 
					
						
						
							
							Update website/docs/api/doc.md  
						
						
						
					 
					
						2020-09-29 20:48:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d7469283c5 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-29 16:59:21 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							c1c841940c 
							
						 
					 
					
						
						
							
							Merge branch 'develop-proof' of  https://github.com/walterhenry/spaCy  into develop-proof  
						
						
						
					 
					
						2020-09-29 11:47:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9a63bfbd 
							
						 
					 
					
						
						
							
							begin_training -> initialize  
						
						
						
					 
					
						2020-09-28 21:35:09 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							3360825e00 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						Another round of proofreading. All the API docs have been read through and I've grazed the Usage docs. 
						
					 
					
						2020-09-28 16:50:15 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a976da168c 
							
						 
					 
					
						
						
							
							Support data augmentation in Corpus ( #6155 )  
						
						... 
						
						
						
						* Support data augmentation in Corpus
* Note initial docs for data augmentation
* Add augmenter to quickstart
* Fix flake8
* Format
* Fix test
* Update spacy/tests/training/test_training.py
* Improve data augmentation arguments
* Update templates
* Move randomization out into caller
* Refactor
* Update spacy/training/augment.py
* Update spacy/tests/training/test_training.py
* Fix augment
* Fix test 
						
					 
					
						2020-09-28 03:03:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f29d5b9b89 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-27 18:39:38 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							009ba14aaf 
							
						 
					 
					
						
						
							
							Fix pretraining in train script ( #6143 )  
						
						... 
						
						
						
						* update pretraining API in train CLI
* bump thinc to 8.0.0a35
* bump to 3.0.0a26
* doc fixes
* small doc fix 
						
					 
					
						2020-09-25 15:47:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2aa4d65734 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-24 20:41:09 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c062b3911 
							
						 
					 
					
						
						
							
							Add MORPH handling to Matcher ( #6107 )  
						
						... 
						
						
						
						* Add MORPH handling to Matcher
* Add `MORPH` to `Matcher` schema
* Rename `_SetMemberPredicate` to `_SetPredicate`
* Add `ISSUBSET` and `ISSUPERSET` operators to `_SetPredicate`
  * Add special handling for normalization and conversion of morph
    values into sets
  * For other attrs, `ISSUBSET` acts like `IN` and `ISSUPERSET` only
    matches for 0 or 1 values
* Update test
* Rename to IS_SUBSET and IS_SUPERSET 
						
					 
					
						2020-09-24 16:55:09 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c7eedd3534 
							
						 
					 
					
						
						
							
							updates to NEL functionality ( #6132 )  
						
						... 
						
						
						
						* NEL: read sentences and ents from reference
* fiddling with sent_start annotations
* add KB serialization test
* KB write additional file with strings.json
* score_links function to calculate NEL P/R/F
* formatting
* documentation 
						
					 
					
						2020-09-24 16:53:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							58dde293ce 
							
						 
					 
					
						
						
							
							Merge pull request  #6089  from adrianeboyd/feature/doc-ents-v3-2  
						
						
						
					 
					
						2020-09-24 14:44:42 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							74e1f192b4 
							
						 
					 
					
						
						
							
							Merge pull request  #6134  from explosion/feature/training_before_to_disk  
						
						
						
					 
					
						2020-09-24 14:44:11 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3b58a8be2b 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-24 14:32:42 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							88e54caa12 
							
						 
					 
					
						
						
							
							accuracy -> performance  
						
						
						
					 
					
						2020-09-24 14:32:35 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b92c8aae78 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into pr/6135  
						
						
						
					 
					
						2020-09-24 13:44:56 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							3dd5f409ec 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						Proofread some API docs 
						
					 
					
						2020-09-24 13:15:28 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							1c63f02f99 
							
						 
					 
					
						
						
							
							Add API docs  
						
						
						
					 
					
						2020-09-24 12:51:16 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							138c8d45db 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-24 12:43:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ae51f580c1 
							
						 
					 
					
						
						
							
							Fix handling of score_weights  
						
						
						
					 
					
						2020-09-24 10:27:33 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							dd2292793f 
							
						 
					 
					
						
						
							
							'parser' instead of 'deps' for state_type  
						
						
						
					 
					
						2020-09-23 16:53:49 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							6c85fab316 
							
						 
					 
					
						
						
							
							state_type and extra_state_tokens instead of nr_feature_tokens  
						
						
						
					 
					
						2020-09-23 13:35:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6ca06cb62c 
							
						 
					 
					
						
						
							
							Update docs and formatting [ci skip]  
						
						
						
					 
					
						2020-09-23 10:14:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							b556a10808 
							
						 
					 
					
						
						
							
							rename converts in_to_out  
						
						
						
					 
					
						2020-09-22 11:50:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f9af7d365c 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-22 09:45:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							49e80dbcac 
							
						 
					 
					
						
						
							
							Merge pull request  #6103  from explosion/chore/tidy-up-tests-docs-get-doc  
						
						
						
					 
					
						2020-09-22 09:45:04 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							5fbb8dfcbc 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into docs/various-v3-2  
						
						
						
					 
					
						2020-09-22 09:22:58 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							67fbcb3da5 
							
						 
					 
					
						
						
							
							Tidy up tests and docs  
						
						
						
					 
					
						2020-09-21 20:43:54 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							f212303729 
							
						 
					 
					
						
						
							
							Add sent_starts to Doc.__init__  
						
						... 
						
						
						
						Add sent_starts to `Doc.__init__`. Officially specify `is_sent_start`
values but also convert to and accept `sent_start` internally. 
						
					 
					
						2020-09-21 17:59:09 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							6aa91c7ca0 
							
						 
					 
					
						
						
							
							Make user_data keyword-only  
						
						
						
					 
					
						2020-09-21 16:00:06 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							bc02e86494 
							
						 
					 
					
						
						
							
							Extend Doc.__init__ with additional annotation  
						
						... 
						
						
						
						Mostly copying from `spacy.tests.util.get_doc`, add additional kwargs to
`Doc.__init__` to initialize the most common doc/token values. 
						
					 
					
						2020-09-21 13:36:24 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							3aa57ce6c9 
							
						 
					 
					
						
						
							
							Update alignment mode in Doc.char_span docs  
						
						
						
					 
					
						2020-09-21 09:07:20 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							012b3a7096 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-20 17:44:58 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							554c9a2497 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-20 12:30:53 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							39872de1f6 
							
						 
					 
					
						
						
							
							Introducing the gpu_allocator ( #6091 )  
						
						... 
						
						
						
						* rename 'use_pytorch_for_gpu_memory' to 'gpu_allocator'
* --code instead of --code-path
* update documentation
* avoid querying the "system" section directly
* add explanation of gpu_allocator to TF/PyTorch section in docs
* fix typo
* fix typo 2
* use set_gpu_allocator from thinc 8.0.0a34
* default null instead of empty string 
						
					 
					
						2020-09-19 01:17:02 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0406200a1e 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-18 15:13:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a127fa475e 
							
						 
					 
					
						
						
							
							Merge pull request  #6078  from svlandeg/fix/corpus  
						
						
						
					 
					
						2020-09-18 14:44:21 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d32ce121be 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-09-18 13:41:12 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1bb8b4f824 
							
						 
					 
					
						
						
							
							Merge branch 'master' into develop  
						
						
						
					 
					
						2020-09-17 17:46:20 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e3ce9f42f 
							
						 
					 
					
						
						
							
							Merge branch 'feature/init-config-pretrain' of  https://github.com/svlandeg/spaCy  into pr/6084  
						
						
						
					 
					
						2020-09-17 16:58:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3d8e010655 
							
						 
					 
					
						
						
							
							Change order  
						
						
						
					 
					
						2020-09-17 16:58:46 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c4b414b282 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						
						
					 
					
						2020-09-17 16:58:09 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							e5ceec5df0 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-09-17 16:56:20 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							127ce0c574 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-09-17 16:55:53 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							5fade4feb7 
							
						 
					 
					
						
						
							
							fix cli abbrev  
						
						
						
					 
					
						2020-09-17 16:15:20 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ddfc1fc146 
							
						 
					 
					
						
						
							
							add pretraining option to init config  
						
						
						
					 
					
						2020-09-17 16:05:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							130ffa5fbf 
							
						 
					 
					
						
						
							
							fix typos in docs  
						
						
						
					 
					
						2020-09-17 14:59:41 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							0c35885751 
							
						 
					 
					
						
						
							
							generalize corpora, dot notation for dev and train corpus  
						
						
						
					 
					
						2020-09-17 11:38:59 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							8cedb2f380 
							
						 
					 
					
						
						
							
							Merge branch 'fix/corpus' of  https://github.com/svlandeg/spaCy  into fix/corpus  
						
						
						
					 
					
						2020-09-17 09:27:55 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							781fae678b 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into fix/corpus  
						
						
						
					 
					
						2020-09-17 09:24:36 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							21dcf92964 
							
						 
					 
					
						
						
							
							Update website/docs/api/data-formats.md  
						
						... 
						
						
						
						Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-09-17 09:21:36 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							7e4cd7575c 
							
						 
					 
					
						
						
							
							Refactor Docs.is_ flags ( #6044 )  
						
						... 
						
						
						
						* Refactor Docs.is_ flags
* Add derived `Doc.has_annotation` method
  * `Doc.has_annotation(attr)` returns `True` for partial annotation
  * `Doc.has_annotation(attr, require_complete=True)` returns `True` for
    complete annotation
* Add deprecation warnings to `is_tagged`, `is_parsed`, `is_sentenced`
and `is_nered`
* Add `Doc._get_array_attrs()`, which returns a full list of `Doc` attrs
for use with `Doc.to_array`, `Doc.to_bytes` and `Doc.from_docs`. The
list is the `DocBin` attributes list plus `SPACY` and `LENGTH`.
Notes on `Doc.has_annotation`:
* `HEAD` is converted to `DEP` because heads don't have an unset state
* Accept `IS_SENT_START` as a synonym of `SENT_START`
Additional changes:
* Add `NORM`, `ENT_ID` and `SENT_START` to default attributes for
`DocBin`
* In `Doc.from_array()` the presence of `DEP` causes `HEAD` to override
`SENT_START`
* In `Doc.from_array()` using `attrs` other than
`Doc._get_array_attrs()` (i.e., a user's custom list rather than our
default internal list) with both `HEAD` and `SENT_START` shows a warning
that `HEAD` will override `SENT_START`
* `set_children_from_heads` does not require dependency labels to set
sentence boundaries and sets `sent_start` for all non-sentence starts to
`-1`
* Fix call to set_children_form_heads
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-09-17 00:14:01 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							55f8d5478e 
							
						 
					 
					
						
						
							
							fix example output  
						
						
						
					 
					
						2020-09-15 22:09:30 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							51fa929f47 
							
						 
					 
					
						
						
							
							rewrite train_corpus to corpus.train in config  
						
						
						
					 
					
						2020-09-15 21:58:04 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0edd695bf6 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-15 11:41:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							99549a5ace 
							
						 
					 
					
						
						
							
							Fix consistency and update docs  
						
						
						
					 
					
						2020-09-15 11:37:37 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							154752f9c2 
							
						 
					 
					
						
						
							
							Update docs and consistency [ci skip]  
						
						
						
					 
					
						2020-09-15 00:32:49 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3216a33149 
							
						 
					 
					
						
						
							
							positive_label config for textcat ( #6062 )  
						
						... 
						
						
						
						* hook up positive_label in textcat
* unit tests
* documentation
* formatting
* tests
* fix typo
* move verify_config to after begin_training
* revert accidential commit 
						
					 
					
						2020-09-14 17:08:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9afb1d9965 
							
						 
					 
					
						
						
							
							Merge pull request  #6063  from svlandeg/feature/doc_cleanup [ci skip]  
						
						
						
					 
					
						2020-09-14 10:35:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							47acb45850 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-13 22:30:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e3d067a7b 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-13 19:29:06 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							744df9814a 
							
						 
					 
					
						
						
							
							define threshold for scoring textcat in TextCat config ( #6055 )  
						
						... 
						
						
						
						* define threshold for scoring textcat in TextCat config
* fix unit test and documentation 
						
					 
					
						2020-09-13 14:15:52 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c4f324d5f1 
							
						 
					 
					
						
						
							
							doc fixes  
						
						
						
					 
					
						2020-09-12 17:38:54 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							8b0dabe987 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-12 17:05:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0b2e07215d 
							
						 
					 
					
						
						
							
							Support overwriting name on spacy package  
						
						
						
					 
					
						2020-09-11 11:38:28 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							97d99f7efa 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/doc-fixes  
						
						
						
					 
					
						2020-09-10 11:51:34 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							15bc3a37b4 
							
						 
					 
					
						
						
							
							Add --branch to project clone  
						
						
						
					 
					
						2020-09-10 11:08:15 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b7afd09d27 
							
						 
					 
					
						
						
							
							Update formatting [ci skip]  
						
						
						
					 
					
						2020-09-10 11:07:09 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							9073d99fc9 
							
						 
					 
					
						
						
							
							fix link to shape inference section  
						
						
						
					 
					
						2020-09-10 10:22:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							1955aaaa20 
							
						 
					 
					
						
						
							
							Merge pull request  #6045  from svlandeg/feature/more-layers-docs [ci skip]  
						
						
						
					 
					
						2020-09-09 21:46:40 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e567a47c2 
							
						 
					 
					
						
						
							
							Update docs and formatting  
						
						
						
					 
					
						2020-09-09 21:26:10 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c89e07927e 
							
						 
					 
					
						
						
							
							document individual component API pages  
						
						
						
					 
					
						2020-09-09 16:18:38 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							cb66ea7400 
							
						 
					 
					
						
						
							
							Remove simple_ner code ( #6041 )  
						
						... 
						
						
						
						* remove simple_ner code
* remove unused _biluo and _iob files 
						
					 
					
						2020-09-09 16:11:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							a8aa9a8068 
							
						 
					 
					
						
						
							
							document Pipe API details, crossreferences etc  
						
						
						
					 
					
						2020-09-09 15:56:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							39aa740777 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/more-layers-docs  
						
						
						
					 
					
						2020-09-09 11:59:34 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8e7557656f 
							
						 
					 
					
						
						
							
							Renaming gold & annotation_setter ( #6042 )  
						
						... 
						
						
						
						* version bump to 3.0.0a16
* rename "gold" folder to "training"
* rename 'annotation_setter' to 'set_extra_annotations'
* formatting 
						
					 
					
						2020-09-09 10:31:03 +02:00 
						 
				 
			
				
					
						
							
							
								Marek Grzenkowicz 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a26f864ed3 
							
						 
					 
					
						
						
							
							Clarify how to choose pretrained weights files ( closes   #6027 ) [ci skip] ( #6039 )  
						
						
						
					 
					
						2020-09-08 21:13:50 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							bd8f9b188b 
							
						 
					 
					
						
						
							
							small fixes  
						
						
						
					 
					
						2020-09-08 17:24:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							157caf4dfa 
							
						 
					 
					
						
						
							
							WIP: update docs [ci skip]  
						
						
						
					 
					
						2020-09-04 16:30:31 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f174c7b1f3 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into pr/6018  
						
						
						
					 
					
						2020-09-04 15:54:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							864a697e63 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into master-tmp  
						
						
						
					 
					
						2020-09-04 13:15:36 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b927893309 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/dependency-matcher-v3  
						
						
						
					 
					
						2020-09-04 13:03:30 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4daf138136 
							
						 
					 
					
						
						
							
							Fix alphabetic ordering [ci skip]  
						
						
						
					 
					
						2020-09-03 23:01:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							23b7d9cfa3 
							
						 
					 
					
						
						
							
							Prefix span getters  
						
						
						
					 
					
						2020-09-03 17:37:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							5afe6447cd 
							
						 
					 
					
						
						
							
							registry.assets -> registry.misc  
						
						
						
					 
					
						2020-09-03 17:31:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c063e55eb7 
							
						 
					 
					
						
						
							
							Add prefix to batchers  
						
						
						
					 
					
						2020-09-03 17:30:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							804f120361 
							
						 
					 
					
						
						
							
							Don't use registered function version in title  
						
						
						
					 
					
						2020-09-03 17:29:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c53b1433b9 
							
						 
					 
					
						
						
							
							Adjust more arguments [ci skip]  
						
						
						
					 
					
						2020-09-03 17:12:24 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							25a595dc10 
							
						 
					 
					
						
						
							
							Fix typos and wording [ci skip]  
						
						
						
					 
					
						2020-09-03 16:37:45 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b5a0657fd6 
							
						 
					 
					
						
						
							
							"model" terminology consistency in docs  
						
						
						
					 
					
						2020-09-03 13:13:03 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							960d9cfadc 
							
						 
					 
					
						
						
							
							Officially support DependencyMatcher  
						
						... 
						
						
						
						Add official support for the `DependencyMatcher`. Redesign the pattern
specification. Fix and extend operator implementations. Update API docs
and add usage docs.
Patterns
--------
Refactor pattern structure to:
```
{
  "LEFT_ID": str,
  "REL_OP": str,
  "RIGHT_ID": str,
  "RIGHT_ATTRS": dict,
}
```
The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all
subsequent nodes contain all four keys.
New operators
-------------
Because of the way patterns are constructed from left to right, it's
helpful to have `follows` operators along with `precedes` operators. Add
operators for simple precedes / follows alongside immediate precedes /
follows.
* `.*`: precedes
* `;`: immediately follows
* `;*`: follows
Operator fixes
--------------
* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`,
  all sibling operators)
* Linear precedence operators do not match nodes outside the same parse
Additional fixes
----------------
* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling 
						
					 
					
						2020-09-02 17:45:29 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							bbaea530f6 
							
						 
					 
					
						
						
							
							sublayers paragraph  
						
						
						
					 
					
						2020-09-02 17:36:22 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9af82f3f11 
							
						 
					 
					
						
						
							
							Merge pull request  #6003  from explosion/feature/matcher-as-spans  
						
						
						
					 
					
						2020-08-31 17:50:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3929431af1 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-31 17:06:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							add9de5487 
							
						 
					 
					
						
						
							
							Deprecate (Phrase)Matcher.pipe  
						
						
						
					 
					
						2020-08-31 17:01:24 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2c3b64a567 
							
						 
					 
					
						
						
							
							console logging example  
						
						
						
					 
					
						2020-08-31 16:56:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							bca6bf8dda 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-31 16:39:53 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							db9f8896f5 
							
						 
					 
					
						
						
							
							Add docs [ci skip]  
						
						
						
					 
					
						2020-08-31 16:10:41 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fe6c08218e 
							
						 
					 
					
						
						
							
							fixes  
						
						
						
					 
					
						2020-08-31 14:51:49 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							0e0abb0378 
							
						 
					 
					
						
						
							
							fix  
						
						
						
					 
					
						2020-08-31 14:50:29 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							56ba691ecd 
							
						 
					 
					
						
						
							
							small fixes  
						
						
						
					 
					
						2020-08-31 14:46:00 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							e47ea88aeb 
							
						 
					 
					
						
						
							
							revert annotations refactor  
						
						
						
					 
					
						2020-08-31 14:40:55 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2c90a06fee 
							
						 
					 
					
						
						
							
							some more information about the loggers  
						
						
						
					 
					
						2020-08-31 13:43:17 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c18eb63483 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs  
						
						... 
						
						
						
						# Conflicts:
#	website/docs/usage/embeddings-transformers.md 
						
					 
					
						2020-08-31 13:21:36 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ec14744ee4 
							
						 
					 
					
						
						
							
							Rename Transformer listener ( #6001 )  
						
						... 
						
						
						
						* rename to spacy-transformers.TransformerListener
* add some more tok2vec tests
* use select_pipes
* fix docs - annotation setter was not changed in the end 
						
					 
					
						2020-08-31 12:41:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9b86312bab 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-29 18:43:19 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							870774f475 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into docs/morph-usage-v3  
						
						
						
					 
					
						2020-08-29 16:00:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							45f46a5c85 
							
						 
					 
					
						
						
							
							Merge pull request  #5993  from explosion/feature/disabled-components  
						
						
						
					 
					
						2020-08-29 15:58:41 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							f9ed31a757 
							
						 
					 
					
						
						
							
							Update usage docs for lemmatization and morphology  
						
						
						
					 
					
						2020-08-29 15:56:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							450bf806b0 
							
						 
					 
					
						
						
							
							Merge pull request  #5991  from adrianeboyd/docs/sent-usage-v3  
						
						... 
						
						
						
						Update sentence segmentation usage docs 
						
					 
					
						2020-08-29 12:40:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							66d76f5126 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-08-29 12:36:05 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							5230529de2 
							
						 
					 
					
						
						
							
							add loggers registry & logger docs sections  
						
						
						
					 
					
						2020-08-28 21:44:04 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							48df50533d 
							
						 
					 
					
						
						
							
							Update sentence segmentation usage docs  
						
						... 
						
						
						
						Update sentence segmentation usage docs to incorporate `senter`. 
						
					 
					
						2020-08-28 10:58:16 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							72a87095d9 
							
						 
					 
					
						
						
							
							add loggers registry  
						
						
						
					 
					
						2020-08-27 20:26:28 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							aa9e0c9c39 
							
						 
					 
					
						
						
							
							small fix  
						
						
						
					 
					
						2020-08-27 19:56:52 +02:00