Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							230e651ad6 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into master-tmp  
						
						
						
					 
					
						2021-01-27 13:26:29 +11:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							c447aa2b98 
							
						 
					 
					
						
						
							
							Update --code arg in evaluate CLI docs  
						
						
						
					 
					
						2021-01-26 15:30:46 +01:00 
						 
				 
			
				
					
						
							
							
								jganseman 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							907bce7a78 
							
						 
					 
					
						
						
							
							Merge pull request  #1  from jganseman/patch-1  
						
						... 
						
						
						
						Patch 1 
						
					 
					
						2021-01-26 11:12:30 +01:00 
						 
				 
			
				
					
						
							
							
								jganseman 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8bc57ec372 
							
						 
					 
					
						
						
							
							also update is_oov in lexeme docs  
						
						
						
					 
					
						2021-01-26 11:09:16 +01:00 
						 
				 
			
				
					
						
							
							
								jganseman 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							1f2b0ec168 
							
						 
					 
					
						
						
							
							proposing a more concise explanation for is_oov  
						
						... 
						
						
						
						proposing a more concise explanation for is_oov 
						
					 
					
						2021-01-26 10:53:39 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f049df1715 
							
						 
					 
					
						
						
							
							Revert "Set annotations in update" ( #6810 )  
						
						... 
						
						
						
						* Revert "Set annotations in update (#6767 )"
This reverts commit e680efc7cc 
						
					 
					
						2021-01-25 22:18:45 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							61c9f8bf24 
							
						 
					 
					
						
						
							
							Remove transformers model max length section ( #6807 )  
						
						
						
					 
					
						2021-01-25 19:59:34 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d0236136a2 
							
						 
					 
					
						
						
							
							Fix default config init in Transformer API docs ( #6781 )  
						
						
						
					 
					
						2021-01-21 23:18:03 +08:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							e680efc7cc 
							
						 
					 
					
						
						
							
							Set annotations in update ( #6767 )  
						
						... 
						
						
						
						* bump to 3.0.0rc4
* do set_annotations in component update calls
* update docs and remove set_annotations flag
* fix EL test 
						
					 
					
						2021-01-20 11:49:25 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f50502dad7 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2021-01-19 00:22:47 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							fed8f48965 
							
						 
					 
					
						
						
							
							raise NotImplementedError when noun_chunks iterator is not implemented ( #6711 )  
						
						... 
						
						
						
						* raise NotImplementedError when noun_chunks iterator is not implemented
* bring back, fix and document span.noun_chunks
* formatting
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2021-01-17 19:56:05 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							bf0cdae8d4 
							
						 
					 
					
						
						
							
							Add token_splitter component ( #6726 )  
						
						... 
						
						
						
						* Add long_token_splitter component
Add a `long_token_splitter` component for use with transformer
pipelines. This component splits up long tokens like URLs into smaller
tokens. This is particularly relevant for pretrained pipelines with
`strided_spans`, since the user can't change the length of the span
`window` and may not wish to preprocess the input texts.
The `long_token_splitter` splits tokens that are at least
`long_token_length` tokens long into smaller tokens of `split_length`
size.
Notes:
* Since this is intended for use as the first component in a pipeline,
the token splitter does not try to preserve any token annotation.
* API docs to come when the API is stable.
* Adjust API, add test
* Fix name in factory 
						
					 
					
						2021-01-17 19:54:41 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9328dd5625 
							
						 
					 
					
						
						
							
							Handle unset token.morph in Morphologizer ( #6704 )  
						
						... 
						
						
						
						* Handle unset token.morph in Morphologizer
Handle unset `token.morph` in `Morphologizer.initialize` and
`Morphologizer.get_loss`. If both `token.morph` and `token.pos` are
unset, treat the annotation as missing rather than empty.
* Add token.has_morph() 
						
					 
					
						2021-01-15 17:20:10 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							0c936004d1 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master-rc3  
						
						
						
					 
					
						2021-01-14 11:49:58 +01:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f277bfdf0f 
							
						 
					 
					
						
						
							
							Add SpanGroup and Graph container types to represent arbitrary annotations ( #6696 )  
						
						... 
						
						
						
						* Draft out initial Spans data structure
* Initial span group commit
* Basic span group support on Doc
* Basic test for span group
* Compile span_group.pyx
* Draft addition of SpanGroup to DocBin
* Add deserialization for SpanGroup
* Add tests for serializing SpanGroup
* Fix serialization of SpanGroup
* Add EdgeC and GraphC structs
* Add draft Graph data structure
* Compile graph
* More work on Graph
* Update GraphC
* Upd graph
* Fix walk functions
* Let Graph take nodes and edges on construction
* Fix walking and getting
* Add graph tests
* Fix import
* Add module with the SpanGroups dict thingy
* Update test
* Rename 'span_groups' attribute
* Try to fix c++11 compilation
* Fix test
* Update DocBin
* Try to fix compilation
* Try to fix graph
* Improve SpanGroup docstrings
* Add doc.spans to documentation
* Fix serialization
* Tidy up and add docs
* Update docs [ci skip]
* Add SpanGroup.has_overlap
* WIP updated Graph API
* Start testing new Graph API
* Update Graph tests
* Update Graph
* Add docstring
Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2021-01-14 17:30:41 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							75d9019343 
							
						 
					 
					
						
						
							
							Fix types of Tok2Vec encoding architectures ( #6442 )  
						
						... 
						
						
						
						* fix TorchBiLSTMEncoder documentation
* ensure the types of the encoding Tok2vec layers are correct
* update references from v1 to v2 for the new architectures 
						
					 
					
						2021-01-07 16:39:27 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							82ae95267a 
							
						 
					 
					
						
						
							
							Docs for pretrain architectures ( #6605 )  
						
						... 
						
						
						
						* document pretraining architectures
* formatting
* bit more info
* small fixes 
						
					 
					
						2021-01-06 16:12:30 +11:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							afc5714d32 
							
						 
					 
					
						
						
							
							multi-label textcat component ( #6474 )  
						
						... 
						
						
						
						* multi-label textcat component
* formatting
* fix comment
* cleanup
* fix from #6481 
* random edit to push the tests
* add explicit error when textcat is called with multi-label gold data
* fix error nr
* small fix 
						
					 
					
						2021-01-06 13:07:14 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							6f83abb971 
							
						 
					 
					
						
						
							
							Merge pull request  #6647  from svlandeg/feature/init_config_overwrite  
						
						
						
					 
					
						2021-01-05 14:59:04 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3614472e29 
							
						 
					 
					
						
						
							
							Merge pull request  #6646  from svlandeg/feature/cli-docs [ci skip]  
						
						
						
					 
					
						2021-01-05 13:52:49 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9c078a5885 
							
						 
					 
					
						
						
							
							Update formatting for consistency [ci skip]  
						
						
						
					 
					
						2021-01-05 13:52:28 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a9e845426f 
							
						 
					 
					
						
						
							
							Use --force for consistency and add docs  
						
						
						
					 
					
						2021-01-05 13:49:59 +11:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							d5ff0fecf8 
							
						 
					 
					
						
						
							
							add docs  
						
						
						
					 
					
						2020-12-30 14:01:13 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2fa23b0304 
							
						 
					 
					
						
						
							
							fix capitalization for link  
						
						
						
					 
					
						2020-12-29 15:01:22 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							43cc6aea93 
							
						 
					 
					
						
						
							
							remove non-existing link  
						
						
						
					 
					
						2020-12-29 14:59:39 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							543073bf9d 
							
						 
					 
					
						
						
							
							add pretrain example  
						
						
						
					 
					
						2020-12-29 14:51:23 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1d0ef98873 
							
						 
					 
					
						
						
							
							move example  
						
						
						
					 
					
						2020-12-29 14:46:03 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							20113b8063 
							
						 
					 
					
						
						
							
							add train CLI example  
						
						
						
					 
					
						2020-12-29 14:44:56 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							87562e470d 
							
						 
					 
					
						
						
							
							fix backticks in docs ( #6635 )  
						
						
						
					 
					
						2020-12-27 22:12:37 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8df5b7f513 
							
						 
					 
					
						
						
							
							fix documentation of 'path' in tokenizer.to_disk ( #6634 )  
						
						
						
					 
					
						2020-12-27 22:01:06 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							282a3b49ea 
							
						 
					 
					
						
						
							
							Fix  parser resizing when there is no upper layer ( #6460 )  
						
						... 
						
						
						
						* allow resizing of the parser model even when upper=False
* update from spacy.TransitionBasedParser.v1 to v2
* bugfix 
						
					 
					
						2020-12-18 18:56:57 +08:00 
						 
				 
			
				
					
						
							
							
								Gareth Sparks 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							efc229c3f4 
							
						 
					 
					
						
						
							
							Doc.char_span arg: alignment_mode ( #6591 )  
						
						... 
						
						
						
						Currently labeled "mode", actually "alignment_mode" 
						
					 
					
						2020-12-18 09:54:56 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							513c4e332a 
							
						 
					 
					
						
						
							
							Include custom code via spacy package command ( #6531 )  
						
						
						
					 
					
						2020-12-10 20:36:46 +08:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2a6043fabb 
							
						 
					 
					
						
						
							
							Merge pull request  #6530  from explosion/feature/init-config-cpu-gpu  
						
						
						
					 
					
						2020-12-10 09:38:46 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9d32e839d3 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/init-config-cpu-gpu  
						
						
						
					 
					
						2020-12-10 08:50:53 +11:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							972820e2b3 
							
						 
					 
					
						
						
							
							Add batch_size to data formats docs  
						
						
						
					 
					
						2020-12-09 12:44:04 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							80ac8af1bf 
							
						 
					 
					
						
						
							
							Format  
						
						
						
					 
					
						2020-12-09 12:44:01 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							795b5bd049 
							
						 
					 
					
						
						
							
							Update website/docs/api/language.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-12-09 12:23:32 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							fa8fa474a3 
							
						 
					 
					
						
						
							
							Add nlp.batch_size setting  
						
						... 
						
						
						
						Add a default `batch_size` setting for `Language.pipe` and
`Language.evaluate` as `nlp.batch_size`. 
						
					 
					
						2020-12-09 09:13:26 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							34449b66fd 
							
						 
					 
					
						
						
							
							Update matcher.md  
						
						
						
					 
					
						2020-12-09 11:09:45 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							758ad6c3cd 
							
						 
					 
					
						
						
							
							Make CPU the default for init config  
						
						
						
					 
					
						2020-12-09 11:00:51 +11:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							94a5a9814f 
							
						 
					 
					
						
						
							
							Update argument handling and documentation  
						
						
						
					 
					
						2020-12-08 20:41:18 +11:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							5ceac425ee 
							
						 
					 
					
						
						
							
							Remove non-working --use-chars from train CLI  
						
						... 
						
						
						
						Remove the non-working `--use-chars` option from the train CLI. The
implementation of the option across component types and the CLI settings
could be fixed, but the `CharacterEmbed` model does not work on GPU in
v2 so it's better to remove it. 
						
					 
					
						2020-12-08 08:30:00 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2c27093c5f 
							
						 
					 
					
						
						
							
							require_cpu functionality ( #6336 )  
						
						... 
						
						
						
						* add require_cpu from Thinc 8.0.0rc2
* add docs
* fix test if cupy is not installed 
						
					 
					
						2020-12-08 14:42:40 +08:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ee2ec52f48 
							
						 
					 
					
						
						
							
							Merge pull request  #6409  from svlandeg/feature/trf-docs  
						
						
						
					 
					
						2020-12-08 06:32:10 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							82e88f0e3b 
							
						 
					 
					
						
						
							
							Merge pull request  #6379  from svlandeg/fix/labels-constructor  
						
						
						
					 
					
						2020-12-08 06:29:56 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							636be3c791 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/trf-docs  
						
						
						
					 
					
						2020-11-19 14:15:35 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							165993d8e5 
							
						 
					 
					
						
						
							
							fix typo in transformer docs ( #6404 )  
						
						
						
					 
					
						2020-11-19 14:11:38 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							73fc1ed963 
							
						 
					 
					
						
						
							
							remove labels from morphologizer constructor  
						
						
						
					 
					
						2020-11-11 21:48:50 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fcd79e0655 
							
						 
					 
					
						
						
							
							remove set_morphology from docs  
						
						
						
					 
					
						2020-11-11 21:32:34 +01:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							789fb3d124 
							
						 
					 
					
						
						
							
							add docs for upstream argument of TransformerListener  
						
						
						
					 
					
						2020-11-09 21:42:58 +01:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							363ac73c72 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-11-09 12:43:26 +08:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8644ee3e3f 
							
						 
					 
					
						
						
							
							Update TIGER link and tag description ( #6344 )  
						
						
						
					 
					
						2020-11-05 09:33:00 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8ef056cf98 
							
						 
					 
					
						
						
							
							fix embed_size in Entity Linker architecture ( #6343 )  
						
						
						
					 
					
						2020-11-04 22:20:13 +01:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a4b32b9552 
							
						 
					 
					
						
						
							
							Handle missing reference values in scorer ( #6286 )  
						
						... 
						
						
						
						* Handle missing reference values in scorer
Handle missing values in reference doc during scoring where it is
possible to detect an unset state for the attribute. If no reference
docs contain annotation, `None` is returned instead of a score. `spacy
evaluate` displays `-` for missing scores and the missing scores are
saved as `None`/`null` in the metrics.
Attributes without unset states:
* `token.head`: relies on `token.dep` to recognize unset values
* `doc.cats`: unable to handle missing annotation
Additional changes:
* add optional `has_annotation` check to `score_scans` to replace
`doc.sents` hack
* update `score_token_attr_per_feat` to handle missing and empty morph
representations
* fix bug in `Doc.has_annotation` for normalization of `IS_SENT_START`
vs. `SENT_START`
* Fix import
* Update return types 
						
					 
					
						2020-11-03 15:47:18 +01:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							75a202ce65 
							
						 
					 
					
						
						
							
							TextCat updates and fixes ( #6263 )  
						
						... 
						
						
						
						* small fix in example imports
* throw error when train_corpus or dev_corpus is not a string
* small fix in custom logger example
* limit macro_auc to labels with 2 annotations
* fix typo
* also create parents of output_dir if need be
* update documentation of textcat scores
* refactor TextCatEnsemble
* fix tests for new AUC definition
* bump to 3.0.0a42
* update docs
* rename to spacy.TextCatEnsemble.v2
* spacy.TextCatEnsemble.v1 in legacy
* cleanup
* small fix
* update to 3.0.0rc2
* fix import that got lost in merge
* cursed IDE
* fix two typos 
						
					 
					
						2020-10-18 14:50:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4d99d2b94a 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-13 11:38:52 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							40276fd3be 
							
						 
					 
					
						
						
							
							update NEL docs after latest refactor  
						
						
						
					 
					
						2020-10-12 11:41:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							e50dc2c1c9 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-09 12:04:52 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							329b61ee7b 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-09 10:36:06 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d093d6343b 
							
						 
					 
					
						
						
							
							TrainablePipe ( #6213 )  
						
						... 
						
						
						
						* rename Pipe to TrainablePipe
* split functionality between Pipe and TrainablePipe
* remove unnecessary methods from certain components
* cleanup
* hasattr(component, "pipe") should be sufficient again
* remove serialization and vocab/cfg from Pipe
* unify _ensure_examples and validate_examples
* small fixes
* hasattr checks for self.cfg and self.vocab
* make is_resizable and is_trainable properties
* serialize strings.json instead of vocab
* fix KB IO + tests
* fix typos
* more typos
* _added_strings as a set
* few more tests specifically for _added_strings field
* bump to 3.0.0a36 
						
					 
					
						2020-10-08 21:33:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							064575d79d 
							
						 
					 
					
						
						
							
							Merge pull request  #6216  from svlandeg/feature/nel-initialize  
						
						
						
					 
					
						2020-10-08 11:14:12 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							43e59bb22a 
							
						 
					 
					
						
						
							
							Update docs and install extras [ci skip]  
						
						
						
					 
					
						2020-10-08 10:58:50 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							eaf5c265cb 
							
						 
					 
					
						
						
							
							set_kb method for entity_linker  
						
						
						
					 
					
						2020-10-08 10:34:01 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2fd7122074 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-06 10:31:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							568e12215d 
							
						 
					 
					
						
						
							
							Merge pull request  #6206  from svlandeg/fix/patterns-init  
						
						
						
					 
					
						2020-10-06 10:27:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							9b4cf7b0b6 
							
						 
					 
					
						
						
							
							update output of debug config command  
						
						
						
					 
					
						2020-10-06 09:47:23 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fd0f60e2bc 
							
						 
					 
					
						
						
							
							updates to data format for training and pretraining  
						
						
						
					 
					
						2020-10-06 09:28:53 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9ac39c88 
							
						 
					 
					
						
						
							
							read entity_ruler patterns with srsly.read_jsonl.v1  
						
						
						
					 
					
						2020-10-05 22:50:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1a554bdcb1 
							
						 
					 
					
						
						
							
							Update docs and docstring [ci skip]  
						
						
						
					 
					
						2020-10-05 21:55:27 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							919790cb47 
							
						 
					 
					
						
						
							
							Upd MultiHashEmbed docs  
						
						
						
					 
					
						2020-10-05 20:28:21 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							193e0d5a98 
							
						 
					 
					
						
						
							
							add docs for entity_ruler.initialize  
						
						
						
					 
					
						2020-10-05 18:04:08 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							65abd77779 
							
						 
					 
					
						
						
							
							add finish_update to Pipe  
						
						
						
					 
					
						2020-10-05 16:23:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							0f64556c04 
							
						 
					 
					
						
						
							
							Merge pull request  #6197  from svlandeg/feature/pipe-docs [ci skip]  
						
						
						
					 
					
						2020-10-05 11:55:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							52b660e9dc 
							
						 
					 
					
						
						
							
							initialize and update explanation  
						
						
						
					 
					
						2020-10-05 00:39:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c36a57e84 
							
						 
					 
					
						
						
							
							Update data augmenters ( #6196 )  
						
						... 
						
						
						
						* Draft lower-case augmenter
* Make warning a debug log
* Update lowercase augmenter, docs and tests
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-10-04 17:46:29 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							11347f34da 
							
						 
					 
					
						
						
							
							Tidy up, tests and docs  
						
						
						
					 
					
						2020-10-04 13:54:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							989c59918c 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-03 18:53:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							7c4ab7e82c 
							
						 
					 
					
						
						
							
							Fix Lemmatizer.get_lookups_config  
						
						
						
					 
					
						2020-10-03 17:16:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							dd542ec6a4 
							
						 
					 
					
						
						
							
							Fix label initialization of textcat component ( #6190 )  
						
						
						
					 
					
						2020-10-03 17:07:38 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							35d695a031 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-10-03 16:08:24 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							02247cccaf 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/small-fixes  
						
						
						
					 
					
						2020-10-02 20:48:11 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							09dcb75076 
							
						 
					 
					
						
						
							
							small UX fix for DocBin ( #6167 )  
						
						... 
						
						
						
						* add informative warning when messing up store_user_data DocBin flags
* add informative warning when messing up store_user_data DocBin flags
* cleanup test
* rename to patterns_path 
						
					 
					
						2020-10-02 15:43:32 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f0b30aedad 
							
						 
					 
					
						
						
							
							Make lemmatizers use initialize logic ( #6182 )  
						
						... 
						
						
						
						* Make lemmatizer use initialize logic and tidy up
* Fix typo
* Raise for uninitialized tables 
						
					 
					
						2020-10-02 15:42:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							df06f7a792 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 13:24:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d2aa662ab2 
							
						 
					 
					
						
						
							
							Merge pull request  #6179  from adrianeboyd/feature/token-morph-refactor-2 [ci skip]  
						
						
						
					 
					
						2020-10-02 12:10:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							32cdc1c4f4 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-02 11:38:03 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							fd09e6b140 
							
						 
					 
					
						
						
							
							Update docs for Token.morph / Token.set_morph  
						
						
						
					 
					
						2020-10-02 09:05:15 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							01c1538c72 
							
						 
					 
					
						
						
							
							Integrate file readers  
						
						
						
					 
					
						2020-10-02 01:36:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6b94cee468 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-10-02 01:11:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f2627157c8 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 17:38:17 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1328c9fd14 
							
						 
					 
					
						
						
							
							consistently use --code instead of --code-path  
						
						
						
					 
					
						2020-10-01 16:59:22 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a22215f427 
							
						 
					 
					
						
						
							
							Add FeatureExtractor from Thinc ( #6170 )  
						
						... 
						
						
						
						* move featureextractor from Thinc
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
* Update website/docs/api/architectures.md
Co-authored-by: Ines Montani <ines@ines.io>
Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-10-01 16:22:48 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0a8a124a6e 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-10-01 12:15:53 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							a103ab5f1a 
							
						 
					 
					
						
						
							
							Update augmenter lookups and docs  
						
						
						
					 
					
						2020-09-30 23:03:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							115481aca7 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-30 15:16:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9bb958fd0a 
							
						 
					 
					
						
						
							
							Fix debug data [ci skip]  
						
						
						
					 
					
						2020-09-29 23:07:11 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							604be54a5c 
							
						 
					 
					
						
						
							
							Support --code in evaluate CLI [ci skip]  
						
						
						
					 
					
						2020-09-29 21:20:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d3c63b7965 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/prepare  
						
						
						
					 
					
						2020-09-29 20:53:05 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							361f91e286 
							
						 
					 
					
						
						
							
							Merge pull request  #6135  from walterhenry/develop-proof  
						
						
						
					 
					
						2020-09-29 20:49:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b486389eec 
							
						 
					 
					
						
						
							
							Update website/docs/api/doc.md  
						
						
						
					 
					
						2020-09-29 20:48:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d7469283c5 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-29 16:59:21 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							c1c841940c 
							
						 
					 
					
						
						
							
							Merge branch 'develop-proof' of  https://github.com/walterhenry/spaCy  into develop-proof  
						
						
						
					 
					
						2020-09-29 11:47:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ff9a63bfbd 
							
						 
					 
					
						
						
							
							begin_training -> initialize  
						
						
						
					 
					
						2020-09-28 21:35:09 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							3360825e00 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						Another round of proofreading. All the API docs have been read through and I've grazed the Usage docs. 
						
					 
					
						2020-09-28 16:50:15 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a976da168c 
							
						 
					 
					
						
						
							
							Support data augmentation in Corpus ( #6155 )  
						
						... 
						
						
						
						* Support data augmentation in Corpus
* Note initial docs for data augmentation
* Add augmenter to quickstart
* Fix flake8
* Format
* Fix test
* Update spacy/tests/training/test_training.py
* Improve data augmentation arguments
* Update templates
* Move randomization out into caller
* Refactor
* Update spacy/training/augment.py
* Update spacy/tests/training/test_training.py
* Fix augment
* Fix test 
						
					 
					
						2020-09-28 03:03:27 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f29d5b9b89 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-27 18:39:38 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							009ba14aaf 
							
						 
					 
					
						
						
							
							Fix pretraining in train script ( #6143 )  
						
						... 
						
						
						
						* update pretraining API in train CLI
* bump thinc to 8.0.0a35
* bump to 3.0.0a26
* doc fixes
* small doc fix 
						
					 
					
						2020-09-25 15:47:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2aa4d65734 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-24 20:41:09 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3c062b3911 
							
						 
					 
					
						
						
							
							Add MORPH handling to Matcher ( #6107 )  
						
						... 
						
						
						
						* Add MORPH handling to Matcher
* Add `MORPH` to `Matcher` schema
* Rename `_SetMemberPredicate` to `_SetPredicate`
* Add `ISSUBSET` and `ISSUPERSET` operators to `_SetPredicate`
  * Add special handling for normalization and conversion of morph
    values into sets
  * For other attrs, `ISSUBSET` acts like `IN` and `ISSUPERSET` only
    matches for 0 or 1 values
* Update test
* Rename to IS_SUBSET and IS_SUPERSET 
						
					 
					
						2020-09-24 16:55:09 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c7eedd3534 
							
						 
					 
					
						
						
							
							updates to NEL functionality ( #6132 )  
						
						... 
						
						
						
						* NEL: read sentences and ents from reference
* fiddling with sent_start annotations
* add KB serialization test
* KB write additional file with strings.json
* score_links function to calculate NEL P/R/F
* formatting
* documentation 
						
					 
					
						2020-09-24 16:53:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							58dde293ce 
							
						 
					 
					
						
						
							
							Merge pull request  #6089  from adrianeboyd/feature/doc-ents-v3-2  
						
						
						
					 
					
						2020-09-24 14:44:42 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							74e1f192b4 
							
						 
					 
					
						
						
							
							Merge pull request  #6134  from explosion/feature/training_before_to_disk  
						
						
						
					 
					
						2020-09-24 14:44:11 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3b58a8be2b 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-24 14:32:42 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							88e54caa12 
							
						 
					 
					
						
						
							
							accuracy -> performance  
						
						
						
					 
					
						2020-09-24 14:32:35 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b92c8aae78 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into pr/6135  
						
						
						
					 
					
						2020-09-24 13:44:56 +02:00 
						 
				 
			
				
					
						
							
							
								walterhenry 
							
						 
					 
					
						
						
						
						
							
						
						
							3dd5f409ec 
							
						 
					 
					
						
						
							
							Proofreading  
						
						... 
						
						
						
						Proofread some API docs 
						
					 
					
						2020-09-24 13:15:28 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							1c63f02f99 
							
						 
					 
					
						
						
							
							Add API docs  
						
						
						
					 
					
						2020-09-24 12:51:16 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							138c8d45db 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-24 12:43:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ae51f580c1 
							
						 
					 
					
						
						
							
							Fix handling of score_weights  
						
						
						
					 
					
						2020-09-24 10:27:33 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							dd2292793f 
							
						 
					 
					
						
						
							
							'parser' instead of 'deps' for state_type  
						
						
						
					 
					
						2020-09-23 16:53:49 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							6c85fab316 
							
						 
					 
					
						
						
							
							state_type and extra_state_tokens instead of nr_feature_tokens  
						
						
						
					 
					
						2020-09-23 13:35:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							6ca06cb62c 
							
						 
					 
					
						
						
							
							Update docs and formatting [ci skip]  
						
						
						
					 
					
						2020-09-23 10:14:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							b556a10808 
							
						 
					 
					
						
						
							
							rename converts in_to_out  
						
						
						
					 
					
						2020-09-22 11:50:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f9af7d365c 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-22 09:45:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							49e80dbcac 
							
						 
					 
					
						
						
							
							Merge pull request  #6103  from explosion/chore/tidy-up-tests-docs-get-doc  
						
						
						
					 
					
						2020-09-22 09:45:04 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							5fbb8dfcbc 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into docs/various-v3-2  
						
						
						
					 
					
						2020-09-22 09:22:58 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							67fbcb3da5 
							
						 
					 
					
						
						
							
							Tidy up tests and docs  
						
						
						
					 
					
						2020-09-21 20:43:54 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							f212303729 
							
						 
					 
					
						
						
							
							Add sent_starts to Doc.__init__  
						
						... 
						
						
						
						Add sent_starts to `Doc.__init__`. Officially specify `is_sent_start`
values but also convert to and accept `sent_start` internally. 
						
					 
					
						2020-09-21 17:59:09 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							6aa91c7ca0 
							
						 
					 
					
						
						
							
							Make user_data keyword-only  
						
						
						
					 
					
						2020-09-21 16:00:06 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							bc02e86494 
							
						 
					 
					
						
						
							
							Extend Doc.__init__ with additional annotation  
						
						... 
						
						
						
						Mostly copying from `spacy.tests.util.get_doc`, add additional kwargs to
`Doc.__init__` to initialize the most common doc/token values. 
						
					 
					
						2020-09-21 13:36:24 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							3aa57ce6c9 
							
						 
					 
					
						
						
							
							Update alignment mode in Doc.char_span docs  
						
						
						
					 
					
						2020-09-21 09:07:20 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							012b3a7096 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-20 17:44:58 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							554c9a2497 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-20 12:30:53 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							39872de1f6 
							
						 
					 
					
						
						
							
							Introducing the gpu_allocator ( #6091 )  
						
						... 
						
						
						
						* rename 'use_pytorch_for_gpu_memory' to 'gpu_allocator'
* --code instead of --code-path
* update documentation
* avoid querying the "system" section directly
* add explanation of gpu_allocator to TF/PyTorch section in docs
* fix typo
* fix typo 2
* use set_gpu_allocator from thinc 8.0.0a34
* default null instead of empty string 
						
					 
					
						2020-09-19 01:17:02 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0406200a1e 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-18 15:13:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a127fa475e 
							
						 
					 
					
						
						
							
							Merge pull request  #6078  from svlandeg/fix/corpus  
						
						
						
					 
					
						2020-09-18 14:44:21 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							d32ce121be 
							
						 
					 
					
						
						
							
							Fix docs [ci skip]  
						
						
						
					 
					
						2020-09-18 13:41:12 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							1bb8b4f824 
							
						 
					 
					
						
						
							
							Merge branch 'master' into develop  
						
						
						
					 
					
						2020-09-17 17:46:20 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e3ce9f42f 
							
						 
					 
					
						
						
							
							Merge branch 'feature/init-config-pretrain' of  https://github.com/svlandeg/spaCy  into pr/6084  
						
						
						
					 
					
						2020-09-17 16:58:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3d8e010655 
							
						 
					 
					
						
						
							
							Change order  
						
						
						
					 
					
						2020-09-17 16:58:46 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c4b414b282 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						
						
					 
					
						2020-09-17 16:58:09 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							e5ceec5df0 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-09-17 16:56:20 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							127ce0c574 
							
						 
					 
					
						
						
							
							Update website/docs/api/cli.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-09-17 16:55:53 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							5fade4feb7 
							
						 
					 
					
						
						
							
							fix cli abbrev  
						
						
						
					 
					
						2020-09-17 16:15:20 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ddfc1fc146 
							
						 
					 
					
						
						
							
							add pretraining option to init config  
						
						
						
					 
					
						2020-09-17 16:05:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							130ffa5fbf 
							
						 
					 
					
						
						
							
							fix typos in docs  
						
						
						
					 
					
						2020-09-17 14:59:41 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							0c35885751 
							
						 
					 
					
						
						
							
							generalize corpora, dot notation for dev and train corpus  
						
						
						
					 
					
						2020-09-17 11:38:59 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							8cedb2f380 
							
						 
					 
					
						
						
							
							Merge branch 'fix/corpus' of  https://github.com/svlandeg/spaCy  into fix/corpus  
						
						
						
					 
					
						2020-09-17 09:27:55 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							781fae678b 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into fix/corpus  
						
						
						
					 
					
						2020-09-17 09:24:36 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							21dcf92964 
							
						 
					 
					
						
						
							
							Update website/docs/api/data-formats.md  
						
						... 
						
						
						
						Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-09-17 09:21:36 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							7e4cd7575c 
							
						 
					 
					
						
						
							
							Refactor Docs.is_ flags ( #6044 )  
						
						... 
						
						
						
						* Refactor Docs.is_ flags
* Add derived `Doc.has_annotation` method
  * `Doc.has_annotation(attr)` returns `True` for partial annotation
  * `Doc.has_annotation(attr, require_complete=True)` returns `True` for
    complete annotation
* Add deprecation warnings to `is_tagged`, `is_parsed`, `is_sentenced`
and `is_nered`
* Add `Doc._get_array_attrs()`, which returns a full list of `Doc` attrs
for use with `Doc.to_array`, `Doc.to_bytes` and `Doc.from_docs`. The
list is the `DocBin` attributes list plus `SPACY` and `LENGTH`.
Notes on `Doc.has_annotation`:
* `HEAD` is converted to `DEP` because heads don't have an unset state
* Accept `IS_SENT_START` as a synonym of `SENT_START`
Additional changes:
* Add `NORM`, `ENT_ID` and `SENT_START` to default attributes for
`DocBin`
* In `Doc.from_array()` the presence of `DEP` causes `HEAD` to override
`SENT_START`
* In `Doc.from_array()` using `attrs` other than
`Doc._get_array_attrs()` (i.e., a user's custom list rather than our
default internal list) with both `HEAD` and `SENT_START` shows a warning
that `HEAD` will override `SENT_START`
* `set_children_from_heads` does not require dependency labels to set
sentence boundaries and sets `sent_start` for all non-sentence starts to
`-1`
* Fix call to set_children_form_heads
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com> 
						
					 
					
						2020-09-17 00:14:01 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							55f8d5478e 
							
						 
					 
					
						
						
							
							fix example output  
						
						
						
					 
					
						2020-09-15 22:09:30 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							51fa929f47 
							
						 
					 
					
						
						
							
							rewrite train_corpus to corpus.train in config  
						
						
						
					 
					
						2020-09-15 21:58:04 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0edd695bf6 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-09-15 11:41:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							99549a5ace 
							
						 
					 
					
						
						
							
							Fix consistency and update docs  
						
						
						
					 
					
						2020-09-15 11:37:37 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							154752f9c2 
							
						 
					 
					
						
						
							
							Update docs and consistency [ci skip]  
						
						
						
					 
					
						2020-09-15 00:32:49 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							3216a33149 
							
						 
					 
					
						
						
							
							positive_label config for textcat ( #6062 )  
						
						... 
						
						
						
						* hook up positive_label in textcat
* unit tests
* documentation
* formatting
* tests
* fix typo
* move verify_config to after begin_training
* revert accidential commit 
						
					 
					
						2020-09-14 17:08:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9afb1d9965 
							
						 
					 
					
						
						
							
							Merge pull request  #6063  from svlandeg/feature/doc_cleanup [ci skip]  
						
						
						
					 
					
						2020-09-14 10:35:43 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							47acb45850 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-13 22:30:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e3d067a7b 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-13 19:29:06 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							744df9814a 
							
						 
					 
					
						
						
							
							define threshold for scoring textcat in TextCat config ( #6055 )  
						
						... 
						
						
						
						* define threshold for scoring textcat in TextCat config
* fix unit test and documentation 
						
					 
					
						2020-09-13 14:15:52 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c4f324d5f1 
							
						 
					 
					
						
						
							
							doc fixes  
						
						
						
					 
					
						2020-09-12 17:38:54 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							8b0dabe987 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-09-12 17:05:10 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							0b2e07215d 
							
						 
					 
					
						
						
							
							Support overwriting name on spacy package  
						
						
						
					 
					
						2020-09-11 11:38:28 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							97d99f7efa 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/doc-fixes  
						
						
						
					 
					
						2020-09-10 11:51:34 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							15bc3a37b4 
							
						 
					 
					
						
						
							
							Add --branch to project clone  
						
						
						
					 
					
						2020-09-10 11:08:15 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b7afd09d27 
							
						 
					 
					
						
						
							
							Update formatting [ci skip]  
						
						
						
					 
					
						2020-09-10 11:07:09 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							9073d99fc9 
							
						 
					 
					
						
						
							
							fix link to shape inference section  
						
						
						
					 
					
						2020-09-10 10:22:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							1955aaaa20 
							
						 
					 
					
						
						
							
							Merge pull request  #6045  from svlandeg/feature/more-layers-docs [ci skip]  
						
						
						
					 
					
						2020-09-09 21:46:40 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							2e567a47c2 
							
						 
					 
					
						
						
							
							Update docs and formatting  
						
						
						
					 
					
						2020-09-09 21:26:10 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c89e07927e 
							
						 
					 
					
						
						
							
							document individual component API pages  
						
						
						
					 
					
						2020-09-09 16:18:38 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							cb66ea7400 
							
						 
					 
					
						
						
							
							Remove simple_ner code ( #6041 )  
						
						... 
						
						
						
						* remove simple_ner code
* remove unused _biluo and _iob files 
						
					 
					
						2020-09-09 16:11:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							a8aa9a8068 
							
						 
					 
					
						
						
							
							document Pipe API details, crossreferences etc  
						
						
						
					 
					
						2020-09-09 15:56:27 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							39aa740777 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/more-layers-docs  
						
						
						
					 
					
						2020-09-09 11:59:34 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8e7557656f 
							
						 
					 
					
						
						
							
							Renaming gold & annotation_setter ( #6042 )  
						
						... 
						
						
						
						* version bump to 3.0.0a16
* rename "gold" folder to "training"
* rename 'annotation_setter' to 'set_extra_annotations'
* formatting 
						
					 
					
						2020-09-09 10:31:03 +02:00 
						 
				 
			
				
					
						
							
							
								Marek Grzenkowicz 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							a26f864ed3 
							
						 
					 
					
						
						
							
							Clarify how to choose pretrained weights files ( closes   #6027 ) [ci skip] ( #6039 )  
						
						
						
					 
					
						2020-09-08 21:13:50 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							bd8f9b188b 
							
						 
					 
					
						
						
							
							small fixes  
						
						
						
					 
					
						2020-09-08 17:24:36 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							157caf4dfa 
							
						 
					 
					
						
						
							
							WIP: update docs [ci skip]  
						
						
						
					 
					
						2020-09-04 16:30:31 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f174c7b1f3 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into pr/6018  
						
						
						
					 
					
						2020-09-04 15:54:49 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							864a697e63 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into master-tmp  
						
						
						
					 
					
						2020-09-04 13:15:36 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							b927893309 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into feature/dependency-matcher-v3  
						
						
						
					 
					
						2020-09-04 13:03:30 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							4daf138136 
							
						 
					 
					
						
						
							
							Fix alphabetic ordering [ci skip]  
						
						
						
					 
					
						2020-09-03 23:01:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							23b7d9cfa3 
							
						 
					 
					
						
						
							
							Prefix span getters  
						
						
						
					 
					
						2020-09-03 17:37:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							5afe6447cd 
							
						 
					 
					
						
						
							
							registry.assets -> registry.misc  
						
						
						
					 
					
						2020-09-03 17:31:14 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c063e55eb7 
							
						 
					 
					
						
						
							
							Add prefix to batchers  
						
						
						
					 
					
						2020-09-03 17:30:41 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							804f120361 
							
						 
					 
					
						
						
							
							Don't use registered function version in title  
						
						
						
					 
					
						2020-09-03 17:29:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c53b1433b9 
							
						 
					 
					
						
						
							
							Adjust more arguments [ci skip]  
						
						
						
					 
					
						2020-09-03 17:12:24 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							25a595dc10 
							
						 
					 
					
						
						
							
							Fix typos and wording [ci skip]  
						
						
						
					 
					
						2020-09-03 16:37:45 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							b5a0657fd6 
							
						 
					 
					
						
						
							
							"model" terminology consistency in docs  
						
						
						
					 
					
						2020-09-03 13:13:03 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							960d9cfadc 
							
						 
					 
					
						
						
							
							Officially support DependencyMatcher  
						
						... 
						
						
						
						Add official support for the `DependencyMatcher`. Redesign the pattern
specification. Fix and extend operator implementations. Update API docs
and add usage docs.
Patterns
--------
Refactor pattern structure to:
```
{
  "LEFT_ID": str,
  "REL_OP": str,
  "RIGHT_ID": str,
  "RIGHT_ATTRS": dict,
}
```
The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all
subsequent nodes contain all four keys.
New operators
-------------
Because of the way patterns are constructed from left to right, it's
helpful to have `follows` operators along with `precedes` operators. Add
operators for simple precedes / follows alongside immediate precedes /
follows.
* `.*`: precedes
* `;`: immediately follows
* `;*`: follows
Operator fixes
--------------
* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`,
  all sibling operators)
* Linear precedence operators do not match nodes outside the same parse
Additional fixes
----------------
* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling 
						
					 
					
						2020-09-02 17:45:29 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							bbaea530f6 
							
						 
					 
					
						
						
							
							sublayers paragraph  
						
						
						
					 
					
						2020-09-02 17:36:22 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							9af82f3f11 
							
						 
					 
					
						
						
							
							Merge pull request  #6003  from explosion/feature/matcher-as-spans  
						
						
						
					 
					
						2020-08-31 17:50:56 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3929431af1 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-31 17:06:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							add9de5487 
							
						 
					 
					
						
						
							
							Deprecate (Phrase)Matcher.pipe  
						
						
						
					 
					
						2020-08-31 17:01:24 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2c3b64a567 
							
						 
					 
					
						
						
							
							console logging example  
						
						
						
					 
					
						2020-08-31 16:56:13 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							bca6bf8dda 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-31 16:39:53 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							db9f8896f5 
							
						 
					 
					
						
						
							
							Add docs [ci skip]  
						
						
						
					 
					
						2020-08-31 16:10:41 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							fe6c08218e 
							
						 
					 
					
						
						
							
							fixes  
						
						
						
					 
					
						2020-08-31 14:51:49 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							0e0abb0378 
							
						 
					 
					
						
						
							
							fix  
						
						
						
					 
					
						2020-08-31 14:50:29 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							56ba691ecd 
							
						 
					 
					
						
						
							
							small fixes  
						
						
						
					 
					
						2020-08-31 14:46:00 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							e47ea88aeb 
							
						 
					 
					
						
						
							
							revert annotations refactor  
						
						
						
					 
					
						2020-08-31 14:40:55 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							2c90a06fee 
							
						 
					 
					
						
						
							
							some more information about the loggers  
						
						
						
					 
					
						2020-08-31 13:43:17 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							c18eb63483 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs  
						
						... 
						
						
						
						# Conflicts:
#	website/docs/usage/embeddings-transformers.md 
						
					 
					
						2020-08-31 13:21:36 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ec14744ee4 
							
						 
					 
					
						
						
							
							Rename Transformer listener ( #6001 )  
						
						... 
						
						
						
						* rename to spacy-transformers.TransformerListener
* add some more tok2vec tests
* use select_pipes
* fix docs - annotation setter was not changed in the end 
						
					 
					
						2020-08-31 12:41:39 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							9b86312bab 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-29 18:43:19 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							870774f475 
							
						 
					 
					
						
						
							
							Merge branch 'develop' into docs/morph-usage-v3  
						
						
						
					 
					
						2020-08-29 16:00:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							45f46a5c85 
							
						 
					 
					
						
						
							
							Merge pull request  #5993  from explosion/feature/disabled-components  
						
						
						
					 
					
						2020-08-29 15:58:41 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							f9ed31a757 
							
						 
					 
					
						
						
							
							Update usage docs for lemmatization and morphology  
						
						
						
					 
					
						2020-08-29 15:56:50 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							450bf806b0 
							
						 
					 
					
						
						
							
							Merge pull request  #5991  from adrianeboyd/docs/sent-usage-v3  
						
						... 
						
						
						
						Update sentence segmentation usage docs 
						
					 
					
						2020-08-29 12:40:06 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							66d76f5126 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-08-29 12:36:05 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							5230529de2 
							
						 
					 
					
						
						
							
							add loggers registry & logger docs sections  
						
						
						
					 
					
						2020-08-28 21:44:04 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
						
						
							
						
						
							48df50533d 
							
						 
					 
					
						
						
							
							Update sentence segmentation usage docs  
						
						... 
						
						
						
						Update sentence segmentation usage docs to incorporate `senter`. 
						
					 
					
						2020-08-28 10:58:16 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							72a87095d9 
							
						 
					 
					
						
						
							
							add loggers registry  
						
						
						
					 
					
						2020-08-27 20:26:28 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							aa9e0c9c39 
							
						 
					 
					
						
						
							
							small fix  
						
						
						
					 
					
						2020-08-27 19:56:52 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							8cde6ccb7d 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs  
						
						
						
					 
					
						2020-08-27 19:56:09 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							556e975a30 
							
						 
					 
					
						
						
							
							various fixes  
						
						
						
					 
					
						2020-08-27 19:24:44 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							ff4175e839 
							
						 
					 
					
						
						
							
							Add more info to debug config  
						
						
						
					 
					
						2020-08-27 18:17:58 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							559b65f2e0 
							
						 
					 
					
						
						
							
							adjust references to null_annotation_setter to trfdata_setter  
						
						
						
					 
					
						2020-08-27 09:43:32 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							696f167478 
							
						 
					 
					
						
						
							
							Add diff example to docs [ci skip]  
						
						
						
					 
					
						2020-08-26 15:57:54 +02:00 
						 
				 
			
				
					
						
							
							
								Adriane Boyd 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							90d88729e0 
							
						 
					 
					
						
						
							
							Add AttributeRuler.score ( #5963 )  
						
						... 
						
						
						
						* Add AttributeRuler.score
Add scoring for TAG / POS / MORPH / LEMMA if these are present in the
assigned token attributes.
Add default score weights (that don't really make a lot of sense) so
that the scores are in the default config in some form.
* Update docs 
						
					 
					
						2020-08-26 15:39:30 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ec069627fe 
							
						 
					 
					
						
						
							
							rename to TransformerListener  
						
						
						
					 
					
						2020-08-26 13:31:01 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							627617a079 
							
						 
					 
					
						
						
							
							Tidy up and add docs [ci skip]  
						
						
						
					 
					
						2020-08-26 13:24:55 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							15902c5aa2 
							
						 
					 
					
						
						
							
							fix link  
						
						
						
					 
					
						2020-08-26 11:51:57 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							feb86d5206 
							
						 
					 
					
						
						
							
							clarify default  
						
						
						
					 
					
						2020-08-26 11:21:30 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							8ac5ef1284 
							
						 
					 
					
						
						
							
							Update docs  
						
						
						
					 
					
						2020-08-25 11:54:37 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							e559867605 
							
						 
					 
					
						
						
							
							Allow spacy project to push and pull to/from remote storage ( #5949 )  
						
						... 
						
						
						
						* Add utils for working with remote storage
* WIP add remote_cache for project
* WIP add push and pull commands
* Use pathy in remote_cache
* Updarte util
* Update remote_cache
* Update util
* Update project assets
* Update pull script
* Update push script
* Fix type annotation in util
* Work on remote storage
* Remove site and env hash
* Fix imports
* Fix type annotation
* Require pathy
* Require pathy
* Fix import
* Add a util to handle project variable substitution
* Import push and pull commands
* Fix pull command
* Fix push command
* Fix tarfile in remote_storage
* Improve printing
* Fiddle with status messages
* Set version to v3.0.0a9
* Draft docs for spacy project remote storages
* Update docs [ci skip]
* Use Thinc config to simplify and unify template variables
* Auto-format
* Don't import Pathy globally for now
Causes slow and annoying Google Cloud warning
* Tidy up test
* Tidy up and update tests
* Update to latest Thinc
* Update docs
* variables -> vars
* Update docs [ci skip]
* Update docs [ci skip]
Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-08-23 18:32:09 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							c7c9b0451f 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-22 13:52:52 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							71aeae89c5 
							
						 
					 
					
						
						
							
							Merge pull request  #5948  from svlandeg/feature/docs-docs-docs [ci skip]  
						
						
						
					 
					
						2020-08-22 12:18:47 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							f102164a1f 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-21 19:34:06 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							1b7cfa7347 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/docs-docs-docs  
						
						
						
					 
					
						2020-08-21 18:36:18 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							dc98f69b57 
							
						 
					 
					
						
						
							
							alphabetize registries  
						
						
						
					 
					
						2020-08-21 18:10:21 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							518a1f97f3 
							
						 
					 
					
						
						
							
							remove outdated TODO's  
						
						
						
					 
					
						2020-08-21 17:55:15 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							e92bd6e1c1 
							
						 
					 
					
						
						
							
							alphabetize training lists  
						
						
						
					 
					
						2020-08-21 17:42:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							74cb6d39d0 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-21 16:11:38 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							f5bcc10268 
							
						 
					 
					
						
						
							
							Update architectures  
						
						
						
					 
					
						2020-08-21 15:34:54 +02:00 
						 
				 
			
				
					
						
							
							
								Matthew Honnibal 
							
						 
					 
					
						
						
						
						
							
						
						
							7ed8f4504b 
							
						 
					 
					
						
						
							
							Update API docs for architectures  
						
						
						
					 
					
						2020-08-21 15:22:19 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							52bd3a8b48 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-21 13:22:59 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							e60442d83a 
							
						 
					 
					
						
						
							
							Adjust label casing in displaCy NER visualizer ( resolves   #4866 )  
						
						... 
						
						
						
						- Accept any case for label names in ents and colors option, even if actual predicted label uses different casing
- Don't text-transform: uppercase visually, if it's important to users that the label is represented as-is in the UI 
						
					 
					
						2020-08-21 11:51:31 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							04e4d59235 
							
						 
					 
					
						
						
							
							Update docs [ci skip]  
						
						
						
					 
					
						2020-08-20 16:17:25 +02:00 
						 
				 
			
				
					
						
							
							
								Sofie Van Landeghem 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							410b54e10e 
							
						 
					 
					
						
						
							
							Update website/docs/api/data-formats.md  
						
						... 
						
						
						
						Co-authored-by: Ines Montani <ines@ines.io> 
						
					 
					
						2020-08-20 11:15:34 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							ae719b354f 
							
						 
					 
					
						
						
							
							fix typos  
						
						
						
					 
					
						2020-08-20 10:20:40 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							f728c00cbb 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/update-more-docs  
						
						... 
						
						
						
						# Conflicts:
#	website/docs/api/data-formats.md 
						
					 
					
						2020-08-20 10:02:13 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							229033831a 
							
						 
					 
					
						
						
							
							add explanation of raw_text  
						
						
						
					 
					
						2020-08-20 10:00:45 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ea6640ea72 
							
						 
					 
					
						
						
							
							Merge pull request  #5939  from explosion/feature/thinc-v8.0.0a28  
						
						... 
						
						
						
						Update Thinc and config variables 
						
					 
					
						2020-08-19 21:14:36 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							09f3cfc985 
							
						 
					 
					
						
						
							
							add version  
						
						
						
					 
					
						2020-08-19 19:58:45 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							7d9f00bdbf 
							
						 
					 
					
						
						
							
							waltzing schedule  
						
						
						
					 
					
						2020-08-19 19:53:00 +02:00 
						 
				 
			
				
					
						
							
							
								Ines Montani 
							
						 
					 
					
						
						
						
						
							
						
						
							3dd390b1a1 
							
						 
					 
					
						
						
							
							Update Thinc and config variables  
						
						
						
					 
					
						2020-08-19 19:46:12 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							85b39639e1 
							
						 
					 
					
						
						
							
							small fix  
						
						
						
					 
					
						2020-08-19 19:17:36 +02:00 
						 
				 
			
				
					
						
							
							
								svlandeg 
							
						 
					 
					
						
						
						
						
							
						
						
							169b5bcda0 
							
						 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/develop' into feature/update-docs  
						
						... 
						
						
						
						# Conflicts:
#	website/docs/usage/training.md 
						
					 
					
						2020-08-19 17:58:25 +02:00