spaCy/website/docs/usage
langdonholmes df541c6b5e
Update processing-pipelines.md to mention method for doc metadata (#7480)
* Update processing-pipelines.md

Under "things to try," inform users they can save metadata when using nlp.pipe(foobar, as_tuples=True)

Link to a new example on the attributes page detailing the following:

> ```
> data = [
>   ("Some text to process", {"meta": "foo"}),
>   ("And more text...", {"meta": "bar"})
> ]
> 
> for doc, context in nlp.pipe(data, as_tuples=True):
>     # Let's assume you have a "meta" extension registered on the Doc
>     doc._.meta = context["meta"]
> ```

from https://stackoverflow.com/questions/57058798/make-spacy-nlp-pipe-process-tuples-of-text-and-additional-information-to-add-as

* Updating the attributes section

Update the attributes section with example of how extensions can be used to store metadata.

* Update processing-pipelines.md

* Update processing-pipelines.md

Made as_tuples example executable and relocated to the end of the "Processing Text" section.

* Update processing-pipelines.md

* Update processing-pipelines.md

Removed extra line

* Reformat and rephrase

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-04-19 11:58:12 +02:00
..
101 Update _training.md - Fix broken link on backpropagation (#7431) 2021-03-15 09:21:35 +01:00
_benchmarks-models.md final 3.0 benchmark numbers 2021-02-09 21:28:33 +01:00
embeddings-transformers.md Rephrase transformers PyTorch instructions 2021-01-29 13:36:56 +01:00
facts-figures.md final 3.0 benchmark numbers 2021-02-09 21:28:33 +01:00
index.md Add support for CUDA 11.2 (#7583) 2021-03-30 09:47:33 +02:00
layers-architectures.md consistently use registry as callable 2021-03-02 17:56:28 +01:00
linguistic-features.md Update Tokenizer.explain with special matches (#7749) 2021-04-19 19:08:20 +10:00
models.md fix typo in models.md (#7157) 2021-02-22 09:00:38 +01:00
processing-pipelines.md Update processing-pipelines.md to mention method for doc metadata (#7480) 2021-04-19 11:58:12 +02:00
projects.md W&B integration: Optional support for dataset and model checkpoint logging and versioning (#7429) 2021-04-01 19:36:23 +02:00
rule-based-matching.md Merge branch 'master' into pr/6444 2020-12-09 11:09:40 +11:00
saving-loading.md Update saving/loading example 2021-03-18 16:56:10 +01:00
spacy-101.md Merge branch 'master' into develop 2020-12-11 13:44:41 +11:00
training.md Add usage docs for streamed train corpora (#7693) 2021-04-09 16:15:38 +02:00
v2-1.md Merge branch 'spacy.io' [ci skip] 2021-03-06 17:38:54 +11:00
v2-2.md Update v3 docs [ci skip] 2020-07-05 16:11:16 +02:00
v2-3.md Merge branch 'spacy.io' [ci skip] 2021-03-06 17:38:54 +11:00
v2.md Merge branch 'spacy.io' [ci skip] 2021-03-06 17:38:54 +11:00
v3.md Terminology: deprecated vs obsolete (#7621) 2021-04-12 14:37:00 +02:00
visualizers.md Proofread remarks 2020-10-19 11:11:32 +02:00