Update processing-pipelines.md to mention method for doc metadata #7480

langdonholmes · 2021-03-18T00:24:33Z

Description

Under "things to try," inform users that they can keep per-text metadata when calling nlp.pipe(data, as_tuples=True), where data is a list of (text, context) tuples.

Link to a new example on the attributes page detailing the following:

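import spacy
from spacy.tokens import Doc

# Register the "meta" extension once so that doc._.meta can be assigned
Doc.set_extension("meta", default=None)
nlp = spacy.blank("en")  # assumption: any pipeline would do here

data = [
    ("Some text to process", {"meta": "foo"}),
    ("And more text...", {"meta": "bar"}),
]

for doc, context in nlp.pipe(data, as_tuples=True):
    # Copy each text's context onto the Doc it produced
    doc._.meta = context["meta"]

Adapted from one of Ines' comments on StackOverflow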
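
For readers who want to see the pairing idea in isolation, here is a minimal stdlib-only sketch of what as_tuples=True does conceptually: the contexts skip processing and are re-paired, in order, with the processed results. ToyDoc, toy_pipe and toy_pipe_as_tuples are hypothetical stand-ins invented for this illustration. They are not part of spaCy, and this is not meant to describe spaCy's actual implementation.

```python
from dataclasses import dataclass
from itertools import tee
from typing import Any, Dict, Iterable, Iterator, Tuple


@dataclass
class ToyDoc:
    """Hypothetical stand-in for a processed document (not spaCy's Doc)."""
    text: str
    meta: Any = None


def toy_pipe(texts: Iterable[str]) -> Iterator[ToyDoc]:
    """Hypothetical stand-in for a streaming pipeline: one doc per text, in order."""
    for text in texts:
        yield ToyDoc(text)


def toy_pipe_as_tuples(
    pairs: Iterable[Tuple[str, Dict[str, Any]]]
) -> Iterator[Tuple[ToyDoc, Dict[str, Any]]]:
    """Yield (doc, context) pairs from (text, context) pairs."""
    # Split the input stream in two: one copy feeds the texts to the
    # pipeline, the other carries the contexts around it untouched.
    pairs_a, pairs_b = tee(pairs)
    texts = (text for text, _ in pairs_a)
    contexts = (context for _, context in pairs_b)
    # The pipeline preserves order, so zip re-pairs each doc with its context.
    yield from zip(toy_pipe(texts), contexts)


data = [
    ("Some text to process", {"meta": "foo"}),
    ("And more text...", {"meta": "bar"}),
]

docs = []
for doc, context in toy_pipe_as_tuples(data):
    doc.meta = context["meta"]  # attach the metadata to the processed doc
    docs.append(doc)

print([(d.text, d.meta) for d in docs])
# [('Some text to process', 'foo'), ('And more text...', 'bar')]
```

The point of the sketch is that the metadata never has to pass through the processing step itself; it only has to stay aligned with the stream of results.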

Types of change

Update the docs.

Checklist

I have submitted the spaCy Contributor Agreement.
I ran the tests, and all new and existing tests passed.
My changes don't require a change to the documentation, or if they do, I've added all required information.


Update the attributes section with example of how extensions can be used to store metadata.

adrianeboyd · 2021-04-01T13:17:10Z

Hi, thanks for this contribution! I think it's a great idea to have an example for as_tuples in the docs, but it's a bit out-of-place in the proposed locations.

Could you add it instead as a short paragraph + executable example at the end of the "Processing text" section? The main focus of the example would be on using as_tuples rather than the custom extension itself, since those are introduced in a separate section, but you can demonstrate how as_tuples is useful by assigning the context info to the custom extension.

Made as_tuples example executable and relocated to the end of the "Processing Text" section.

langdonholmes · 2021-04-08T18:26:52Z

Excellent suggestions. I think I have implemented them and updated the pull request, but I am a bit new to Git so I'm not 100% sure I did that correctly. I am happy to make any additional changes or a new pull request as needed.

Removed extra line

adrianeboyd · 2021-04-13T12:55:09Z

Thanks for the updates! I reformatted and rephrased a bit and I'll leave this open for a little while for further feedback from others...

svlandeg · 2021-04-19T09:57:53Z

Looks good to me! I'll go ahead and merge this and push the update to the docs :-)

* Update processing-pipelines.md

  Under "things to try," inform users they can save metadata when using nlp.pipe(foobar, as_tuples=True)

  Link to a new example on the attributes page detailing the following:

  > ```
  > data = [
  >   ("Some text to process", {"meta": "foo"}),
  >   ("And more text...", {"meta": "bar"})
  > ]
  >
  > for doc, context in nlp.pipe(data, as_tuples=True):
  >     # Let's assume you have a "meta" extension registered on the Doc
  >     doc._.meta = context["meta"]
  > ```

  from https://stackoverflow.com/questions/57058798/make-spacy-nlp-pipe-process-tuples-of-text-and-additional-information-to-add-as

* Updating the attributes section

  Update the attributes section with example of how extensions can be used to store metadata.

* Update processing-pipelines.md

* Update processing-pipelines.md

  Made as_tuples example executable and relocated to the end of the "Processing Text" section.

* Update processing-pipelines.md

* Update processing-pipelines.md

  Removed extra line

* Reformat and rephrase

Co-authored-by: Adriane Boyd

langdonholmes added 3 commits March 17, 2021 16:57

Updating the attributes section

dffdbe5

Update the attributes section with example of how extensions can be used to store metadata.

Update processing-pipelines.md

783c15e

svlandeg added the docs Documentation and website label Mar 18, 2021

langdonholmes added 2 commits April 8, 2021 11:05

Update processing-pipelines.md

cd25dd6

Made as_tuples example executable and relocated to the end of the "Processing Text" section.

Update processing-pipelines.md

3b79ec6

langdonholmes and others added 3 commits April 8, 2021 11:31

Update processing-pipelines.md

942a12f

Removed extra line

Merge branch 'master' into patch-2

d484660

Reformat and rephrase

d72dc8d

svlandeg merged commit df541c6 into explosion:master Apr 19, 2021
