diff --git a/website/docs/api/large-language-models.mdx b/website/docs/api/large-language-models.mdx index c7e5a7393..7a738f440 100644 --- a/website/docs/api/large-language-models.mdx +++ b/website/docs/api/large-language-models.mdx @@ -236,18 +236,15 @@ objects. This depends on the return type of the [model](#models). | `responses` | The generated prompts. ~~Iterable[Any]~~ | | **RETURNS** | The annotated documents. ~~Iterable[Doc]~~ | -### Summarization {id="summarization"} +### Raw prompting {id="raw"} -A summarization task takes a document as input and generates a summary that is -stored in an extension attribute. - -#### spacy.Raw.v1 {id="raw-v1"} - -Different to all other tasks `spacy.Raw.v1` doesn't provide a specific prompt, +Different to all other tasks `spacy.Raw.vX` doesn't provide a specific prompt, wrapping doc data, to the model. Instead it instructs the model to reply to the doc content. This is handy for use cases like question answering (where each doc contains one question) or if you want include customized prompts for each doc. +#### spacy.Raw.v1 {id="raw-v1"} + Note that since this task may request arbitrary information, it doesn't do any parsing per se - the model response is stored in a custom `Doc` attribute (i. e. can be accessed via `doc._.{field}`). @@ -300,6 +297,11 @@ field = "llm_reply" path = "raw_examples.yml" ``` +### Summarization {id="summarization"} + +A summarization task takes a document as input and generates a summary that is +stored in an extension attribute. + #### spacy.Summarization.v1 {id="summarization-v1"} The `spacy.Summarization.v1` task supports both zero-shot and few-shot