From e9a4011f77c8e9948723d0db948d3a55101ec3ba Mon Sep 17 00:00:00 2001 From: kadarakos Date: Mon, 23 May 2022 09:15:51 +0200 Subject: [PATCH] oov confusion fix (#10828) --- website/docs/models/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/website/docs/models/index.md b/website/docs/models/index.md index 9ee96528e..203555651 100644 --- a/website/docs/models/index.md +++ b/website/docs/models/index.md @@ -115,7 +115,7 @@ The Finnish, Korean and Swedish `md` and `lg` pipelines use running a trained pipeline on texts and working with [`Doc`](/api/doc) objects, you shouldn't notice any difference with floret vectors. With floret vectors no tokens are out-of-vocabulary, so [`Token.is_oov`](/api/token#attributes) will -return `True` for all tokens. +return `False` for all tokens. If you access vectors directly for similarity comparisons, there are a few differences because floret vectors don't include a fixed word list like the