oov confusion fix (#10828)

kadarakos 2022-05-23 09:15:51 +02:00 committed by GitHub
parent 4fb1809c72
commit a3814ee739

@@ -115,7 +115,7 @@ The Finnish, Korean and Swedish `md` and `lg` pipelines use
running a trained pipeline on texts and working with [`Doc`](/api/doc) objects,
you shouldn't notice any difference with floret vectors. With floret vectors no
tokens are out-of-vocabulary, so [`Token.is_oov`](/api/token#attributes) will
-return `True` for all tokens.
+return `False` for all tokens.
If you access vectors directly for similarity comparisons, there are a few
differences because floret vectors don't include a fixed word list like the