mirror of
https://github.com/explosion/spaCy.git
synced 2025-02-04 21:50:35 +03:00
oov confusion fix (#10828)
This commit is contained in:
parent
4fb1809c72
commit
a3814ee739
|
@ -115,7 +115,7 @@ The Finnish, Korean and Swedish `md` and `lg` pipelines use
|
||||||
running a trained pipeline on texts and working with [`Doc`](/api/doc) objects,
|
running a trained pipeline on texts and working with [`Doc`](/api/doc) objects,
|
||||||
you shouldn't notice any difference with floret vectors. With floret vectors no
|
you shouldn't notice any difference with floret vectors. With floret vectors no
|
||||||
tokens are out-of-vocabulary, so [`Token.is_oov`](/api/token#attributes) will
|
tokens are out-of-vocabulary, so [`Token.is_oov`](/api/token#attributes) will
|
||||||
return `True` for all tokens.
|
return `False` for all tokens.
|
||||||
|
|
||||||
If you access vectors directly for similarity comparisons, there are a few
|
If you access vectors directly for similarity comparisons, there are a few
|
||||||
differences because floret vectors don't include a fixed word list like the
|
differences because floret vectors don't include a fixed word list like the
|
||||||
|
|
Loading…
Reference in New Issue
Block a user