mirror of https://github.com/explosion/spaCy.git (synced 2024-12-25 17:36:30 +03:00)
Adjust example
Not actually supported in this alignment interpretation
parent 1ea472468a
commit 9f02e3c027
@@ -967,9 +967,8 @@ attributes. For details, see the respective usage pages.
 spaCy's tokenization is non-destructive and uses language-specific rules
 optimized for compatibility with treebank annotations. Other tools and resources
-can sometimes tokenize things differently – for example, `"I'm"` → `["I", "am"]`
-instead of `["I", "'m"]`, or `"Obama's"` → `["Obama", "'", "s"]` instead of
-`["Obama", "'s"]`.
+can sometimes tokenize things differently – for example, `"I'm"` →
+`["I", "'", "m"]` instead of `["I", "'m"]`.
 
 In cases like that, you often want to align the tokenization so that you can
 merge annotations from different sources together, or take vectors predicted by
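
To illustrate the kind of alignment the changed paragraph describes, here is a minimal pure-Python sketch. It is not spaCy's implementation: the helper names `char_to_token` and `align` are hypothetical, and the sketch assumes both tokenizations contain exactly the same characters with whitespace already excluded. That assumption is also why a pair like `["I", "am"]` and `["I", "'m"]` cannot be aligned this way, while `["I", "'", "m"]` and `["I", "'m"]` can.

```python
def char_to_token(tokens):
    """Map each character position in the joined text to its token index."""
    mapping = []
    for i, tok in enumerate(tokens):
        mapping.extend([i] * len(tok))
    return mapping


def align(tokens_a, tokens_b):
    """For each token in tokens_a, list the indices of overlapping tokens in tokens_b.

    Hypothetical helper for illustration; assumes both token lists spell out
    the same character sequence when joined.
    """
    if "".join(tokens_a) != "".join(tokens_b):
        raise ValueError("tokenizations must cover the same characters")
    b_index = char_to_token(tokens_b)
    alignment = []
    pos = 0
    for tok in tokens_a:
        span = b_index[pos:pos + len(tok)]
        # dict.fromkeys drops duplicate indices while keeping their order
        alignment.append(list(dict.fromkeys(span)))
        pos += len(tok)
    return alignment


other_tokens = ["I", "'", "m"]
spacy_tokens = ["I", "'m"]
a2b = align(other_tokens, spacy_tokens)  # [[0], [1], [1]]
b2a = align(spacy_tokens, other_tokens)  # [[0], [1, 2]]
```

Here `a2b` says that the second and third tokens of the other tokenization both fall inside spaCy's token `"'m"`, and `b2a` gives the reverse view. Calling `align(["I", "am"], ["I", "'m"])` raises `ValueError`, because the two token lists no longer spell the same text.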