Clarify Token.pos as UPOS (#5419)

adrianeboyd 2020-05-08 10:36:25 +02:00 committed by Adriane Boyd
parent cafe94ee04
commit b3969c1479
2 changed files with 3 additions and 3 deletions

@@ -461,8 +461,8 @@ The L2 norm of the token's vector representation.
 | `like_email` | bool | Does the token resemble an email address? |
 | `is_oov` | bool | Is the token out-of-vocabulary? |
 | `is_stop` | bool | Is the token part of a "stop list"? |
-| `pos` | int | Coarse-grained part-of-speech. |
-| `pos_` | unicode | Coarse-grained part-of-speech. |
+| `pos` | int | Coarse-grained part-of-speech from the [Universal POS tag set](https://universaldependencies.org/docs/u/pos/). |
+| `pos_` | unicode | Coarse-grained part-of-speech from the [Universal POS tag set](https://universaldependencies.org/docs/u/pos/). |
 | `tag` | int | Fine-grained part-of-speech. |
 | `tag_` | unicode | Fine-grained part-of-speech. |
 | `dep` | int | Syntactic dependency relation. |

@@ -25,7 +25,7 @@ for token in doc:
 > - **Text:** The original word text.
 > - **Lemma:** The base form of the word.
-> - **POS:** The simple part-of-speech tag.
+> - **POS:** The simple [UPOS](https://universaldependencies.org/docs/u/pos/) part-of-speech tag.
 > - **Tag:** The detailed part-of-speech tag.
 > - **Dep:** Syntactic dependency, i.e. the relation between tokens.
 > - **Shape:** The word shape capitalization, punctuation, digits.
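
The distinction this change documents (coarse-grained `pos_` values come from the Universal POS tag set, while fine-grained `tag_` values do not) can be illustrated with a small standalone sketch. It does not call spaCy: `UPOS_TAGS`, `is_upos`, and the sample rows are hypothetical names and hand-written values used only for illustration, not output from any model.

```python
# The 17 coarse-grained labels defined by the Universal POS tag set.
UPOS_TAGS = frozenset({
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
})


def is_upos(label: str) -> bool:
    """Return True if `label` is one of the 17 UPOS labels."""
    return label in UPOS_TAGS


# Hand-written (text, pos_, tag_) rows, assumed for illustration only.
rows = [
    ("Apple", "PROPN", "NNP"),
    ("is", "AUX", "VBZ"),
    ("looking", "VERB", "VBG"),
]

for text, pos_, tag_ in rows:
    # The coarse-grained value is a UPOS label; the fine-grained one is
    # treebank-specific and generally is not.
    print(f"{text:10} pos_={pos_:6} upos={is_upos(pos_)}  "
          f"tag_={tag_:4} upos={is_upos(tag_)}")
```

A check like `is_upos(token.pos_)` could be applied to real tokens in the same way, assuming the loaded pipeline assigns coarse-grained tags at all.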