mirror of https://github.com/explosion/spaCy.git
synced 2025-04-21 17:41:59 +03:00
add an anchor to the trainable lemmatizer section
parent 6fc153a266
commit 005c20b463
@@ -113,7 +113,7 @@ print(doc[2].morph) # 'Case=Nom|Person=2|PronType=Prs'
 print(doc[2].pos_) # 'PRON'
 ```
 
-## Lemmatization {id="lemmatization",model="lemmatizer",version="3"}
+## Lemmatization {id="lemmatization",version="3"}
 
 spaCy provides two pipeline components for lemmatization:
 
@@ -194,7 +194,7 @@ information, without consulting the context of the token. The rule-based
 lemmatizer also accepts list-based exception files. For English, these are
 acquired from [WordNet](https://wordnet.princeton.edu/).
 
-### Trainable lemmatizer
+### Trainable lemmatizer {id="lemmatizer-train",model="lemmatizer"}
 
 The [`EditTreeLemmatizer`](/api/edittreelemmatizer) can learn form-to-lemma
 transformations from a training corpus that includes lemma annotations. This