spaCy/website/docs/api/morphology.md
2020-07-29 11:36:42 +02:00

5.9 KiB

title tag source
Morphology class spacy/morphology.pyx

Store the possible morphological analyses for a language, and index them by hash. To save space on each token, tokens only know the hash of their morphological analysis, so queries of morphological attributes are delegated to this class.

Morphology.__init__

Create a Morphology object using the tag map, lemmatizer and exceptions.

Example

from spacy.morphology import Morphology

morphology = Morphology(strings, tag_map, lemmatizer)
Name Type Description
strings StringStore The string store.
tag_map Dict[str, Dict] The tag map.
lemmatizer Lemmatizer The lemmatizer.
exc Dict[str, Dict] A dictionary of exceptions in the format {tag: {orth: {"POS": "X", "Feat1": "Val1, "Feat2": "Val2", ...}

Morphology.add

Insert a morphological analysis in the morphology table, if not already present. The morphological analysis may be provided in the UD FEATS format as a string or in the tag map dictionary format. Returns the hash of the new analysis.

Example

feats = "Feat1=Val1|Feat2=Val2"
hash = nlp.vocab.morphology.add(feats)
assert hash == nlp.vocab.strings[feats]
Name Type Description
features Union[Dict, str] The morphological features.

Morphology.get

Example

feats = "Feat1=Val1|Feat2=Val2"
hash = nlp.vocab.morphology.add(feats)
assert nlp.vocab.morphology.get(hash) == feats

Get the FEATS string for the hash of the morphological analysis.

Name Type Description
morph int The hash of the morphological analysis.

Morphology.load_tag_map

Replace the current tag map with the provided tag map.

Name Type Description
tag_map Dict[str, Dict] The tag map.

Morphology.load_morph_exceptions

Replace the current morphological exceptions with the provided exceptions.

Name Type Description
morph_rules Dict[str, Dict] The morphological exceptions.

Morphology.add_special_case

Add a special-case rule to the morphological analyzer. Tokens whose tag and orth match the rule will receive the specified properties.

Example

attrs = {"POS": "DET", "Definite": "Def"}
morphology.add_special_case("DT", "the", attrs)
Name Type Description
tag_str str The fine-grained tag.
orth_str str The token text.
attrs dict The features to assign for this token and tag.

Morphology.exc

The current morphological exceptions.

Name Type Description
YIELDS dict The current dictionary of morphological exceptions.

Morphology.lemmatize

TODO

Morphology.feats_to_dict

Convert a string FEATS representation to a dictionary of features and values in the same format as the tag map.

Example

from spacy.morphology import Morphology
d = Morphology.feats_to_dict("Feat1=Val1|Feat2=Val2")
assert d == {"Feat1": "Val1", "Feat2": "Val2"}
Name Type Description
feats str The morphological features in Universal Dependencies FEATS format.
RETURNS dict The morphological features as a dictionary.

Morphology.dict_to_feats

Convert a dictionary of features and values to a string FEATS representation.

Example

from spacy.morphology import Morphology
f = Morphology.dict_to_feats({"Feat1": "Val1", "Feat2": "Val2"})
assert f == "Feat1=Val1|Feat2=Val2"
Name Type Description
feats_dict Dict[str, Dict] The morphological features as a dictionary.
RETURNS str The morphological features as in Universal Dependencies FEATS format.

Attributes

Name Type Description
FEATURE_SEP str The FEATS feature separator. Default is `
FIELD_SEP str The FEATS field separator. Default is =.
VALUE_SEP str The FEATS value separator. Default is ,.