spaCy/spacy/lang/sr/__init__.py

from .stop_words import STOP_WORDS
from .tokenizer_exceptions import TOKENIZER_EXCEPTIONS
from .lex_attrs import LEX_ATTRS
from ...language import Language
from ...util import load_config_from_str


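# Extra config merged into the language defaults: on initialization, load the
# "lexeme_norm" lookups table for this language through the registered loader.
DEFAULT_CONFIG = """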
[initialize]

[initialize.lookups]
@misc = "spacy.LookupsDataLoader.v1"
lang = ${nlp.lang}
tables = ["lexeme_norm"]
"""


class SerbianDefaults(Language.Defaults):
    # Serbian-specific language data picked up by the Language base class.
    config = load_config_from_str(DEFAULT_CONFIG)
    tokenizer_exceptions = TOKENIZER_EXCEPTIONS
    lex_attr_getters = LEX_ATTRS
    stop_words = STOP_WORDS


class Serbian(Language):
    lang = "sr"
    Defaults = SerbianDefaults


__all__ = ["Serbian"]