spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-10-04 10:56:45 +03:00

Adriane Boyd 2f981d5af1 Remove corpus-specific tag maps	2020-07-15 15:58:29 +02:00

Remove corpus-specific tag maps from the language data for languages without custom tokenizers. For languages with custom word segmenters that also provide tags (Japanese and Korean), the tag maps for the custom tokenizers are kept as the default. For languages without custom tokenizers, the default tag map is now the one from `lang/tag_map.py`, UPOS -> UPOS.

..
__init__.py	Remove corpus-specific tag maps	2020-07-15 15:58:29 +02:00
examples.py	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
lex_attrs.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
morph_rules.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
punctuation.py	Remove unicode declarations	2020-03-26 15:18:32 +01:00
stop_words.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
tokenizer_exceptions.py	Merge branch 'master' into tmp/sync	2020-03-26 13:38:14 +01:00
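
The latest commit message describes the default tag map as "UPOS -> UPOS", i.e. an identity mapping from each Universal POS tag to itself. A minimal sketch of that idea follows, assuming plain strings in place of spaCy's symbol constants. The names `UPOS_TAGS`, `DEFAULT_TAG_MAP`, and `coarse_pos`, and the fallback to `"X"` for unknown tags, are hypothetical and are not taken from `lang/tag_map.py`.

```python
# Illustrative only: plain strings stand in for spaCy's symbol constants.
# The 17 tags below are the Universal Dependencies UPOS tag set.
UPOS_TAGS = [
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
]

# Identity map: each UPOS tag maps to a coarse-grained POS of the same name.
DEFAULT_TAG_MAP = {tag: {"pos": tag} for tag in UPOS_TAGS}


def coarse_pos(tag, tag_map=DEFAULT_TAG_MAP):
    """Return the coarse POS for a tag.

    Falling back to "X" for tags missing from the map is an assumption
    made for this sketch, not documented spaCy behaviour.
    """
    return tag_map.get(tag, {"pos": "X"})["pos"]


print(coarse_pos("NOUN"))  # a UPOS tag maps to itself
print(coarse_pos("NN"))    # a corpus-specific tag is not in the map
```

With a map like this, a corpus-specific tag such as `NN` is no longer resolved by the language data, which matches the commit's intent of dropping corpus-specific tag maps from languages without custom tokenizers.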