Mirror of https://github.com/explosion/spaCy.git, synced 2025-10-25 05:01:02 +03:00
* Switch to mecab-ko as default Korean tokenizer
Switch to the (confusingly named) `mecab-ko` Python module for default Korean
tokenization.
Keep the previous `natto-py` tokenizer available as
`spacy.KoreanNattoTokenizer.v1`.
* Temporarily run tests with mecab-ko tokenizer
* Fix types
* Fix duplicate test names
* Update requirements test
* Revert "Temporarily run tests with mecab-ko tokenizer"
This reverts commit
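
To illustrate the first item above: the previous `natto-py` tokenizer stays selectable through its registered name. The following is a minimal sketch, assuming a spaCy build that includes this change and an environment with `natto-py` and MeCab-ko installed. The helper names `korean_natto_config` and `load_korean_natto` are hypothetical and not part of spaCy.

```python
def korean_natto_config():
    """Build a config override that selects the legacy natto-py tokenizer.

    The registered name comes from the commit message above; this helper
    itself is hypothetical and only used for illustration.
    """
    return {
        "nlp": {
            "tokenizer": {"@tokenizers": "spacy.KoreanNattoTokenizer.v1"}
        }
    }


def load_korean_natto():
    """Create a blank Korean pipeline that uses the natto-py tokenizer."""
    # Imported lazily so korean_natto_config() works without spaCy installed.
    import spacy

    # Assumption: the installed spaCy registers spacy.KoreanNattoTokenizer.v1.
    return spacy.blank("ko", config=korean_natto_config())
```

Calling `load_korean_natto()` should return a pipeline whose tokenizer is the `natto-py` one, while a plain `spacy.blank("ko")` would pick up the new `mecab-ko` default.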
| Name |
|---|
| `..` |
| `__init__.py` |
| `examples.py` |
| `lex_attrs.py` |
| `punctuation.py` |
| `stop_words.py` |
| `tag_map.py` |