spaCy/spacy/lang/ko/stop_words.py
cedar101 58f06e6180 Korean support (#3901)
* start lang/ko

* add test codes

* using natto-py

* add test_ko_tokenizer_full_tags()

* spaCy contributor agreement

* external dependency for ko

* collections.namedtuple for python version < 3.5

* case fix

* tuple unpacking

* add jongseong(final consonant)

* apply mecab option

* Remove Pipfile for now


Co-authored-by: Ines Montani <ines@ines.io>
2019-07-09 22:23:16 +02:00


# coding: utf8
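from __future__ import unicode_literals


# Korean stop words, one entry per line. Several entries are bare stems
# rather than full word forms (for example 말하 from 말하다, 위하 from 위하다).
STOP_WORDS = set("""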
아니
그렇
위하
때문
그것
말하
그러나
못하
그런
그리고
시키
그러
하나
어떤
다른
어떻
이렇
""".split())