spaCy/spacy/lang/ja
Paul O'Leary McCann 61ef0739b8 Add Japanese stop words. (#2549)
List created by taking the 2000 top words from a Wikipedia dump and
removing everything that wasn't hiragana.

Tried going through kanji words and deciding what to keep but there were
too many obvious non-stopwords (東京 was in the top 500) and many other
words where it wasn't clear if they should be included or not.
2018-07-17 10:12:48 +02:00
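
The filtering step described in the commit message above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the script that was actually used: the function name `hiragana_stop_word_candidates`, the `top_n` parameter, and the assumption that the Wikipedia dump has already been tokenized into a flat list of words are all hypothetical.

```python
import re
from collections import Counter

# Matches strings made up only of characters in the basic Hiragana range
# (U+3041 to U+3096). Kanji, katakana, Latin letters and digits do not match.
HIRAGANA_ONLY = re.compile(r"[\u3041-\u3096]+")


def hiragana_stop_word_candidates(tokens, top_n=2000):
    """Return the hiragana-only words among the top_n most frequent tokens.

    `tokens` is assumed to be an iterable of already-tokenized words;
    how the dump was tokenized is not stated in the commit message.
    """
    counts = Counter(tokens)
    # most_common() sorts by descending frequency; ties keep first-seen order.
    top_words = [word for word, _ in counts.most_common(top_n)]
    return [word for word in top_words if HIRAGANA_ONLY.fullmatch(word)]


if __name__ == "__main__":
    sample = ["の", "の", "の", "東京", "東京", "は", "カタカナ"]
    # 東京 is frequent but contains kanji, so it is dropped.
    print(hiragana_stop_word_candidates(sample, top_n=3))
```

A frequency cut-off combined with a script filter like this keeps common function words such as particles while dropping frequent content words such as 東京, which is the problem the commit message describes with the kanji candidates.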
__init__.py Add Japanese lemmas (#2543) 2018-07-13 10:55:14 +02:00
examples.py Add example sentences for Japanese and Chinese (see #1107) 2017-10-24 13:02:24 +02:00
stop_words.py Add Japanese stop words. (#2549) 2018-07-17 10:12:48 +02:00
tag_map.py Port Japanese mecab tokenizer from v1 (#2036) 2018-05-03 18:38:26 +02:00