mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 18:06:29 +03:00
cf52510631
* Add Amharic to space * clean up * Add some PRON_LEMMA * add Tigrinya support * remove text_noun_chunks * Tigrinya Support * added some more details for ti * fix unit test * add amharic char range * changes from review * amharic and tigrinya share same unicode block * get rid of _amharic/_tigrinya in char_classes Co-authored-by: Josiah Solomon <jsolomon@meteorcomm.com>
10 lines
224 B
Python
10 lines
224 B
Python
# coding: utf8
|
|
from __future__ import unicode_literals
|
|
|
|
|
|
# Stop words
|
|
STOP_WORDS = set(
|
|
"""
|
|
ግን አንቺ አንተ እናንተ ያንተ ያንቺ የናንተ ራስህን ራስሽን ራሳችሁን
|
|
""".split()
|
|
) |