Mike Izbicki
|
7a2ca00794
|
fix bug in Korean language, resulting in 100x speedup by reducing overhead of mecab (#5701)
* speed up Korean nlp 100x by stopping mecab from reloading on each doc
* add contributor agreement
* rename variables to improve code readability
|
2020-07-06 17:03:33 +02:00 |
|
Ines Montani
|
75514b5970
|
Fix Korean
|
2019-09-29 17:10:56 +02:00 |
|
Ines Montani
|
499c39acba
|
Remove unnecessary namedtuple/dataclass
|
2019-09-29 15:05:28 +02:00 |
|
Bae Yong-Ju
|
a55f5a744f
|
Fix ValueError exception on empty Korean text. (#4245)
|
2019-09-06 10:29:40 +02:00 |
|
Ines Montani
|
009280fbc5
|
Tidy up and auto-format
|
2019-08-18 15:09:16 +02:00 |
|
veer-bains
|
874bd8c8dd
|
Fixed syntax error in lang/ko when using python 2 (#4082) (closes #4068)
* fixed syntax error in declaring variables with python 2.7 in spacy/lang/ko/__init__.py
* fixed syntax error in declaring variables with python 2.7 in spacy/lang/ko/__init__.py
* Update __init__.py
* Create veer-bains.md
* Update __init__.py
fixed syntax errors in variable datatype assignment when calling spacy.blank("ko") with python 2.7
|
2019-08-05 10:19:32 +02:00 |
|
Bae Yong-Ju
|
05fbf5d976
|
Fix error when Korean text contains regexp special characters. (#4022)
|
2019-07-25 17:53:33 +02:00 |
|
Ines Montani
|
0b8406a05c
|
Tidy up and auto-format
|
2019-07-11 12:02:25 +02:00 |
|
cedar101
|
58f06e6180
|
Korean support (#3901)
* start lang/ko
* add test codes
* using natto-py
* add test_ko_tokenizer_full_tags()
* spaCy contributor agreement
* external dependency for ko
* collections.namedtuple for python version < 3.5
* case fix
* tuple unpacking
* add jongseong(final consonant)
* apply mecab option
* Remove Pipfile for now
Co-authored-by: Ines Montani <ines@ines.io>
|
2019-07-09 22:23:16 +02:00 |
|