spaCy/.github
Andrew Ongko 81564cc4e8 Update Indonesian model (#2752)
* adding e-KTP in tokenizer exceptions list

* add exception token

* removing lines containing spaces, as they won't matter since we use the .split() method in the end; added new tokens in exceptions

* add tokenizer exceptions list

* combining base_norms with norm_exceptions

* adding norm_exception

* fix double key in lemmatizer

* remove unused import on punctuation.py

* reformat stop_words to reduce number of lines, improve readability

* updating tokenizer exception

* implement is_currency for lang/id

* adding orth_first_upper in tokenizer_exceptions

* update the norm_exception list

* remove a bunch of abbreviations

* adding contributors file
2018-09-14 12:30:32 +02:00
contributors Update Indonesian model (#2752) 2018-09-14 12:30:32 +02:00
ISSUE_TEMPLATE Order issue templates 2018-05-04 03:04:41 +02:00
CONTRIBUTOR_AGREEMENT.md Restore contributor agreement 2018-03-31 14:06:37 +02:00
ISSUE_TEMPLATE.md Update ISSUE_TEMPLATE.md 2017-03-18 22:01:54 +01:00
lock.yml Add lock-threads configuration [ci skip] 2018-05-07 18:54:22 +02:00
no-response.yml Add config for no-response [ci skip] 2018-05-07 22:04:54 +02:00
PULL_REQUEST_TEMPLATE.md Update pull request template 2017-10-24 21:49:11 +02:00