spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-02-05 23:09:48 +03:00

cedar101 58f06e6180 Korean support (#3901)	2019-07-09 22:23:16 +02:00
* start lang/ko
* add test codes
* using natto-py
* add test_ko_tokenizer_full_tags()
* spaCy contributor agreement
* external dependency for ko
* collections.namedtuple for python version < 3.5
* case fix
* tuple unpacking
* add jongseong(final consonant)
* apply mecab option
* Remove Pipfile for now
Co-authored-by: Ines Montani <ines@ines.io>
ar	Tidy up and format remaining files	2018-11-30 17:43:08 +01:00
bn	💫 Port master changes over to develop (#2979)	2018-11-29 16:30:29 +01:00
ca	Improve Italian & Urdu tokenization accuracy (#3228)	2019-02-04 22:39:25 +01:00
da	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
de	💫 Port master changes over to develop (#2979)	2018-11-29 16:30:29 +01:00
el	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
en	Fix/irreg adverbs extension (#3499)	2019-03-28 13:23:33 +01:00
es	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
fi	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
fr	Clean up of char classes, few tokenizer fixes and faster default French tokenizer (#3293)	2019-02-20 22:10:13 +01:00
ga	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
he	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
hu	Tidy up and format remaining files	2018-11-30 17:43:08 +01:00
id	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
it	Improve Italian & Urdu tokenization accuracy (#3228)	2019-02-04 22:39:25 +01:00
ja	Tags are joined with a comma and padded with asterisks (#3491)	2019-03-28 16:17:31 +01:00
ko	Korean support (#3901)	2019-07-09 22:23:16 +02:00
lt	Lithuanian language support (#3895)	2019-07-08 10:25:22 +02:00
nb	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
nl	Improved Dutch language resources and Dutch lemmatization (#3409)	2019-04-03 14:13:26 +02:00
pl	Tidy up and fix small bugs and typos	2019-02-08 14:14:49 +01:00
pt	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
ro	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
ru	Replacing regex library with re to increase tokenization speed (#3218)	2019-02-01 18:05:22 +11:00
sv	Tidy up and fix small bugs and typos	2019-02-08 14:14:49 +01:00
th	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
tr	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
tt	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
uk	Merge branch 'master' into develop	2019-02-25 15:54:55 +01:00
ur	Improve Italian & Urdu tokenization accuracy (#3228)	2019-02-04 22:39:25 +01:00
__init__.py	Remove imports in /lang/__init__.py	2017-05-08 23:58:07 +02:00
test_attrs.py	💫 Tidy up and auto-format tests (#2967)	2018-11-27 01:09:36 +01:00
test_initialize.py	Fix noqa [ci skip]	2019-03-07 12:25:00 +01:00