| .. |
|
af
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
ar
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
bg
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
bn
|
Merge branch 'master' into develop
|
2019-02-18 10:03:32 +01:00 |
|
ca
|
Merge branch 'master' into develop
|
2019-02-17 17:51:17 +01:00 |
|
cs
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
da
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
de
|
Fix morphological features in de tag_map
|
2019-03-07 21:57:43 +01:00 |
|
el
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
en
|
Fix obsolete data in English tokenizer exceptions
|
2019-03-07 21:58:16 +01:00 |
|
es
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
et
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
fa
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
fi
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
fr
|
Clean up of char classes, few tokenizer fixes and faster default French tokenizer (#3293)
|
2019-02-20 22:10:13 +01:00 |
|
ga
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
he
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
hi
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
hr
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
hu
|
Fix regex deprecation warnings
|
2019-02-21 11:56:47 +01:00 |
|
id
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
is
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
it
|
Clean up of char classes, few tokenizer fixes and faster default French tokenizer (#3293)
|
2019-02-20 22:10:13 +01:00 |
|
ja
|
Expose Japanese stop words (closes #3346)
|
2019-03-06 14:21:15 +01:00 |
|
kn
|
Fix typo
|
2019-02-14 12:26:56 +01:00 |
|
lt
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
lv
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
nb
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
nl
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
pl
|
Tidy up and fix small bugs and typos
|
2019-02-08 14:14:49 +01:00 |
|
pt
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
ro
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
ru
|
Merge branch 'master' into develop
|
2019-02-25 15:54:55 +01:00 |
|
si
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
sk
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
sl
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
sq
|
💫 Add base Language classes for more languages (#3276)
|
2019-02-15 01:31:19 +11:00 |
|
sv
|
Tidy up and fix small bugs and typos
|
2019-02-08 14:14:49 +01:00 |
|
ta
|
Remove stray print statement (closes #3342)
|
2019-02-27 15:35:04 +01:00 |
|
te
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
th
|
Merge branch 'master' into develop
|
2019-02-07 20:54:07 +01:00 |
|
tl
|
Tidy up and fix small bugs and typos
|
2019-02-08 14:14:49 +01:00 |
|
tr
|
Merge branch 'master' into develop
|
2019-02-07 20:54:07 +01:00 |
|
tt
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
uk
|
Add missing " (closes #3343)
|
2019-02-27 16:37:03 +01:00 |
|
ur
|
Improve Italian & Urdu tokenization accuracy (#3228)
|
2019-02-04 22:39:25 +01:00 |
|
vi
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
xx
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
zh
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
__init__.py
|
Remove imports in /lang/__init__.py
|
2017-05-08 23:58:07 +02:00 |
|
char_classes.py
|
Auto-format
|
2019-02-24 14:09:15 +01:00 |
|
lex_attrs.py
|
Replacing regex library with re to increase tokenization speed (#3218)
|
2019-02-01 18:05:22 +11:00 |
|
norm_exceptions.py
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
punctuation.py
|
Clean up of char classes, few tokenizer fixes and faster default French tokenizer (#3293)
|
2019-02-20 22:10:13 +01:00 |
|
tag_map.py
|
💫 Tidy up and auto-format .py files (#2983)
|
2018-11-30 17:03:03 +01:00 |
|
tokenizer_exceptions.py
|
Tidy up and fix small bugs and typos
|
2019-02-08 14:14:49 +01:00 |