spaCy/spacy/lang
Duygu Altinok 2fad279a44 Turkish language syntax iterators (#6191)
* added tr_vocab to config

* basic test

* added syntax iterator to Turkish lang class

* first version for Turkish syntax iter, without flat

* added simple tests with nmod, amod, det

* more tests to amod and nmod

* separated noun chunks and parser test

* rearrangement after nchunk parser separation

* added recursive NPs

* tests with complicated recursive NPs

* tests with conjed NPs

* additional tests for conj NP

* small modification for shaving off conj from NP

* added tests with flat

* more tests with flat

* added examples with flats conjed

* added inner func for flat trick

* corrected parse

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2020-10-09 10:10:22 +02:00
..
af Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
ar Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
bg Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
bn Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
ca Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
cs Remove empty file [ci skip] 2020-09-23 09:30:09 +02:00
da Remove default initialize lookups 2020-10-01 21:54:33 +02:00
de Merge branch 'develop' into master-tmp 2020-10-04 14:52:20 +02:00
el Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
en Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
es Tidy up and auto-format 2020-09-29 21:39:28 +02:00
et Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
eu Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
fa Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
fi Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
fr Fix Lemmatizer.get_lookups_config 2020-10-03 17:16:10 +02:00
ga Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
gu Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
he Remove unicode declarations and update language data 2020-09-04 13:19:16 +02:00
hi Merge branch 'develop' into master-tmp 2020-09-04 13:15:36 +02:00
hr Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
hu Fix Hungarian % tokenization (#6013) 2020-09-02 13:06:16 +02:00
hy Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
id Merge branch 'develop' into master-tmp 2020-10-04 14:52:20 +02:00
is Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
it Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
ja Add lexeme norm defaults 2020-09-30 10:20:14 +02:00
kn Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
ko Add lexeme norm defaults 2020-09-30 10:20:14 +02:00
lb Remove default initialize lookups 2020-10-01 21:54:33 +02:00
lij Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
lt Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
lv Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
ml Add missing lex_attr_getters (resolves #5806 ) 2020-07-25 12:55:18 +02:00
mr Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
nb Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
ne Remove unicode declarations and update language data 2020-09-04 13:19:16 +02:00
nl Fix Lemmatizer.get_lookups_config 2020-10-03 17:16:10 +02:00
pl Tidy up and auto-format 2020-10-03 17:20:18 +02:00
pt Remove default initialize lookups 2020-10-01 21:54:33 +02:00
ro Add missing lex_attr_getters (resolves #5806 ) 2020-07-25 12:55:18 +02:00
ru Update ru/uk lemmatizers for new nlp.initialize 2020-10-05 09:27:16 +02:00
sa Tidy up and auto-format 2020-09-29 21:39:28 +02:00
si Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
sk Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
sl Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
sq Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
sr Remove default initialize lookups 2020-10-01 21:54:33 +02:00
sv Make lemmatizers use initialize logic (#6182) 2020-10-02 15:42:36 +02:00
ta Remove default initialize lookups 2020-10-01 21:54:33 +02:00
te Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
th Remove default initialize lookups 2020-10-01 21:54:33 +02:00
tl Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
tr Turkish language syntax iterators (#6191) 2020-10-09 10:10:22 +02:00
tt Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
uk Auto-format [ci skip] 2020-10-05 21:58:18 +02:00
ur Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
vi Merge pull request #6165 from explosion/feature/update-tokenizers-initialize 2020-10-01 09:49:47 +02:00
xx Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
yo Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
zh Auto-format [ci skip] 2020-10-05 21:58:18 +02:00
__init__.py Remove imports in /lang/__init__.py 2017-05-08 23:58:07 +02:00
char_classes.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
lex_attrs.py Merge branch 'develop' into master-tmp 2020-09-04 13:15:36 +02:00
norm_exceptions.py Tidy up and auto-format 2020-02-18 15:38:18 +01:00
punctuation.py Simplify language data and revert detailed configs 2020-07-24 14:50:26 +02:00
tokenizer_exceptions.py Merge branch 'develop' into master-tmp 2020-09-04 13:15:36 +02:00