spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-02-23 06:50:32 +03:00

History

Adriane Boyd b9c524917a Cast to uint64 for all array-based doc representations		2022-12-07 09:53:33 +01:00
..
am	Add Amharic አማርኛ Language support (#6583 )	2020-12-22 16:50:34 +01:00
ar	Revert #4334	2019-09-29 17:32:12 +02:00
bn	Revert #4334	2019-09-29 17:32:12 +02:00
ca	Revert #4334	2019-09-29 17:32:12 +02:00
cs	Adding num_like test for Czech (#5946 )	2020-08-21 17:06:33 +02:00
da	Add (noun chunks) syntax iterators for Danish (#6246 )	2021-01-07 16:33:00 +11:00
de	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
el	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
en	Cast to uint64 for all array-based doc representations	2022-12-07 09:53:33 +01:00
es	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
eu	Add __init__.py to eu and hy tests (#5278 )	2020-04-08 20:03:06 +02:00
fa	Fix syntax iterators for Persian (#5437 )	2020-05-14 16:51:03 +02:00
fi	add two abbreviations and some additional unit tests (#5040 )	2020-02-22 14:12:32 +01:00
fr	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
ga	Revert #4334	2019-09-29 17:32:12 +02:00
gu	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
he	Hebrew like num (#5952 )	2020-08-24 14:30:05 +02:00
hi	Hindi: Adds tests for lexical attributes (norm and like_num) (#5829 )	2020-10-07 10:23:32 +02:00
hu	Tidy up and auto-format	2020-03-25 12:28:12 +01:00
hy	Add missing declaration	2020-05-21 17:30:05 +02:00
id	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
it	Revert #4334	2019-09-29 17:32:12 +02:00
ja	Revert "Convert custom user_data to token extension format for Japanese tokenizer (#5652 )" (#5665 )	2020-06-29 14:34:15 +02:00
ko	Revert #4334	2019-09-29 17:32:12 +02:00
ky	Add tests	2021-01-24 20:56:16 +06:00
lb	Reduce stored lexemes data, move feats to lookups (#5238 )	2020-05-19 15:59:14 +02:00
lt	Improve Lithuanian tokenization (#5205 )	2020-03-25 11:28:12 +01:00
mk	Include Macedonian language (#6230 )	2020-10-15 15:55:01 +02:00
ml	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
nb	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
ne	Add Nepali Language (#5622 )	2020-06-22 10:25:46 +02:00
nl	Move lookup tables out of the core library (#4346 )	2019-10-01 00:01:27 +02:00
pl	Update Polish tokenizer for UD_Polish-PDB (#5432 )	2020-05-19 15:59:55 +02:00
pt	Revert #4334	2019-09-29 17:32:12 +02:00
ro	Move lookup tables out of the core library (#4346 )	2019-10-01 00:01:27 +02:00
ru	Revert #4334	2019-09-29 17:32:12 +02:00
sa	Added support for Sanskrit language (#5956 )	2020-08-25 10:56:29 +02:00
sr	Move lookup tables out of the core library (#4346 )	2019-10-01 00:01:27 +02:00
sv	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
th	Revert #4334	2019-09-29 17:32:12 +02:00
ti	Add Amharic አማርኛ Language support (#6583 )	2020-12-22 16:50:34 +01:00
tr	Turkish tokenization improvements (#6268 )	2020-10-29 09:43:17 +01:00
tt	Add trailing whitespace to multiline test text (#4877 )	2020-01-06 14:58:59 +01:00
uk	Revert #4334	2019-09-29 17:32:12 +02:00
ur	Revert #4334	2019-09-29 17:32:12 +02:00
yo	Adding support for Yoruba Language (#4614 )	2019-12-21 14:11:50 +01:00
zh	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
__init__.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_attrs.py	Tidy up and auto-format	2019-12-21 19:04:17 +01:00
test_initialize.py	Adding support for Yoruba Language (#4614 )	2019-12-21 14:11:50 +01:00