spaCy/spacy/tests/tokenizer
adrianeboyd d24bca62f6 Add CJK to character classes (#4884)
* Add CJK character class as uncased

* Incorporate Chinese URL test case

Un-xfail Chinese URL test instance
2020-01-08 16:50:19 +01:00
__init__.py              Revert #4334                                              2019-09-29 17:32:12 +02:00
sun.txt                  Revert #4334                                              2019-09-29 17:32:12 +02:00
test_exceptions.py       Revert #4334                                              2019-09-29 17:32:12 +02:00
test_explain.py          Detect more empty matches in tokenizer.explain() (#4675)  2019-11-20 16:31:29 +01:00
test_naughty_strings.py  Revert #4334                                              2019-09-29 17:32:12 +02:00
test_tokenizer.py        Revert #4334                                              2019-09-29 17:32:12 +02:00
test_urls.py             Add CJK to character classes (#4884)                      2020-01-08 16:50:19 +01:00
test_whitespace.py       Revert #4334                                              2019-09-29 17:32:12 +02:00