spaCy/spacy/tests/tokenizer
Raphaël Bournhonesque 3452d6ce52 Resolve issue #1078 by simplifying URL pattern
- avoid catastrophic backtracking
- reduce character range of host name, domain name and TLD identifier
2017-10-11 11:24:00 +02:00
..
__init__.py add __init__.py to empty package dirs 2016-03-14 11:28:03 +01:00
sun.txt Move text file back to tokenizer tests directory 2017-01-12 02:10:23 +01:00
test_customized_tokenizer.py Fix customized-tokenizer tests 2017-09-04 20:13:13 +02:00
test_exceptions.py Move non-English-specific tests back to general tokenizer tests 2017-01-05 18:09:29 +01:00
test_tokenizer.py Remove unused utf8open util and replace os.path with ensure_path 2017-04-16 20:37:45 +02:00
test_urls.py Resolve issue #1078 by simplifying URL pattern 2017-10-11 11:24:00 +02:00
test_whitespace.py Move non-English-specific tests back to general tokenizer tests 2017-01-05 18:09:29 +01:00