spaCy/spacy/tests/tokenizer
Adriane Boyd 1139247532 Revert changes to token_match priority from #4374
* Revert changes to priority of `token_match` so that it has priority
over all other tokenizer patterns

* Add lookahead and potentially slow lookbehind back to the default URL
pattern

* Expand character classes in URL pattern to improve matching around
lookaheads and lookbehinds related to #4882

* Revert changes to Hungarian tokenizer

* Revert (xfail) several URL tests to their status before #4374

* Update `tokenizer.explain()` and docs accordingly
2020-03-09 12:09:41 +01:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
sun.txt Revert #4334 2019-09-29 17:32:12 +02:00
test_exceptions.py Revert #4334 2019-09-29 17:32:12 +02:00
test_explain.py Detect more empty matches in tokenizer.explain() (#4675) 2019-11-20 16:31:29 +01:00
test_naughty_strings.py Revert #4334 2019-09-29 17:32:12 +02:00
test_tokenizer.py Revert #4334 2019-09-29 17:32:12 +02:00
test_urls.py Revert changes to token_match priority from #4374 2020-03-09 12:09:41 +01:00
test_whitespace.py Revert #4334 2019-09-29 17:32:12 +02:00