spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-01-08 17:51:16 +03:00

History

Adriane Boyd 1139247532 Revert changes to token_match priority from #4374 * Revert changes to priority of `token_match` so that it has priority over all other tokenizer patterns * Add lookahead and potentially slow lookbehind back to the default URL pattern * Expand character classes in URL pattern to improve matching around lookaheads and lookbehinds related to #4882 * Revert changes to Hungarian tokenizer * Revert (xfail) several URL tests to their status before #4374 * Update `tokenizer.explain()` and docs accordingly		2020-03-09 12:09:41 +01:00
..
__init__.py	Revert #4334	2019-09-29 17:32:12 +02:00
sun.txt	Revert #4334	2019-09-29 17:32:12 +02:00
test_exceptions.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_explain.py	Detect more empty matches in tokenizer.explain() (#4675 )	2019-11-20 16:31:29 +01:00
test_naughty_strings.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_tokenizer.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_urls.py	Revert changes to token_match priority from #4374	2020-03-09 12:09:41 +01:00
test_whitespace.py	Revert #4334	2019-09-29 17:32:12 +02:00