mirror of
https://github.com/explosion/spaCy.git
synced 2025-10-24 04:31:17 +03:00
- avoid catastrophic backtracking - reduce character range of host name, domain name and TLD identifier |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| sun.txt | ||
| test_customized_tokenizer.py | ||
| test_exceptions.py | ||
| test_tokenizer.py | ||
| test_urls.py | ||
| test_whitespace.py | ||