spaCy/spacy/language_data
Matthew Honnibal e7b1ee9efd Switch to regex module for URL identification
The URL detection regex was failing on input such as 0.1.2.3, as this
input triggered excessive back-tracking in the builtin re module.
The solution was to switch to the regex module, which behaves better.

Closes #913.
2017-04-07 15:47:36 +02:00
..
__init__.py Revert "Revert "Merge remote-tracking branch 'origin/master'"" 2017-01-09 13:28:13 +01:00
abbreviations.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
emoticons.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
entity_rules.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
punctuation.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
tag_map.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
tokenizer_exceptions.py Switch to regex module for URL identification 2017-04-07 15:47:36 +02:00
util.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00