mirror of https://github.com/explosion/spaCy.git (synced 2024-12-26 18:06:29 +03:00)
* Allow longer ellipses to be treated as a single token, e.g. Hello......there
parent c61ee8f9fa
commit eab2376547
@@ -1,4 +1,4 @@
-\.\.\.
+\.\.\.+
 (?<=[a-z])\.(?=[A-Z])
 (?<=[a-zA-Z])-(?=[a-zA-z])
 (?<=[a-zA-Z])--(?=[a-zA-z])
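The effect of the one-line change can be checked with Python's standard `re` module. This is only a sketch: it is not spaCy's tokenizer, and the names `OLD_ELLIPSIS`, `NEW_ELLIPSIS` and `split_on` are hypothetical helpers introduced for illustration. It assumes the example from the commit message contains six dots.

```python
import re

# Patterns copied from the diff; the variable names are hypothetical.
OLD_ELLIPSIS = re.compile(r"\.\.\.")    # exactly three dots per match
NEW_ELLIPSIS = re.compile(r"\.\.\.+")   # three or more dots per match


def split_on(pattern, text):
    """Split text into pieces, keeping every pattern match as its own piece.

    This only mimics the general idea of splitting on a pattern; it is not
    how spaCy itself is implemented.
    """
    pieces = []
    start = 0
    for match in pattern.finditer(text):
        if match.start() > start:
            pieces.append(text[start:match.start()])
        pieces.append(match.group())
        start = match.end()
    if start < len(text):
        pieces.append(text[start:])
    return pieces


# Assumed input: "Hello", six dots, "there".
text = "Hello" + "." * 6 + "there"

old_pieces = split_on(OLD_ELLIPSIS, text)
new_pieces = split_on(NEW_ELLIPSIS, text)

print(old_pieces)
print(new_pieces)
```

With the old pattern the run of six dots is cut into two three-dot pieces; with the new pattern the whole run stays together as a single piece, which is the behaviour the commit message describes.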