mirror of
https://github.com/explosion/spaCy.git
synced 2025-11-16 15:55:58 +03:00
* Adjust hyphenation rule in tokenizer
This commit is contained in:
parent
2b5421e60c
commit
b5b869366b
|
|
@ -1,2 +1,3 @@
|
|||
(?<=[a-z])\.(?=[A-Z])
|
||||
(?<=[a-zA-Z])-(?=[a-zA-z])
|
||||
(?<=[a-zA-Z0-9])-(?=[a-zA-z])
|
||||
(?<=[a-zA-Z])-(?=[0-9a-zA-z])
|
||||
|
|
|
|||
Loading…
Reference in New Issue
Block a user