mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-26 01:46:28 +03:00
Auto-format
This commit is contained in:
parent
1664d1fa62
commit
59c0620487
|
@ -105,11 +105,11 @@ to know the language's character set. If the language you're adding uses
|
||||||
non-latin characters, you might need to define the required character classes in
|
non-latin characters, you might need to define the required character classes in
|
||||||
the global
|
the global
|
||||||
[`char_classes.py`](https://github.com/explosion/spaCy/tree/master/spacy/lang/char_classes.py).
|
[`char_classes.py`](https://github.com/explosion/spaCy/tree/master/spacy/lang/char_classes.py).
|
||||||
For efficiency, spaCy uses hard-coded unicode ranges to define character classes,
|
For efficiency, spaCy uses hard-coded unicode ranges to define character
|
||||||
the definitions of which can be found on [Wikipedia](https://en.wikipedia.org/wiki/Unicode_block).
|
classes, the definitions of which can be found on
|
||||||
If the language requires very specific punctuation
|
[Wikipedia](https://en.wikipedia.org/wiki/Unicode_block). If the language
|
||||||
rules, you should consider overwriting the default regular expressions with your
|
requires very specific punctuation rules, you should consider overwriting the
|
||||||
own in the language's `Defaults`.
|
default regular expressions with your own in the language's `Defaults`.
|
||||||
|
|
||||||
</Infobox>
|
</Infobox>
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue
Block a user