mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-25 17:36:30 +03:00
923a453449
Modifications to Portuguese tokenization for UD_Portuguese-Bosque. Instead of splitting contactions as exceptions, they are kept as merged tokens. |
||
---|---|---|
.. | ||
__init__.py | ||
examples.py | ||
lex_attrs.py | ||
norm_exceptions.py | ||
punctuation.py | ||
stop_words.py | ||
tag_map.py | ||
tokenizer_exceptions.py |