mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-30 20:06:30 +03:00
b98d216205
* Update Catalan language data Update Catalan language data based on contributions from the Text Mining Unit at the Barcelona Supercomputing Center: https://github.com/TeMU-BSC/spacy4release/tree/main/lang_data * Update tokenizer settings for UD Catalan AnCora Update for UD Catalan AnCora v2.7 with merged multi-word tokens. * Update test * Move prefix patternt to more generic infix pattern * Clean up |
||
---|---|---|
.. | ||
__init__.py | ||
examples.py | ||
lemmatizer.py | ||
lex_attrs.py | ||
punctuation.py | ||
stop_words.py | ||
syntax_iterators.py | ||
tokenizer_exceptions.py |