Mirror of https://github.com/explosion/spaCy.git (synced 2025-11-01 00:17:44 +03:00)
Update Polish tokenizer for UD_Polish-PDB, which is a relatively major change from the existing tokenizer. Unused exceptions files and conflicting test cases removed.

Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>

| File |
|---|
| `__init__.py` |
| `examples.py` |
| `lemmatizer.py` |
| `lex_attrs.py` |
| `punctuation.py` |
| `stop_words.py` |
| `tag_map.py` |