spaCy/spacy/lang/fi
Antti Ajanki e626a011cc Improvements to the Finnish language data (#4738)
* Enable lex_attrs on Finnish

* Copy the Danish tokenizer rules to Finnish

Specifically, don't break hyphenated compound words

* Contributor agreement

* A new file for Finnish tokenizer rules instead of including the Danish ones
2019-12-03 12:55:28 +01:00
..
__init__.py Improvements to the Finnish language data (#4738) 2019-12-03 12:55:28 +01:00
examples.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
lex_attrs.py Improvements to the Finnish language data (#4738) 2019-12-03 12:55:28 +01:00
punctuation.py Improvements to the Finnish language data (#4738) 2019-12-03 12:55:28 +01:00
stop_words.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
tokenizer_exceptions.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00