1
1
mirror of https://github.com/explosion/spaCy.git synced 2025-01-30 19:24:07 +03:00
Commit Graph

1 Commits

Author SHA1 Message Date
Antti Ajanki
e626a011cc Improvements to the Finnish language data ()
* Enable lex_attrs on Finnish

* Copy the Danish tokenizer rules to Finnish

Specifically, don't break hyphenated compound words

* Contributor agreement

* A new file for Finnish tokenizer rules instead of including the Danish ones
2019-12-03 12:55:28 +01:00