spaCy/lang_data
Wolfgang Seeker eae35e9b27 add tokenizer files for German, add/change code to train German POS tagger
- add files to specify rules for German tokenization
- change generate_specials.py to generate its output from an external file (abbrev.de.tab)
- copy gazetteer.json from lang_data/en/

- init_model.py
	- change document frequency threshold to 0
- add train_german_tagger.py
	- expects CoNLL-09-formatted input
2016-02-18 13:24:20 +01:00
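
The commit notes above say that train_german_tagger.py expects conll09-formatted input. As a rough illustration of what such input looks like to a reader, here is a minimal sketch that pulls (word, tag) pairs out of CoNLL-2009 text. It is not the code from train_german_tagger.py: the function name read_conll09 and the sample rows are hypothetical, and the choice of the gold POS column (the fifth column) over the predicted one (PPOS, the sixth) is an assumption, since the listing does not say which column the script reads.

```python
# Sketch only: read_conll09 is a hypothetical helper, not part of spaCy.
# CoNLL-2009 rows are tab-separated; sentences are separated by blank lines.
# Columns: ID FORM LEMMA PLEMMA POS PPOS FEAT PFEAT HEAD PHEAD DEPREL PDEPREL ...
FORM_COL = 1  # surface form of the token
POS_COL = 4   # gold POS tag (assumption: PPOS in column 5 may be used instead)


def read_conll09(lines):
    """Yield each sentence as a list of (form, pos) tuples."""
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        sentence.append((cols[FORM_COL], cols[POS_COL]))
    if sentence:
        # The last sentence may lack a trailing blank line.
        yield sentence


# Made-up sample rows for illustration; not taken from any real corpus.
sample = [
    "1\tDas\tder\tder\tART\tART\t_\t_\t2\t2\tSB\tSB\t_\t_\n",
    "2\tist\tsein\tsein\tVAFIN\tVAFIN\t_\t_\t0\t0\tROOT\tROOT\t_\t_\n",
    "\n",
    "1\tGut\tgut\tgut\tADJD\tADJD\t_\t_\t0\t0\tROOT\tROOT\t_\t_\n",
]

sentences = list(read_conll09(sample))
```

In practice the lines would come from an open file handle rather than a list; the generator form keeps memory use flat for large training files.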
de add tokenizer files for German, add/change code to train German POS tagger 2016-02-18 13:24:20 +01:00
en Fix Issue #243: Incorrect gazetteer entry 2016-01-30 06:58:29 +11:00
fi * Fix identity tag map 2015-10-08 13:59:56 +11:00
it * Patch italian tag map 2015-10-08 14:00:13 +11:00