spaCy/lang_data/en
Matthew Honnibal 85485f5c2b Fix inconsistencies in generate_specials.py
Re Issue #321, fix inconsistencies in the script that generates specials.json. The result still isn't so satisfying --- we need to revise this as we move to parse more morphologically rich languages.
2016-04-07 11:21:52 +10:00
..
gazetteer.json Fix Issue #243: Incorrect gazetteer entry 2016-01-30 06:58:29 +11:00
generate_specials.py Fix inconsistencies in generate_specials.py 2016-04-07 11:21:52 +10:00
infix.txt * Add infix rule for double hyphens, re Issue #302 2016-03-29 13:03:44 +11:00
lemma_rules.json * Fix quote marks in lemma_rules 2015-10-10 15:03:36 +11:00
morphs.json * Whitespace 2015-10-10 16:03:48 +11:00
prefix.txt * Add en language data, for tokenizer etc 2015-02-25 17:10:32 -05:00
specials.json * Fix Issue #201: Tokenization of there'll 2015-12-29 18:09:09 +01:00
suffix.txt * Add smart-quote possessive marker to tokenizer 2015-07-30 05:12:48 +02:00
tag_map.json * Map NIL to empty string in tag map 2015-10-10 22:09:50 +11:00