17 Commits

Author | SHA1 | Message | Date
Matthew Honnibal | 4b4eec8b47 | * Fix Issue #201: Tokenization of there'll | 2015-12-29 18:09:09 +01:00
Matthew Honnibal | e8bd92f1e7 | * Fix lemma of let's, re Issue #177 | 2015-11-13 06:42:23 +11:00
Matthew Honnibal | bdcb8d695c | * Add non-breaking space to specials.json | 2015-10-10 15:54:06 +11:00
Matthew Honnibal | a510858f5a | * Pretty-print specials.json, and add the em dash | 2015-10-09 11:07:45 +02:00
jxs8172 | 85f01c5e16 | Add contributor agreement. Add exception to 'it' so that 'its' and 'Its' isn't generated (its =/= it's) | 2015-08-24 18:20:06 -04:00
jxs8172 | 5876248109 | Add missing we've and hardcoded 's and 'S | 2015-08-21 22:57:47 -04:00
jxs8172 | a5e0a0073b | Add a script to generate the specials.json file, to take care of handling uppercase and missing apostrophe contractions | 2015-08-21 22:39:33 -04:00
Matthew Honnibal | 6c01e01f12 | * Fix some casing problems in specials.json | 2015-07-26 01:38:29 +02:00
Matthew Honnibal | 9cae1b4cad | * Restore accidentally clobbered updates to specials.json | 2015-07-20 12:19:46 +02:00
Matthew Honnibal | 14e9e6ec6c | * Fix ... tokenization, and correct orth inconsistencies in specials.json | 2015-07-20 12:10:56 +02:00
Matthew Honnibal | d8bc279e0c | * Fix 'you' contraction capitals in specials.json | 2015-07-16 01:28:32 +02:00
Matthew Honnibal | 3c1e3e9ee8 | * Fix capitalization problems in specials.json | 2015-07-14 23:46:31 +02:00
Matthew Honnibal | b5223c4824 | * Add whitespace to specials.json | 2015-07-09 13:31:12 +02:00
Matthew Honnibal | 0c25001325 | * Fix specials.json | 2015-04-12 04:45:41 +02:00
Matthew Honnibal | 056c672caf | * Bug fixes to tokenization, and support for times | 2015-03-26 16:44:48 +01:00
Matthew Honnibal | 13520e6cf0 | * Add i.e. to specials.json | 2015-03-26 16:44:45 +01:00
Matthew Honnibal | 5e27bd0c4c | * Add en language data, for tokenizer etc | 2015-02-25 17:10:32 -05:00