15 Commits

Author SHA1 Message Date
Matthew Honnibal
149a901ea7 * Don't use POS tags in supersense dict 2015-07-05 10:50:22 +02:00
Matthew Honnibal
211058f7a6 * Load adverb senses 2015-07-05 09:13:22 +02:00
Matthew Honnibal
4c6533a019 * Write a supersenses.json file into a wsd directory in init_model 2015-07-04 17:24:32 +02:00
Matthew Honnibal
948ea9333a * Fix alignment of supersenses in init_model 2015-07-04 12:24:40 +02:00
Matthew Honnibal
ff1f9fe246 * Fix init_model to read supersenses from wordnet, not pre-computed supersenses file 2015-07-03 13:28:39 +02:00
Matthew Honnibal
c60cc22390 * Ignore adjective supersenses 2015-07-03 04:46:11 +02:00
Matthew Honnibal
62cfcd76fe * Add supersense sets to lexemes, from WordNet. Look-up via lemmatization. 2015-07-01 18:48:59 +02:00
Matthew Honnibal
c8a553fe91 * Fix cluster initialization 2015-05-31 15:21:28 +02:00
Matthew Honnibal
c037f80638 * Add case expansion to Brown clusters 2015-05-31 05:50:50 +02:00
Matthew Honnibal
5ab0f233a1 * Ensure words in Brown clusters make it into the vocab, even if they're not in our probs list 2015-05-31 05:46:16 +02:00
Matthew Honnibal
4489d87550 * Add cluster=0 by default in init_model 2015-04-29 14:23:13 +02:00
Matthew Honnibal
693c5a1558 * Exclude clusterings for words only seen 1 or 2 times, as their clusters are unreliable 2015-04-17 04:44:52 +02:00
Matthew Honnibal
1629b33082 * Fix copying of tokenizer data in init_model 2015-04-12 04:45:31 +02:00
Matthew Honnibal
baff0f8ad8 * Add docstring explaining script a bit, and add handling of word vectors 2015-04-08 08:20:15 +02:00
Matthew Honnibal
156b70ed82 * Add new script to replace make_lexicon, that does full setup of data 2015-04-08 07:46:53 +02:00