Commit Graph

96 Commits

Author SHA1 Message Date
Matthew Honnibal
c388833ca6 Minibatch by number of tokens, support other vectors, refactor CoNLL printing 2018-02-25 10:38:06 +01:00
Matthew Honnibal
dd78ef066a Unset data size limit in conll script 2018-02-24 18:14:57 +01:00
Matthew Honnibal
8adeea3746 Generalize conllu script. Now handling Chinese (maybe badly) 2018-02-24 16:04:27 +01:00
Matthew Honnibal
329b14c9e6 Clean up conllu script 2018-02-24 10:31:53 +01:00
Matthew Honnibal
5be092ee72 CONLLU scoring 80.9% UAS with no oracle segments 2018-02-23 23:49:17 +01:00
Matthew Honnibal
23236340f4 Update CoNLL script. Don't preset SBD. Set batch size to 8, avoid writing twice 2018-02-22 21:35:50 +01:00
Matthew Honnibal
a26e399f84 Update conllu script 2018-02-22 19:43:54 +01:00
Matthew Honnibal
001e2ec6d6 Refactor CoNLL training script 2018-02-22 16:00:34 +01:00
Matthew Honnibal
6a27a4f77c Set accelerating batch size in CONLL train script 2018-02-21 21:02:41 +01:00
Matthew Honnibal
4dc0fc9954 Replace labels that didn't make freq cutoff 2018-02-21 15:59:22 +01:00
Matthew Honnibal
97164b1763 Fix conllu script 2018-02-21 14:46:54 +01:00
Matthew Honnibal
24fb2c246f Add script to do conllu training 2018-02-21 13:53:59 +01:00
Matthew Honnibal
00557c5fdd Add example of NER multitask objective 2018-01-21 19:46:37 +01:00
mpuels
1e8147aec7
fix: Add missing period in train data 2017-12-13 10:51:05 +01:00
mpuels
ee4d6fdd40
Fix typo in comment 2017-12-09 13:14:57 +01:00
ines
726fb2d0b5 Use fewer iterations by default to avoid overfitting on blank model (resolves #1632) 2017-11-23 15:27:12 +01:00
ines
ec08996000 Add note on tags matching tokenization (see #1613) 2017-11-20 15:12:47 +01:00
ines
f36fab39b0 Don't rename component in intent parser example (resolves #1551)
Otherwise, the default saved model won't know that it's supposed to create spaCy's 'parser'.
2017-11-10 23:35:38 +01:00
Ines Montani
1a23a0f87e
Remove broken link (resolves #1541) 2017-11-10 12:28:39 +01:00
ines
89bd40b821 Fix print statement in textcat training example (resolves #1515) 2017-11-08 17:17:40 +01:00
ines
a09c096d3c Get docs ready for v2.0.0 2017-11-07 12:00:43 +01:00
ines
173b1551af Update examples 2017-11-07 01:22:30 +01:00
ines
1b1c9105b4 Update example compatibility statements 2017-11-07 01:11:45 +01:00
ines
8fb48b9b91 Update and document new util functions 2017-11-07 00:22:43 +01:00
Matthew Honnibal
d7016d4050 Update intent parser example 2017-11-06 23:31:11 +01:00
ines
fe498b3d5e Update training examples to use "simple style" 2017-11-06 23:14:04 +01:00
ines
2dca9e71a1 Add notes on catastrophic forgetting (see #1496) 2017-11-06 13:17:02 +01:00
Matthew Honnibal
e033162a1d Update tagger training example 2017-11-01 21:49:08 +01:00
ines
8f1d3fc3ee Update textcat example 2017-11-01 17:09:22 +01:00
Matthew Honnibal
dad8f09fba Fix print statements in text classifier example 2017-11-01 16:34:31 +01:00
ines
bfe17b7df1 Fix begin_training if get_gold_tuples is None 2017-11-01 13:14:31 +01:00
ines
4b196fdf7f Fix formatting 2017-11-01 00:43:22 +01:00
ines
33af6ac69a Use even smaller examle size
100 was still too much, so try 20 instead
2017-10-30 19:46:45 +01:00
ines
f02b0af821 Fix path and use smaller example size
500 was too larger and caused laggy rendering
2017-10-30 19:44:35 +01:00
ines
18dde7869a Update training data docs and add vocab JSONL 2017-10-30 19:40:05 +01:00
ines
b5643d8575 Update intent parser docs and add to usage docs 2017-10-27 04:49:05 +02:00
ines
9dfca0f2f8 Add example for custom intent parser 2017-10-27 03:55:11 +02:00
ines
4d272e25ee Fix examples 2017-10-27 03:55:04 +02:00
ines
a7b9074b4c Update textcat training example and docs 2017-10-27 00:48:45 +02:00
ines
b61866a2e4 Update textcat example 2017-10-27 00:32:19 +02:00
ines
f81cc0bd1c Fix usage of disable_pipes 2017-10-27 00:31:30 +02:00
ines
f57043e6fe Update docstring 2017-10-26 16:29:08 +02:00
ines
b90e958975 Update tagger and parser examples and add to docs 2017-10-26 16:27:42 +02:00
ines
f1529463a8 Update tagger training example 2017-10-26 16:19:02 +02:00
ines
e44bbb5361 Remove old example 2017-10-26 16:12:41 +02:00
ines
421c3837e8 Fix formatting 2017-10-26 16:11:25 +02:00
ines
4d896171ae Use plac annotations for arguments 2017-10-26 16:11:20 +02:00
ines
c3b681e5fb Use plac annotations for arguments and add n_iter 2017-10-26 16:11:05 +02:00
ines
bc2c92f22d Use plac annotations for arguments 2017-10-26 16:10:56 +02:00
ines
b5c74dbb34 Update parser training example 2017-10-26 15:15:37 +02:00