Matthew Honnibal
|
4fb038a9eb
|
* Update conll_train.py script for spaCy v0.97
|
2015-10-31 00:53:51 +11:00 |
|
Matthew Honnibal
|
cfaa4bde5d
|
* Add train and parse scripts that use CoNLL formatted data
|
2015-10-30 12:54:49 +11:00 |
|
Matthew Honnibal
|
83dccf0fd7
|
* Use io module insteads of deprecated codecs module
|
2015-10-10 14:13:01 +11:00 |
|
Matthew Honnibal
|
f35632e2e5
|
* Remove SBD print statement in train, after SBD evaluation was removed from Scorer
|
2015-10-09 11:08:58 +02:00 |
|
Matthew Honnibal
|
6ea1601e93
|
* Add script to train models off the UD treebanks. Note that the UD data is restricted to research purposes only, and should only be used to train models for academic experiments.
|
2015-10-08 12:01:08 +11:00 |
|
Matthew Honnibal
|
c503654ec1
|
* Update bin/parser/train for printing output.
|
2015-10-06 10:35:22 +11:00 |
|
alvations
|
764bdc62e7
|
caught another codecs.open
|
2015-09-30 20:16:52 +02:00 |
|
Matthew Honnibal
|
b2e82e55f6
|
* Create POS model dir in training script
|
2015-09-08 15:36:23 +02:00 |
|
Matthew Honnibal
|
d1eea2d865
|
* Update train.py for language-generic spaCy
|
2015-09-06 17:51:48 +02:00 |
|
Matthew Honnibal
|
ddc1a5cfe5
|
* Fix training under python3
|
2015-07-28 14:09:30 +02:00 |
|
Matthew Honnibal
|
c52179f5fa
|
* Use print function in train.py, for py 2/3 compatibility
|
2015-07-24 04:52:35 +02:00 |
|
Matthew Honnibal
|
4729200dfc
|
* Whitespace
|
2015-07-23 01:19:26 +02:00 |
|
Matthew Honnibal
|
317cbbc015
|
* Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time.
|
2015-07-19 15:18:17 +02:00 |
|
Matthew Honnibal
|
a6ff7e6ca4
|
* Fix redundant options in train.py
|
2015-07-17 22:38:05 +02:00 |
|
Matthew Honnibal
|
31b5e58aeb
|
* Begin reorganizing neuralnet work
|
2015-06-30 14:26:53 +02:00 |
|
Matthew Honnibal
|
1135cfe50a
|
* Tidy nn_train a bit
|
2015-06-29 16:45:14 +02:00 |
|
Matthew Honnibal
|
df8179ca4f
|
* Add separate Param and AdadeltaParam classes. AdadeltaParam seems broken.
|
2015-06-29 16:39:16 +02:00 |
|
Matthew Honnibal
|
1dff04acb5
|
* Apply regularization to the softmax, not the bias
|
2015-06-29 11:45:38 +02:00 |
|
Matthew Honnibal
|
ca30fe1582
|
* Use He initialization trick
|
2015-06-29 10:56:02 +02:00 |
|
Matthew Honnibal
|
fc34e1b6e4
|
* Move Theano functions into nn_train.py script
|
2015-06-29 07:09:16 +02:00 |
|
Matthew Honnibal
|
fe7b24ecef
|
* whitespace
|
2015-06-28 11:37:17 +02:00 |
|
Matthew Honnibal
|
7b8275fcc4
|
* Wire hyperparameters to script interface
|
2015-06-28 11:37:17 +02:00 |
|
Matthew Honnibal
|
897dd0dd0b
|
* Merge changes, and adjust Example to use memoryview
|
2015-06-28 11:36:11 +02:00 |
|
Matthew Honnibal
|
ef97b90833
|
* Fix token scoring
|
2015-06-28 06:22:18 +02:00 |
|
Matthew Honnibal
|
34c0ef2ee8
|
* Don't compile the orig_arc_eager and tree_arc_eager modules used for the EMNLP paper
|
2015-06-23 05:38:17 +02:00 |
|
Matthew Honnibal
|
59e9f9153c
|
* Remove projectivity constraint in train.py, but raise Exception if non-projective sentence is encountered, since we've told GoldParse to projectivize
|
2015-06-23 05:04:46 +02:00 |
|
Matthew Honnibal
|
839e5038b7
|
* Raise exception on non-projective input
|
2015-06-23 00:01:55 +02:00 |
|
Matthew Honnibal
|
4dad4058c3
|
* Uncomment NER training
|
2015-06-16 23:36:54 +02:00 |
|
Matthew Honnibal
|
5699585278
|
* Use tree_arc_eager system as baseline in experiments
|
2015-06-15 08:23:43 +02:00 |
|
Matthew Honnibal
|
4841f8ad5e
|
* Set transition system early
|
2015-06-15 02:54:12 +02:00 |
|
Matthew Honnibal
|
bcfdf126a4
|
* Add toggle for OrigArcEager system
|
2015-06-14 20:28:14 +02:00 |
|
Matthew Honnibal
|
c500d72dc2
|
* Temporarily disable NER, and wire up the verbose flag during training
|
2015-06-14 17:45:31 +02:00 |
|
Matthew Honnibal
|
ac422492cf
|
* Fix write_parses mode of bin/parser/train.py
|
2015-06-07 19:08:48 +02:00 |
|
Matthew Honnibal
|
1736fc5a67
|
* Add more options to bin/parser/train
|
2015-06-05 23:49:26 +02:00 |
|
Matthew Honnibal
|
362f87dc3a
|
* Update input corruption method to work with lists as well as trings
|
2015-06-05 19:33:32 +02:00 |
|
Matthew Honnibal
|
0aed9c9a33
|
* Fix train.py
|
2015-06-05 15:50:24 +02:00 |
|
Matthew Honnibal
|
8466600add
|
* Clean up train.py, removing unused tag jackknifing code
|
2015-06-05 15:01:28 +02:00 |
|
Matthew Honnibal
|
e772b48dcd
|
* Skip sentences of length 1 in training
|
2015-06-05 02:29:03 +02:00 |
|
Matthew Honnibal
|
e822df0867
|
* Fix bugs in new greedy/beam parser
|
2015-06-02 02:01:33 +02:00 |
|
Matthew Honnibal
|
70a7ad89ca
|
* Removed unused imports from train.py
|
2015-06-02 00:59:09 +02:00 |
|
Matthew Honnibal
|
a3de20118e
|
* Wire up beam-width command line argument
|
2015-06-02 00:54:12 +02:00 |
|
Matthew Honnibal
|
08044ea70c
|
* Remove try/except around parser.train
|
2015-05-31 15:21:56 +02:00 |
|
Matthew Honnibal
|
d7cc2338e7
|
* Fix bug in train.py
|
2015-05-31 06:49:06 +02:00 |
|
Matthew Honnibal
|
d42dda0372
|
* Shuffle docs before doing jackknife partition --- otherwise we'll not get the right genre mixes...
|
2015-05-31 01:25:02 +02:00 |
|
Matthew Honnibal
|
d512d20d81
|
* Allow parser to jackknife POS tags before training.
|
2015-05-31 01:11:11 +02:00 |
|
Matthew Honnibal
|
6bbdcc5db5
|
* Fix gold_preproc flag in train.py
|
2015-05-30 05:23:02 +02:00 |
|
Matthew Honnibal
|
76300bbb1b
|
* Use updated JSON format, with sentences below paragraphs. Allows use of gold preprocessing flag.
|
2015-05-30 01:25:46 +02:00 |
|
Matthew Honnibal
|
b76bbbd12c
|
* Read json files recursively from a directory, instead of requiring a single .json file
|
2015-05-29 03:52:55 +02:00 |
|
Matthew Honnibal
|
ef67ef7a4c
|
* Recomment in training in train.py
|
2015-05-28 22:40:26 +02:00 |
|
Matthew Honnibal
|
f42dc1f7d8
|
* Fix evaluate method in train.py, to use sentences which don't have raw text
|
2015-05-28 16:30:23 +02:00 |
|