Matthew Honnibal | 35124b144a | Add L1 penalty option to parser | 2017-03-09 18:44:53 -06:00 |
Matthew Honnibal | 04a51dab62 | Print active parser features during training | 2017-03-08 01:37:19 +01:00 |
Matthew Honnibal | 314bc8d34f | Fix train script for 1.0 | 2016-11-25 08:57:37 -06:00 |
Matthew Honnibal | bd1bfcca61 | Update train.py | 2016-10-13 03:23:48 +02:00 |
Matthew Honnibal | ea23b64cc8 | Refactor training, with new spacy.train module. Defaults still a little awkward. | 2016-10-09 12:24:24 +02:00 |
Matthew Honnibal | 53fbd3dd1c | Fix train.py for v1.0.0-rc1 | 2016-10-05 01:11:46 +02:00 |
Wolfgang Seeker | dae6bc05eb | define German dummy lemmatizer until morphology is done | 2016-05-02 16:04:53 +02:00 |
Matthew Honnibal | 8569dbc2d0 | * Add initial stuff for Chinese parsing | 2016-04-24 18:44:24 +02:00 |
Matthew Honnibal | d249e2f7f3 | * Improve error message in bin/parser/train.py | 2016-03-29 13:04:33 +11:00 |
Wolfgang Seeker | 690c5acabf | adjust train.py to train both english and german models | 2016-03-03 15:21:00 +01:00 |
Matthew Honnibal | 6e68b344c1 | * Train after parsing, not before. | 2015-11-12 04:43:52 +11:00 |
Matthew Honnibal | 83dccf0fd7 | * Use io module insteads of deprecated codecs module | 2015-10-10 14:13:01 +11:00 |
Matthew Honnibal | f35632e2e5 | * Remove SBD print statement in train, after SBD evaluation was removed from Scorer | 2015-10-09 11:08:58 +02:00 |
Matthew Honnibal | c503654ec1 | * Update bin/parser/train for printing output. | 2015-10-06 10:35:22 +11:00 |
alvations | 764bdc62e7 | caught another codecs.open | 2015-09-30 20:16:52 +02:00 |
Matthew Honnibal | b2e82e55f6 | * Create POS model dir in training script | 2015-09-08 15:36:23 +02:00 |
Matthew Honnibal | d1eea2d865 | * Update train.py for language-generic spaCy | 2015-09-06 17:51:48 +02:00 |
Matthew Honnibal | ddc1a5cfe5 | * Fix training under python3 | 2015-07-28 14:09:30 +02:00 |
Matthew Honnibal | c52179f5fa | * Use print function in train.py, for py 2/3 compatibility | 2015-07-24 04:52:35 +02:00 |
Matthew Honnibal | 4729200dfc | * Whitespace | 2015-07-23 01:19:26 +02:00 |
Matthew Honnibal | 317cbbc015 | * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. | 2015-07-19 15:18:17 +02:00 |
Matthew Honnibal | a6ff7e6ca4 | * Fix redundant options in train.py | 2015-07-17 22:38:05 +02:00 |
Matthew Honnibal | ef97b90833 | * Fix token scoring | 2015-06-28 06:22:18 +02:00 |
Matthew Honnibal | 34c0ef2ee8 | * Don't compile the orig_arc_eager and tree_arc_eager modules used for the EMNLP paper | 2015-06-23 05:38:17 +02:00 |
Matthew Honnibal | 59e9f9153c | * Remove projectivity constraint in train.py, but raise Exception if non-projective sentence is encountered, since we've told GoldParse to projectivize | 2015-06-23 05:04:46 +02:00 |
Matthew Honnibal | 839e5038b7 | * Raise exception on non-projective input | 2015-06-23 00:01:55 +02:00 |
Matthew Honnibal | 4dad4058c3 | * Uncomment NER training | 2015-06-16 23:36:54 +02:00 |
Matthew Honnibal | 5699585278 | * Use tree_arc_eager system as baseline in experiments | 2015-06-15 08:23:43 +02:00 |
Matthew Honnibal | 4841f8ad5e | * Set transition system early | 2015-06-15 02:54:12 +02:00 |
Matthew Honnibal | bcfdf126a4 | * Add toggle for OrigArcEager system | 2015-06-14 20:28:14 +02:00 |
Matthew Honnibal | c500d72dc2 | * Temporarily disable NER, and wire up the verbose flag during training | 2015-06-14 17:45:31 +02:00 |
Matthew Honnibal | ac422492cf | * Fix write_parses mode of bin/parser/train.py | 2015-06-07 19:08:48 +02:00 |
Matthew Honnibal | 1736fc5a67 | * Add more options to bin/parser/train | 2015-06-05 23:49:26 +02:00 |
Matthew Honnibal | 362f87dc3a | * Update input corruption method to work with lists as well as trings | 2015-06-05 19:33:32 +02:00 |
Matthew Honnibal | 0aed9c9a33 | * Fix train.py | 2015-06-05 15:50:24 +02:00 |
Matthew Honnibal | 8466600add | * Clean up train.py, removing unused tag jackknifing code | 2015-06-05 15:01:28 +02:00 |
Matthew Honnibal | e772b48dcd | * Skip sentences of length 1 in training | 2015-06-05 02:29:03 +02:00 |
Matthew Honnibal | e822df0867 | * Fix bugs in new greedy/beam parser | 2015-06-02 02:01:33 +02:00 |
Matthew Honnibal | 70a7ad89ca | * Removed unused imports from train.py | 2015-06-02 00:59:09 +02:00 |
Matthew Honnibal | a3de20118e | * Wire up beam-width command line argument | 2015-06-02 00:54:12 +02:00 |
Matthew Honnibal | 08044ea70c | * Remove try/except around parser.train | 2015-05-31 15:21:56 +02:00 |
Matthew Honnibal | d7cc2338e7 | * Fix bug in train.py | 2015-05-31 06:49:06 +02:00 |
Matthew Honnibal | d42dda0372 | * Shuffle docs before doing jackknife partition --- otherwise we'll not get the right genre mixes... | 2015-05-31 01:25:02 +02:00 |
Matthew Honnibal | d512d20d81 | * Allow parser to jackknife POS tags before training. | 2015-05-31 01:11:11 +02:00 |
Matthew Honnibal | 6bbdcc5db5 | * Fix gold_preproc flag in train.py | 2015-05-30 05:23:02 +02:00 |
Matthew Honnibal | 76300bbb1b | * Use updated JSON format, with sentences below paragraphs. Allows use of gold preprocessing flag. | 2015-05-30 01:25:46 +02:00 |
Matthew Honnibal | b76bbbd12c | * Read json files recursively from a directory, instead of requiring a single .json file | 2015-05-29 03:52:55 +02:00 |
Matthew Honnibal | ef67ef7a4c | * Recomment in training in train.py | 2015-05-28 22:40:26 +02:00 |
Matthew Honnibal | f42dc1f7d8 | * Fix evaluate method in train.py, to use sentences which don't have raw text | 2015-05-28 16:30:23 +02:00 |
Matthew Honnibal | a7cee46fe9 | * Update train.py, to support paragraphs where there's no raw_text | 2015-05-27 19:14:02 +02:00 |