| Author | Commit | Message | Date |
| --- | --- | --- | --- |
| Matthew Honnibal | bd1bfcca61 | Update train.py | 2016-10-13 03:23:48 +02:00 |
| Matthew Honnibal | ea23b64cc8 | Refactor training, with new spacy.train module. Defaults still a little awkward. | 2016-10-09 12:24:24 +02:00 |
| Matthew Honnibal | 53fbd3dd1c | Fix train.py for v1.0.0-rc1 | 2016-10-05 01:11:46 +02:00 |
| Wolfgang Seeker | dae6bc05eb | define German dummy lemmatizer until morphology is done | 2016-05-02 16:04:53 +02:00 |
| Matthew Honnibal | 8569dbc2d0 | * Add initial stuff for Chinese parsing | 2016-04-24 18:44:24 +02:00 |
| Matthew Honnibal | d249e2f7f3 | * Improve error message in bin/parser/train.py | 2016-03-29 13:04:33 +11:00 |
| Wolfgang Seeker | 690c5acabf | adjust train.py to train both english and german models | 2016-03-03 15:21:00 +01:00 |
| Matthew Honnibal | 6e68b344c1 | * Train after parsing, not before. | 2015-11-12 04:43:52 +11:00 |
| Matthew Honnibal | 83dccf0fd7 | * Use io module insteads of deprecated codecs module | 2015-10-10 14:13:01 +11:00 |
| Matthew Honnibal | f35632e2e5 | * Remove SBD print statement in train, after SBD evaluation was removed from Scorer | 2015-10-09 11:08:58 +02:00 |
| Matthew Honnibal | c503654ec1 | * Update bin/parser/train for printing output. | 2015-10-06 10:35:22 +11:00 |
| alvations | 764bdc62e7 | caught another codecs.open | 2015-09-30 20:16:52 +02:00 |
| Matthew Honnibal | b2e82e55f6 | * Create POS model dir in training script | 2015-09-08 15:36:23 +02:00 |
| Matthew Honnibal | d1eea2d865 | * Update train.py for language-generic spaCy | 2015-09-06 17:51:48 +02:00 |
| Matthew Honnibal | ddc1a5cfe5 | * Fix training under python3 | 2015-07-28 14:09:30 +02:00 |
| Matthew Honnibal | c52179f5fa | * Use print function in train.py, for py 2/3 compatibility | 2015-07-24 04:52:35 +02:00 |
| Matthew Honnibal | 4729200dfc | * Whitespace | 2015-07-23 01:19:26 +02:00 |
| Matthew Honnibal | 317cbbc015 | * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. | 2015-07-19 15:18:17 +02:00 |
| Matthew Honnibal | a6ff7e6ca4 | * Fix redundant options in train.py | 2015-07-17 22:38:05 +02:00 |
| Matthew Honnibal | ef97b90833 | * Fix token scoring | 2015-06-28 06:22:18 +02:00 |
| Matthew Honnibal | 34c0ef2ee8 | * Don't compile the orig_arc_eager and tree_arc_eager modules used for the EMNLP paper | 2015-06-23 05:38:17 +02:00 |
| Matthew Honnibal | 59e9f9153c | * Remove projectivity constraint in train.py, but raise Exception if non-projective sentence is encountered, since we've told GoldParse to projectivize | 2015-06-23 05:04:46 +02:00 |
| Matthew Honnibal | 839e5038b7 | * Raise exception on non-projective input | 2015-06-23 00:01:55 +02:00 |
| Matthew Honnibal | 4dad4058c3 | * Uncomment NER training | 2015-06-16 23:36:54 +02:00 |
| Matthew Honnibal | 5699585278 | * Use tree_arc_eager system as baseline in experiments | 2015-06-15 08:23:43 +02:00 |
| Matthew Honnibal | 4841f8ad5e | * Set transition system early | 2015-06-15 02:54:12 +02:00 |
| Matthew Honnibal | bcfdf126a4 | * Add toggle for OrigArcEager system | 2015-06-14 20:28:14 +02:00 |
| Matthew Honnibal | c500d72dc2 | * Temporarily disable NER, and wire up the verbose flag during training | 2015-06-14 17:45:31 +02:00 |
| Matthew Honnibal | ac422492cf | * Fix write_parses mode of bin/parser/train.py | 2015-06-07 19:08:48 +02:00 |
| Matthew Honnibal | 1736fc5a67 | * Add more options to bin/parser/train | 2015-06-05 23:49:26 +02:00 |
| Matthew Honnibal | 362f87dc3a | * Update input corruption method to work with lists as well as trings | 2015-06-05 19:33:32 +02:00 |
| Matthew Honnibal | 0aed9c9a33 | * Fix train.py | 2015-06-05 15:50:24 +02:00 |
| Matthew Honnibal | 8466600add | * Clean up train.py, removing unused tag jackknifing code | 2015-06-05 15:01:28 +02:00 |
| Matthew Honnibal | e772b48dcd | * Skip sentences of length 1 in training | 2015-06-05 02:29:03 +02:00 |
| Matthew Honnibal | e822df0867 | * Fix bugs in new greedy/beam parser | 2015-06-02 02:01:33 +02:00 |
| Matthew Honnibal | 70a7ad89ca | * Removed unused imports from train.py | 2015-06-02 00:59:09 +02:00 |
| Matthew Honnibal | a3de20118e | * Wire up beam-width command line argument | 2015-06-02 00:54:12 +02:00 |
| Matthew Honnibal | 08044ea70c | * Remove try/except around parser.train | 2015-05-31 15:21:56 +02:00 |
| Matthew Honnibal | d7cc2338e7 | * Fix bug in train.py | 2015-05-31 06:49:06 +02:00 |
| Matthew Honnibal | d42dda0372 | * Shuffle docs before doing jackknife partition --- otherwise we'll not get the right genre mixes... | 2015-05-31 01:25:02 +02:00 |
| Matthew Honnibal | d512d20d81 | * Allow parser to jackknife POS tags before training. | 2015-05-31 01:11:11 +02:00 |
| Matthew Honnibal | 6bbdcc5db5 | * Fix gold_preproc flag in train.py | 2015-05-30 05:23:02 +02:00 |
| Matthew Honnibal | 76300bbb1b | * Use updated JSON format, with sentences below paragraphs. Allows use of gold preprocessing flag. | 2015-05-30 01:25:46 +02:00 |
| Matthew Honnibal | b76bbbd12c | * Read json files recursively from a directory, instead of requiring a single .json file | 2015-05-29 03:52:55 +02:00 |
| Matthew Honnibal | ef67ef7a4c | * Recomment in training in train.py | 2015-05-28 22:40:26 +02:00 |
| Matthew Honnibal | f42dc1f7d8 | * Fix evaluate method in train.py, to use sentences which don't have raw text | 2015-05-28 16:30:23 +02:00 |
| Matthew Honnibal | a7cee46fe9 | * Update train.py, to support paragraphs where there's no raw_text | 2015-05-27 19:14:02 +02:00 |
| Matthew Honnibal | 895060e774 | * Ensure tagger and NER are trained, even if non-projective problem | 2015-05-27 03:16:21 +02:00 |
| Matthew Honnibal | 15bbbf4901 | * Remove cruft from train.py | 2015-05-25 07:54:10 +02:00 |
| Matthew Honnibal | fc75210941 | * Move spacy.syntax.conll to spacy.gold | 2015-05-24 21:35:02 +02:00 |