Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8a4c9c33f1
							
						
					 | 
					
						
						
							
							* Fix test_token_references test
						
						
						
						
						
					 | 
					
						2015-06-07 18:33:04 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							15123329b1
							
						
					 | 
					
						
						
							
							* Have travis test the pip version of the code
						
						
						
						
						
					 | 
					
						2015-06-07 18:17:19 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							5f44adc659
							
						
					 | 
					
						
						
							
							* Add tests/spans/conftest.py
						
						
						
						
						
					 | 
					
						2015-06-07 18:07:59 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							dd587b7477
							
						
					 | 
					
						
						
							
							* Fix tests
						
						
						
						
						
					 | 
					
						2015-06-07 18:07:32 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e3af6af83c
							
						
					 | 
					
						
						
							
							* Add tests/vocab/conftest.py
						
						
						
						
						
					 | 
					
						2015-06-07 18:02:47 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							88041f69d1
							
						
					 | 
					
						
						
							
							* More work on reorganising tests, using conftest.py
						
						
						
						
						
					 | 
					
						2015-06-07 18:02:24 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							674ee5dde7
							
						
					 | 
					
						
						
							
							* Add conftest.py to tests/, to allow session-global pipeline. This allows much faster tests.
						
						
						
						
						
					 | 
					
						2015-06-07 17:53:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							877abb0e5b
							
						
					 | 
					
						
						
							
							* Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds.
						
						
						
						
						
					 | 
					
						2015-06-07 17:24:49 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1d5f20fdda
							
						
					 | 
					
						
						
							
							* Move nlp variable from global scope
						
						
						
						
						
					 | 
					
						2015-06-07 16:55:11 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d37dca72dd
							
						
					 | 
					
						
						
							
							* Reorganize tests
						
						
						
						
						
					 | 
					
						2015-06-07 16:49:46 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							2ef3555d88
							
						
					 | 
					
						
						
							
							* Add ujson to requirements.txt
						
						
						
						
						
					 | 
					
						2015-06-07 03:22:17 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8f142c1838
							
						
					 | 
					
						
						
							
							* Refactor transition system oracles, to split out move and label cost. Preparing to add Unshift move. Will exclude non-monotonic.
						
						
						
						
						
					 | 
					
						2015-06-07 03:21:29 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e2578fbb90
							
						
					 | 
					
						
						
							
							* Avoid parsing and tagging in test_emoticons
						
						
						
						
						
					 | 
					
						2015-06-06 05:59:20 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							89b8775887
							
						
					 | 
					
						
						
							
							* Fix output from _min_edit_path when inputs match.
						
						
						
						
						
					 | 
					
						2015-06-06 05:58:53 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							27c8dc3db2
							
						
					 | 
					
						
						
							
							* Run tests one file at a time, as the teardown isn't cleaning up objects in global namespace properly
						
						
						
						
						
					 | 
					
						2015-06-06 05:58:16 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							4126ef3b8c
							
						
					 | 
					
						
						
							
							* Restore hyphenation test to test_infix
						
						
						
						
						
					 | 
					
						2015-06-06 05:57:36 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							98cfd84123
							
						
					 | 
					
						
						
							
							* Remove hyphenation from main tokenizer loop: do it in infix.txt instead. This lets emoticons work
						
						
						
						
						
					 | 
					
						2015-06-06 05:57:03 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							45ec92243a
							
						
					 | 
					
						
						
							
							* Add hyphenation rule to infix.txt for tokenizer
						
						
						
						
						
					 | 
					
						2015-06-06 05:56:00 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							4073533e28
							
						
					 | 
					
						
						
							
							* Upd munge_ewtb for the new json format
						
						
						
						
						
					 | 
					
						2015-06-06 02:10:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6a1341b29e
							
						
					 | 
					
						
						
							
							* Add tb pre-process script
						
						
						
						
						
					 | 
					
						2015-06-06 01:59:44 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a57ced0ead
							
						
					 | 
					
						
						
							
							* Pin cython packages to particular versions, so that the current version works even if updates to them are pushed.
						
						
						
						
						
					 | 
					
						2015-06-05 23:51:39 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1736fc5a67
							
						
					 | 
					
						
						
							
							* Add more options to bin/parser/train
						
						
						
						
						
					 | 
					
						2015-06-05 23:49:26 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							1fee7ade61
							
						
					 | 
					
						
						
							
							* Tweak to ner
						
						
						
						
						
					 | 
					
						2015-06-05 23:48:43 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							362f87dc3a
							
						
					 | 
					
						
						
							
							* Update input corruption method to work with lists as well as trings
						
						
						
						
						
					 | 
					
						2015-06-05 19:33:32 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							33e70b167f
							
						
					 | 
					
						
						
							
							* Remove dead code from ner.pyx
						
						
						
						
						
					 | 
					
						2015-06-05 17:12:47 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							88ac5c6e98
							
						
					 | 
					
						
						
							
							* Send beam_width < 0 to greedy parser
						
						
						
						
						
					 | 
					
						2015-06-05 17:12:06 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0114e7600d
							
						
					 | 
					
						
						
							
							* Fix NER oracle
						
						
						
						
						
					 | 
					
						2015-06-05 17:11:26 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							c04e6ebca6
							
						
					 | 
					
						
						
							
							* Allow user to load different sized vectors.
						
						
						
						
						
					 | 
					
						2015-06-05 16:26:39 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0aed9c9a33
							
						
					 | 
					
						
						
							
							* Fix train.py
						
						
						
						
						
					 | 
					
						2015-06-05 15:50:24 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							23e2f26535
							
						
					 | 
					
						
						
							
							* Require thinc 1.76
						
						
						
						
						
					 | 
					
						2015-06-05 15:50:05 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							8466600add
							
						
					 | 
					
						
						
							
							* Clean up train.py, removing unused tag jackknifing code
						
						
						
						
						
					 | 
					
						2015-06-05 15:01:28 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e772b48dcd
							
						
					 | 
					
						
						
							
							* Skip sentences of length 1 in training
						
						
						
						
						
					 | 
					
						2015-06-05 02:29:03 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6bf35cecc3
							
						
					 | 
					
						
						
							
							* Refactor transition system to use classes with staticmethods.
						
						
						
						
						
					 | 
					
						2015-06-05 02:27:17 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							36a34d544b
							
						
					 | 
					
						
						
							
							* Refactoring arc_eager, grouping oracle functions into transitions
						
						
						
						
						
					 | 
					
						2015-06-04 22:43:03 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							4433396005
							
						
					 | 
					
						
						
							
							* Impove efficiency of dynamic oracle, making beam training faster
						
						
						
						
						
					 | 
					
						2015-06-04 21:15:14 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							079dad28a7
							
						
					 | 
					
						
						
							
							* Update for faster beam training
						
						
						
						
						
					 | 
					
						2015-06-04 19:32:32 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							f8843906ad
							
						
					 | 
					
						
						
							
							Merge branch 'constituency'
						
						
						
						
						
						
						
						Add beam parsing and training from JSON files, with Levenshtein alignment. 
						
					 | 
					
						2015-06-03 06:07:24 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							ae653b850a
							
						
					 | 
					
						
						
							
							* Remove unused import from gold.pyx
						
						
						
						
						
					 | 
					
						2015-06-03 06:07:15 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a2627b6102
							
						
					 | 
					
						
						
							
							* Fix bug in refactored init_transition
						
						
						
						
						
					 | 
					
						2015-06-03 06:01:26 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							dd0867645d
							
						
					 | 
					
						
						
							
							* Remove stray const from State header
						
						
						
						
						
					 | 
					
						2015-06-03 00:10:04 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							6c47b10a6e
							
						
					 | 
					
						
						
							
							* Make optimization to children_in_buffer: stop searching when we would cross a bracket.
						
						
						
						
						
					 | 
					
						2015-06-02 21:05:24 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a513ec500f
							
						
					 | 
					
						
						
							
							* Have oracle functions take a struct instead of a Python object
						
						
						
						
						
					 | 
					
						2015-06-02 20:01:06 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							d1b55310a1
							
						
					 | 
					
						
						
							
							* Refactor _advance_beam function
						
						
						
						
						
					 | 
					
						2015-06-02 18:38:41 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							0786d9b3c7
							
						
					 | 
					
						
						
							
							* Refactor TransitionSystem, adding set_valid method
						
						
						
						
						
					 | 
					
						2015-06-02 18:38:07 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							bd82a49994
							
						
					 | 
					
						
						
							
							* Add set_scores method to Model
						
						
						
						
						
					 | 
					
						2015-06-02 18:37:10 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							a3964957f6
							
						
					 | 
					
						
						
							
							* Add profiling for _state.pyx
						
						
						
						
						
					 | 
					
						2015-06-02 18:36:27 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							e822df0867
							
						
					 | 
					
						
						
							
							* Fix bugs in new greedy/beam parser
						
						
						
						
						
					 | 
					
						2015-06-02 02:01:33 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							66dfa95847
							
						
					 | 
					
						
						
							
							* Revise greedy_parse/beam_parse ownership goof
						
						
						
						
						
					 | 
					
						2015-06-02 01:34:19 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							70a7ad89ca
							
						
					 | 
					
						
						
							
							* Removed unused imports from train.py
						
						
						
						
						
					 | 
					
						2015-06-02 00:59:09 +02:00 | 
					
					
						
						
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Matthew Honnibal
							
						 
					 | 
					
						
						
						
						
							
						
						
							75658b2ed3
							
						
					 | 
					
						
						
							
							* Remove use of new beam.loss property, to maintain compatibility with older versions of thinc for now.
						
						
						
						
						
					 | 
					
						2015-06-02 00:57:09 +02:00 | 
					
					
						
						
							
							
							
						
					 |