| 
							
							
								 Matthew Honnibal | d24b8509e4 | * Correct screw ups from the previous commits | 2015-11-07 06:51:41 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5efad178b5 | * Set ent tag when close entity | 2015-11-07 06:09:25 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9285f01d26 | * Fix broken StateClass.E tracking | 2015-11-07 06:06:39 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 19136b0e7d | * Add better debug message for illegal move | 2015-11-07 05:34:37 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2733816b7b | * Fix whitespace | 2015-11-07 05:31:06 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 01ab464383 | * Prevent Begin and In moves from applying in NER if we're at the last token of a sentence, as this would mean the entity would span over a sentence boundary. Re Issue #169 | 2015-11-07 05:30:44 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b65633f270 | * Fix function that returns nth entity in StateClass. Was only returning the first. | 2015-11-07 05:29:11 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3c162dcac3 | * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. | 2015-11-07 03:24:30 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b9991fbd20 | * Update to use thinc 3.0 | 2015-11-06 00:25:59 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 68f479e821 | * Rename Doc.data to Doc.c | 2015-11-04 00:15:14 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 329ae57520 | * Fix whitespace attachment thing | 2015-10-13 09:46:38 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 37919eac82 | * Fix whitespace attachment in simpler way. Leaves problem with setting left/right children. | 2015-10-13 18:23:24 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c70eb776ae | * Fix whitespace attachment, so that left/right children are consistent with head. | 2015-10-13 15:58:22 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 20fd36a0f7 | * Very scrappy, likely buggy first-cut pickle implementation, to work on Issue #125: allow pickle for Apache Spark. The current implementation sends stuff to temp files, and does almost nothing to ensure all modifiable state is actually preserved. The Language() instance is a deep tree of extension objects, and if pickling during training, some of the C-data state is hard to preserve. | 2015-10-13 13:44:41 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9dd2f25c74 | * Fix Issue #131: Force whitespace characters to attach syntactically to previous token, and ensure they cannot serve as stand-alone 'sentence' units. | 2015-10-10 15:53:30 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8b39feefbe | * Add dependency post-process rule to ensure spaces are attached to neighbouring tokens, so that they can't be sentence boundaries | 2015-10-10 15:32:13 +11:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0e24d099a1 | * Fix L/R edge bug, by ensuring l_edge and r_edge are preset, and fixing the way the edge update in del_arc. Bugs keep arising here because the edges are absolute positions, where everything else is relative. I'm also not 100% convinced that del_arc is handled correctly. Do we need to update the parents? | 2015-09-09 03:40:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 86c888667f | * Merge in changes from de branch | 2015-09-06 19:49:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5edac11225 | * Wrap self.parse in nogil, and break if an invalid move is predicted. The invalid break is a work-around that papers over likely bugs, but we can't easily break in the nogil block, and otherwise we'll get an infinite loop. Need to set this as an error flag. | 2015-09-06 04:15:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a3d5e6c0dd | * Reform constructor and save/load workflow in parser model | 2015-08-26 19:19:01 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bf38b3b883 | * Hack on l/r reversal bug | 2015-08-10 05:58:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6116413b47 | * Fix label prediction in StepwiseState | 2015-08-10 05:05:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2c9753eff2 | * Whitespace | 2015-08-10 00:09:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9de98f5a6f | * Add Parser.stepthrough method, with context manager | 2015-08-10 00:08:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fe43f8cf39 | * Whitespace | 2015-08-09 02:31:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9c090945e0 | * Add Parser.predict method, and clean up Parser.get_state | 2015-08-09 02:29:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04fccfb984 | * Fix get_state for parser prediction | 2015-08-09 02:11:22 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 55fde0e240 | * Fix get_state | 2015-08-09 01:45:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f0f4fa9838 | * Fix Parser.get_state | 2015-08-09 01:40:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 18331dca89 | * Add continue_for argument to parser 'partial' function, which is now renamed to get_state | 2015-08-09 01:31:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0653288fa5 | * Fix stateclass.queue | 2015-08-09 00:39:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9de218b7ba | * Fix Parser.partial function | 2015-08-08 23:45:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cc9deae960 | * Add is_valid method to transition_system | 2015-08-08 23:36:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2a46c77324 | * Whitespace | 2015-08-08 23:35:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7bafc789e7 | * Add stack and queue properties to stateclass, for python access | 2015-08-08 23:32:42 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3af938365f | * Add function partial to Parser | 2015-08-08 23:32:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 76a1f0481a | * Whitespace | 2015-08-08 23:31:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 59c3bf60a6 | * Ensure entity recognizer doesn't over-write preset types | 2015-08-06 16:09:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9c1724ecae | * Gazetteer stuff working, now need to wire up to API | 2015-08-06 00:35:40 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a8bbd7312c | * Hackishly patch long dependencies problem | 2015-07-28 00:14:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bb583f7f09 | * Hackishly patch long dependencies problem | 2015-07-27 23:14:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 823ef4a00b | * Remove profile declarations | 2015-07-25 18:13:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aa28e2e01d | * Release the GIL around parse function | 2015-07-24 04:53:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d5255aad77 | * Update freqs for missing tags in ner, for serializer | 2015-07-23 01:17:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 12699a1152 | * Set initial freqs, to avoid missing values in serializer | 2015-07-23 01:16:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 317cbbc015 | * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. | 2015-07-19 15:18:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b1d74ce60d | * Remove unused joint.pyx and joint.pxd files | 2015-07-17 23:31:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fb0a641a2d | * Don't release the gil around Parser.parse. Does this indicate thread problems? | 2015-07-17 23:07:37 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e29daea85f | * Fix bint/int typing problem in TransitionSystem. In C++ bint* means bool*, but in C it means int*. So, type-casting to bint* is unsafe. | 2015-07-17 22:37:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 45ae1ce428 | * Remove unused declaration in parser | 2015-07-16 01:27:11 +02:00 |  |