| 
							
							
								 Matthew Honnibal | 9590968fc1 | * Fix negative indices in Span | 2015-07-30 02:30:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 74d8cb3980 | * Add noun_chunks iterator, and fix left/right child setting in Doc.merge | 2015-07-30 02:29:49 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d153f18969 | * Fix negative indices on spans | 2015-07-29 22:36:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b5132bed7d | * Set left and right children when loading parse from byte string | 2015-07-28 21:03:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6609fcf4b2 | * Make mem and vocab python-visible in Doc | 2015-07-28 20:46:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aa7a964a4f | * Add a type declaration for doc.from_array | 2015-07-27 22:57:22 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8e4c69ee8c | * Add is_oov property, and fix up handling of attributes | 2015-07-27 01:50:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6bb96c122d | * Host IS_ flags in attrs.pxd, and add properties for them on Token and Lexeme objects | 2015-07-26 16:37:16 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2060935cdb | * Remove explicit bytes type in doc.from_bytes, to accept bytearray | 2015-07-24 04:54:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0bb839d299 | * Fix string coercion for Python 3 | 2015-07-24 03:49:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a0e36e8efc | * Add working to/from bytes API to Doc | 2015-07-23 01:14:45 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4d61239eac | * Reorganize the serialization functions on Doc | 2015-07-22 04:53:01 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8743a8c084 | * Update Doc serialization for new Packer interface | 2015-07-20 01:38:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 317cbbc015 | * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. | 2015-07-19 15:18:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6b13e7227c | * Remove duplicate get_lex_attr method from doc.pyx | 2015-07-18 22:46:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ced59ab9ea | * Make minor efficiency improvement in Doc.__iter__ | 2015-07-18 04:10:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cf0c788892 | * Tests passing on round-trip pack/unpack on basic example | 2015-07-17 21:20:48 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dfdf19f6a9 | * Draft a from_orth method for Doc | 2015-07-17 16:39:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | db9dfd2e23 | * Major refactor of serialization. Nearly complete now. | 2015-07-17 01:27:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d8458d6a25 | * Fix attr_id_t import in Spans | 2015-07-16 19:55:21 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a6f401580d | * Add from_array function to Doc. | 2015-07-16 17:46:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 30be4f15da | * Import attrs from spacy.attrs, not spacy.typedefs | 2015-07-16 11:23:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e2133d990e | * Move serialization functionality out into a Serializer object | 2015-07-16 11:21:44 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 01fab6bb90 | * Improve de/serialize functions | 2015-07-16 01:26:35 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0e07c1ed2a | * draft de/serialization functions in doc.pyx | 2015-07-16 01:16:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9d956b07e9 | * Fix import of attrs in doc.pyx, and update the get_token_attr function. | 2015-07-16 01:15:34 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 935ac53ee3 | * Extend count_by method | 2015-07-14 03:20:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 81aa4e6dcc | * Go back to having token reference doc, instead of complicated gymnastics. Rename the attr 'doc', to expose it in the API | 2015-07-14 00:10:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8214b74eec | * Restore _py_tokens cache, to handle orphan tokens. | 2015-07-13 22:28:10 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 67641f3b58 | * Refactor tokenizer, to set the 'spacy' field on TokenC instead of passing a string | 2015-07-13 21:46:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6eef0bf9ab | * Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx | 2015-07-13 20:20:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3ea8756c24 | * Add spacy/tokens/doc.pyx, for Doc class in its own file | 2015-07-13 19:58:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d27899658e | * Import classes in spacy.tokens.__init__ | 2015-07-13 19:48:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dba6b47d4e | * Refactor monster tokens.pyx file, into a tokens/ subpackage. Try to break the cycle between Doc and Token, and remove the need to pass around a unicode string reference | 2015-07-13 19:20:48 +02:00 |  |