| 
							
							
								 Matthew Honnibal | b0718b6ee1 | * Move to thinc 5.0 | 2016-01-29 03:58:55 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9721502c81 | * Update version | 2016-01-25 15:52:59 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 907e8cf07d | * Add u prefix to string in web example | 2016-01-25 15:51:38 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | eba03695ef | * Comment out pickle tests | 2016-01-25 15:51:13 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | de94e6c525 | * Mark pickle tests as xfail, due to temp files problem | 2016-01-25 15:24:17 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 87172a15c6 | * Fix runtime error bug that arose from updated Span.root function. | 2016-01-25 15:22:42 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2c8dd91785 | * Fix first code example on the website | 2016-01-23 18:09:19 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | af332f5095 | * Add some stream of consciousness about NER | 2016-01-23 13:41:01 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3af84cfd6e | * Increment version | 2016-01-21 17:49:27 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 571d26b773 | Merge branch 'master' of ssh://github.com/honnibal/spaCy | 2016-01-21 17:48:32 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6842f681e5 | Merge pull request #234 from henningpeters/master remove package version constraint | 2016-01-22 03:48:12 +11:00 |  | 
			
				
					| 
							
							
								 Henning Peters | 65aeac24cb | remove package version constraint | 2016-01-21 17:40:51 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0ec4df6d7c | * Write more notes about spaCy's NER | 2016-01-21 16:37:13 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7d16f25218 | * Update release notes | 2016-01-21 00:24:21 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1270506f7e | * Update release notes | 2016-01-21 00:23:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 792c98a438 | * Increment version for OSX-fixed release of v0.100 | 2016-01-21 00:23:04 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 110304f62e | * Start writing bootstrap word2vec tutorial | 2016-01-20 13:51:36 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 82d011ac43 | * Fix test for whitespace | 2016-01-19 20:38:26 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e89069dcae | * Fix matcher test | 2016-01-19 20:24:01 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 63e3d4e27f | * Add comment on Vocab.__reduce__ | 2016-01-19 20:11:25 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e1282b7f2f | * Require user-custom NER classes to work without adding the label. | 2016-01-19 20:11:03 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 84c5dfbfc3 | * Clean up debugging python list | 2016-01-19 20:10:32 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04d0686b26 | * Make TransitionSystem.add_action idempotent, i.e. ignore duplicate added actions. | 2016-01-19 20:10:04 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c4a89d56bd | * Automatically register any entity types pre-set on the tokens, so that the NER works with user-given entity types. | 2016-01-19 20:09:26 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f0f92793f6 | * Add test for user NER classes in matcher blocking the NER model. Re Issue #178 and Issue #217 | 2016-01-19 19:23:16 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 65c5bc4988 | * Add add_label method, to allow users to register new entity types and dependency labels. | 2016-01-19 19:11:02 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 151aa0b0e2 | * Allow users to add_label, in order to extend the entity recogniser to new classes. Does not by itself add a class to the model | 2016-01-19 19:09:33 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c8e0011ebc | * Add iterators to the NER and parser transition systems, to get the action types | 2016-01-19 19:07:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 515493c675 | * Add xfail test for Issue #225: tokenization with non-whitespace delimiters | 2016-01-19 13:20:14 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7abe653223 | * Fix imports | 2016-01-19 03:36:51 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 590f38bdb2 | * Add hacky solution to Issue #220. Currently specials.json only supports literal patterns, which doesn't allow us to pre-tag whitespace with the correct token, SP, as a rule. The data-driven approach should be easy but for some reason fails here. Adding a hard code in Morphology isn't a good solution, but we do want to fix the behaviour right away, and don't want to wait for an architecturally better solution. | 2016-01-19 03:35:20 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 445164d5b4 | * Restore the LOCAL_DATA_DIR global in spacy/en/__init__.py, although this is now deprecated | 2016-01-19 02:54:56 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04177debd0 | * Unwind limit to sentence boundary detection that prevents it from inserting boundaries on whitespace. Replace it with a check for whitespace in StateClass.fast_forward, so that whitespace is LeftArced when it's on the stack. This should prevent the previous problem of whitespace-only sentences. Should fix Issue #184, but may cause further problems. Needs testing. | 2016-01-19 02:54:15 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7893de3203 | * Add test for Issue #184: Whitespace at sentence boundary causes sentence boundary error. | 2016-01-18 23:04:38 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bba0a5e078 | * Handle string paths in default_vocab, default_parser, default_entity in Language class | 2016-01-18 22:37:24 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e825fd9554 | * Make some of the website tests work without models | 2016-01-18 18:14:44 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 334c4b2b57 | * Disprefer punctuation and spaces as heads of spans | 2016-01-18 18:14:09 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bed36ab0ff | * Fix import of HEAD attribute | 2016-01-18 17:34:43 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 28c659c1fe | * Fix import for numpy | 2016-01-18 17:25:04 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fc36bcf458 | * Fix import for English | 2016-01-18 17:14:40 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cc4c335e14 | * Set heads for test_merge_tokens, to make the test run without models | 2016-01-18 17:00:11 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c107da9738 | * Bug fix to _count_words_to_root | 2016-01-18 16:59:38 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f24833d607 | * Fix merge for coordinations | 2016-01-18 16:03:19 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 14534958a9 | * Fix bug in Span.root | 2016-01-18 15:40:28 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 714cbc03d5 | * Add test for Issue #203: nested noun chunks. | 2016-01-16 18:02:30 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4e2253170c | * Move test for doc.merge to tokens_api file, to avoid name conflicts which upset pytest | 2016-01-16 18:01:36 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 34a157511f | * Move test_merge_hang to test_tokens_api | 2016-01-16 18:00:26 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fc8f26584a | * Don't consider NPs connected to parse via conj relation as noun chunks. Change motivated by the nested noun chunks identified in Issue #203, but might be problematic. Also allow root NPs to be considered noun chunks. | 2016-01-16 17:52:40 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4a16dbfeca | * Add test for Issue #203: noun chunks should be flat, but sometimes are nested | 2016-01-16 17:41:25 +01:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 995b2d18fd | * Route token.string via token.txt_with_ws, to deprecate token.string in future | 2016-01-16 17:14:34 +01:00 |  |