| 
							
							
								 Matthew Honnibal | 18cc326dc0 | * Bug fixes to ner.pyx | 2015-06-10 06:57:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e5570c9700 | * Set nogil for oracle functions | 2015-06-10 06:56:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4575e7a60f | * Fix beam search with new StateClass | 2015-06-10 06:33:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04b1cd9b8c | * Greedy parsing working with new StateClass. Beam parsing broken | 2015-06-10 04:20:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6a94b64eca | * Remove State* from parser.pyx entirely, switching over to StateClass. Beam parsing still untested. | 2015-06-10 02:03:38 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f14a1526aa | * Remove version of fill_context that takes State* | 2015-06-10 01:39:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d68c686ec1 | * Move StateClass into interface of transition functions | 2015-06-10 01:35:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4b98b3e9c8 | * Cost functions now take StateClass argument, instead of State*. | 2015-06-10 00:40:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e0cf61f591 | * Move StateClass into the interface for is_valid | 2015-06-09 23:23:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0895d454fb | * Prepare to switch to using state class, instead of state struct | 2015-06-09 21:20:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2b9629ed62 | * Begin adding stateclass to ArcEager | 2015-06-09 01:41:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ba10fd8af5 | * Add StateClass, to replace/refactor the mess in _state | 2015-06-09 01:39:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c7e3dfc1dc | * Don't automatically push words when stack is empty, as it messes up beam parsing. Add hash method to beam state. | 2015-06-08 14:49:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 00a0dfcb59 | * Avoid shipping the spacy.munge package | 2015-06-08 00:54:13 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7d265a9c62 | * Revert to wget in spacy.en.download | 2015-06-08 00:48:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a8fc5f1285 | * Fix munge/read_ner | 2015-06-08 00:35:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1515862861 | * Fix download.py | 2015-06-08 00:08:05 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7e9e8f654a | * Use urllib in spacy.en.download | 2015-06-07 23:51:38 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 80cff41a9c | * Upd download.py | 2015-06-07 19:13:28 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6e2564239d | * Bug fixes to beam parser. Search still broken on non-gold sentences | 2015-06-07 19:12:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1ec4e6fc95 | * Don't score whitespace tokens | 2015-06-07 19:10:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 731e5f1e46 | * Add get() function in spacy/syntax/Config | 2015-06-07 19:09:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8f142c1838 | * Refactor transition system oracles, to split out move and label cost. Preparing to add Unshift move. Will exclude non-monotonic. | 2015-06-07 03:21:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 89b8775887 | * Fix output from _min_edit_path when inputs match. | 2015-06-06 05:58:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 98cfd84123 | * Remove hyphenation from main tokenizer loop: do it in infix.txt instead. This lets emoticons work | 2015-06-06 05:57:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1fee7ade61 | * Tweak to ner | 2015-06-05 23:48:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 33e70b167f | * Remove dead code from ner.pyx | 2015-06-05 17:12:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 88ac5c6e98 | * Send beam_width < 0 to greedy parser | 2015-06-05 17:12:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0114e7600d | * Fix NER oracle | 2015-06-05 17:11:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c04e6ebca6 | * Allow user to load different sized vectors. | 2015-06-05 16:26:39 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6bf35cecc3 | * Refactor transition system to use classes with staticmethods. | 2015-06-05 02:27:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 36a34d544b | * Refactoring arc_eager, grouping oracle functions into transitions | 2015-06-04 22:43:03 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4433396005 | * Impove efficiency of dynamic oracle, making beam training faster | 2015-06-04 21:15:14 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 079dad28a7 | * Update for faster beam training | 2015-06-04 19:32:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f8843906ad | Merge branch 'constituency' Add beam parsing and training from JSON files, with Levenshtein alignment. | 2015-06-03 06:07:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ae653b850a | * Remove unused import from gold.pyx | 2015-06-03 06:07:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a2627b6102 | * Fix bug in refactored init_transition | 2015-06-03 06:01:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | dd0867645d | * Remove stray const from State header | 2015-06-03 00:10:04 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6c47b10a6e | * Make optimization to children_in_buffer: stop searching when we would cross a bracket. | 2015-06-02 21:05:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a513ec500f | * Have oracle functions take a struct instead of a Python object | 2015-06-02 20:01:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d1b55310a1 | * Refactor _advance_beam function | 2015-06-02 18:38:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0786d9b3c7 | * Refactor TransitionSystem, adding set_valid method | 2015-06-02 18:38:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bd82a49994 | * Add set_scores method to Model | 2015-06-02 18:37:10 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a3964957f6 | * Add profiling for _state.pyx | 2015-06-02 18:36:27 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e822df0867 | * Fix bugs in new greedy/beam parser | 2015-06-02 02:01:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 66dfa95847 | * Revise greedy_parse/beam_parse ownership goof | 2015-06-02 01:34:19 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 75658b2ed3 | * Remove use of new beam.loss property, to maintain compatibility with older versions of thinc for now. | 2015-06-02 00:57:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7c29362d60 | * Rename parser class in parser.pxd, now that beam parsing is supported | 2015-06-02 00:53:49 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 58d5ac0944 | * Add beam search capabilities to Parser. Rename GreedyParser to Parser. | 2015-06-02 00:28:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 62424e6c76 | * Remove unused regularize argument from _ml.Model | 2015-06-02 00:27:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | adeb57cb1e | * Fix long line | 2015-06-01 23:07:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e09a08bd00 | * Add copy_state function | 2015-06-01 23:06:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c7876aa8b6 | * Add get_valid method | 2015-06-01 23:06:00 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d82f9d958d | * Remove regularization cruft from _ml, move score from .pxd file to .pyx | 2015-05-31 18:48:05 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5e99ff94c8 | * Edits to arc eager oracle. Couldn't figure out how the non-monotonic lines made sense. They seem covered by children_in_stack | 2015-05-31 15:14:37 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6c5632b71c | * Roll back proposed change to Break transition while investigate effect | 2015-05-31 06:49:52 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6bba793df3 | * Disable the Zipf-reweighting thing while investigate effect | 2015-05-31 06:48:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e77940565d | * Add length cap to distance feature | 2015-05-31 05:25:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fd596351ba | * Fix valency features | 2015-05-31 05:24:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 87d6551d19 | * Allow gold parse to cut non-projective arcs | 2015-05-31 01:11:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c4f0914b4e | * Fix POS tag evaluation in scorer.py: do evaluate punctuation tags | 2015-05-30 18:24:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9e39a206da | * Fix efficiency of JSON reading, by using ujson instead of stream | 2015-05-30 17:54:52 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 76300bbb1b | * Use updated JSON format, with sentences below paragraphs. Allows use of gold preprocessing flag. | 2015-05-30 01:25:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b76bbbd12c | * Read json files recursively from a directory, instead of requiring a single .json file | 2015-05-29 03:52:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8f31d3b864 | * Relax constraint on Break transition for non-monotonic parsing. | 2015-05-28 23:39:52 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6b2e5c4b8a | * Avoid NER scoring for sentences with some missing NER values. | 2015-05-28 22:39:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d25d31442d | * Hackishly support broken NER annotations. Should fix this. | 2015-05-27 19:14:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7a2725bca4 | * Read input json in a streaming way | 2015-05-27 19:13:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6a1c91675e | * Add file to read ENAMEX ner data | 2015-05-27 17:36:23 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 732fa7709a | * Edits to align_raw script, for use in prepare_treebank | 2015-05-27 04:23:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4010b9b6d9 | * Pass parameter for regularization in parser.pyx | 2015-05-27 03:18:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4c6058baa7 | * Fix evaluation of NER in scorer.py | 2015-05-27 03:18:16 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6016ee83a6 | * Fix reading of NER in gold.pyx | 2015-05-27 03:17:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 04bda8648d | * Pass parameter for regularization to model | 2015-05-27 03:16:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f69fe6a635 | * Fix heads problem in read_conll | 2015-05-27 01:14:54 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0eec1d12af | * Add comment about zipf reweighting | 2015-05-27 01:14:07 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4d37b66c55 | * Make Zipf regularization a bit more efficient | 2015-05-27 01:12:50 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 7fc24821bc | * Experiment with Zipfian corruptions when calculating prediction | 2015-05-26 22:17:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | eba7b34f66 | * Add flag to disable loading of word vectors | 2015-05-25 01:02:42 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 3593babd35 | * Add functions for Levenshtein distance alignment | 2015-05-24 21:50:48 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 744f06abf5 | * Add script to read OntoNotes source documents | 2015-05-24 21:49:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fc75210941 | * Move spacy.syntax.conll to spacy.gold | 2015-05-24 21:35:02 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 765b61cac4 | * Update spacy.scorer, to use P/R/F to support tokenization errors | 2015-05-24 20:07:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | efe7a7d7d6 | * Clean unused functions from spacy.syntax.conll | 2015-05-24 20:06:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 78487f3e66 | * Update parser oracle for missing heads | 2015-05-24 20:05:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1044a13413 | * Begin refactoring scorer to use recall over gold dependencies | 2015-05-24 17:40:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | acd1245ad4 | * Remove cruft from conll.pyx --- unused stuff about evlauation, which now lives in spacy.scorer | 2015-05-24 17:35:49 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 20f1d868a3 | * Tmp commit. Working on whole document parsing | 2015-05-24 02:49:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f2ee9c4feb | * Comment out constituency parsing stuff, so that code compiles | 2015-05-20 16:55:05 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8ee7c541f1 | * Update Constituent definition | 2015-05-20 16:03:26 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9dfc9c039c | * Work on constituency parsing. | 2015-05-20 16:02:51 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5a5710e711 | * Fix Span.subtree property | 2015-05-13 21:53:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | badf030b6c | * Add parse navigation to Span objects | 2015-05-13 21:45:19 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ca320afe86 | * Add docstring for ents attribute | 2015-05-13 21:20:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ba07b925a7 | * Fix compile error in conll.pyx | 2015-05-12 22:33:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f1e0272b18 | * Disable c-parsing transitions | 2015-05-12 22:33:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 03a6626545 | * Tmp commit | 2015-05-12 20:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9568ebed08 | * Fix off-by-one in head reading | 2015-05-12 20:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 69840d8cc3 | * Tweak verbose output printing in scorer.py | 2015-05-12 20:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0605af6838 | * Fix head misalignment in read_conll, when periods are ignored | 2015-05-12 20:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d2ac8d8007 | * Add ctnt field to State, in preparation for constituency parsing | 2015-05-12 20:27:56 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ab67693393 | * Add read_json_file to conll.pyx | 2015-05-12 20:27:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | aff9359a8d | * Update ner.pyx to expect brackets from gold_tuples | 2015-05-12 20:27:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0ad72a77ce | * Write JSON files, with both dependency and PSG parses | 2015-05-12 20:27:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d48218f4b2 | * Add left_edge and right_edge properties | 2015-05-12 20:27:55 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 53cf77e1c8 | * Bug fix: when non-monotonically correct a dependency, make sure to delete the old one from the child list | 2015-05-12 20:26:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a4e2af54f9 | * Add support for l/r edge to add_dep, and move inlined methods into _state.pyx where possible | 2015-05-12 20:26:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | d634038eb6 | * Add l_edge and r_edge props in TokenC for tracking the parse-yield of the token | 2015-05-12 20:26:41 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 03ebf70a66 | * Inc version to 0.84 | 2015-05-12 02:38:51 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e73eaf2d05 | * Replace some assertions with proper errors | 2015-05-08 16:52:17 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fb8d50b3d5 | Merge branch 'master' of ssh://github.com/honnibal/spaCy | 2015-04-30 12:45:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | ed8e8c3bd0 | * Whitespace | 2015-04-29 14:22:47 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 378c2a6435 | * Fix POS model: make it use tag instead of pos in history features | 2015-04-29 00:02:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 763ef01575 | * Fix two bugs in feature calculation | 2015-04-28 23:25:09 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b3fd48c97b | * Fix missing root labels bug identified in Issue #57 | 2015-04-28 20:45:51 +02:00 |  | 
			
				
					| 
							
							
								 Jordan Suchow | 3a8d9b37a6 | Remove trailing whitespace | 2015-04-19 13:01:38 -07:00 |  | 
			
				
					| 
							
							
								 Jordan Suchow | 5f0f940a1f | Remove unused imports | 2015-04-19 01:05:22 -07:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cc4e395927 | * Add some ad hoc regexes, for multi-word location prepositions | 2015-04-17 04:44:24 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f7ffd94e6a | * Add Token.conjuncts property | 2015-04-17 01:40:53 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 684d0e5e85 | * Download updated data | 2015-04-16 04:29:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2ef170a991 | * Fix Issue #54: Error merging multi-word token when there's a mid-token match. | 2015-04-16 04:28:06 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 42617548af | * Disable merge_mwes by default | 2015-04-16 04:20:31 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 99dbf8a38c | * Fix error type in lookup_transition | 2015-04-16 01:36:22 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 77d0700caf | * Add on X way regexes | 2015-04-16 01:35:46 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 9f16848b60 | * Add (N0w, N1w) unigram pair to NER features, prompted by failure to detect 'this weekend' | 2015-04-15 06:01:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | c6707778dd | * Fix Issue #51: Handle non-ascii lemmas correctly | 2015-04-13 22:28:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | bf0aff5124 | * Fix bug in Tokens.ents where entity wasn't being emitted if another started immediately after | 2015-04-13 21:34:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2b84a90bbb | * Fix Issue #50: Python 3 compatibility of v0.80 | 2015-04-13 05:59:43 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fbd48c571d | * Rearrange code in tokens.pyx | 2015-04-13 05:41:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 507048dc45 | * Rename StandardError to Exception, for Python 3 compatibility | 2015-04-12 07:28:34 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 761a19113a | * Fix /tmp moving thing in download.py | 2015-04-12 07:04:10 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 248a2b4b0f | * Remove Spans class | 2015-04-12 04:07:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 1d05e6da00 | * Add ne_iob and ne_type features to NER | 2015-04-10 19:07:08 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 4df8a3d90f | * Add ne_iob and ne_type attributes to context vector | 2015-04-10 05:02:15 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 8c354c432b | * Add ValueError condition to ner_tag reading | 2015-04-10 04:59:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 435cccf098 | * Add read_conll03_file function to conll.pyx | 2015-04-10 04:59:11 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 99c9ecfc18 | * Fix bug in prefix, suffix and word shape features in parser and NER | 2015-04-10 03:53:33 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | cff2b13fef | * Fix Issue #44: Broken Token.string attribute when single word sentence | 2015-04-07 06:08:25 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 6640386b25 | * Fix Issue #43: TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way. | 2015-04-07 06:00:57 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b64b2bd910 | * Fix Issue #43: TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way. | 2015-04-07 06:00:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | f9e510a893 | * Whitespace | 2015-04-07 04:53:59 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 66c7ccf6cc | * Fix Spans.orth_ | 2015-04-07 04:53:40 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | b8d34531c4 | * Add support for units to English.__init__, by loading and applying regular expressions | 2015-04-07 04:02:32 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 0ea5af88b6 | * Add multi-word expression RegexMatcher | 2015-04-07 03:45:40 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 2fee67cfa3 | * Add regular expressions for English multi-word expressions | 2015-04-07 03:45:18 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 5a075ea3fc | * Ensure NER moves are available for single-word tokens | 2015-04-05 22:30:58 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | a60a366b2c | * Support 'punct' dep label in conll.pyx | 2015-04-05 22:30:19 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | 021c972137 | * Print parse if verbose in scorer | 2015-04-05 22:29:30 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | fbf19049cf | * Add ent_type_ property | 2015-03-31 02:01:29 +02:00 |  | 
			
				
					| 
							
							
								 Matthew Honnibal | e70b87efeb | * Add merge() method to Tokens, with fairly brittle/hacky implementation, but quite easy to test. Passing minimal tests. Still need to fix left/right deps in C data | 2015-03-30 01:37:41 +02:00 |  |