Commit Graph

12067 Commits

Author SHA1 Message Date
svlandeg
5b6ed05752 attempt to fix _guess_spaces 2020-06-26 11:38:37 +02:00
Matthew Honnibal
5b1d15e247 Don't unset example docs 2020-06-25 21:19:49 +02:00
Matthew Honnibal
04583fe0c6 Copy examples at the start of Language.update 2020-06-25 21:19:33 +02:00
Matthew Honnibal
0d2492c97a Implement Example.copy 2020-06-25 21:18:52 +02:00
Matthew Honnibal
c2fff89b1d Implement Doc.copy 2020-06-25 21:18:37 +02:00
Matthew Honnibal
ae58d00327 Restore random cuts in parser/ner training 2020-06-25 21:18:29 +02:00
Matthew Honnibal
9e3695de6b Print epochs 2020-06-25 21:18:08 +02:00
Matthew Honnibal
98c026195b Add assertion 2020-06-25 18:52:55 +02:00
Matthew Honnibal
403b362a5d Restore the 'cutting' in parser training 2020-06-25 18:52:42 +02:00
Matthew Honnibal
6bda23ad26 Improve handling of missing values in NER 2020-06-25 16:26:44 +02:00
Matthew Honnibal
b8c85e593b Improve NER alignment 2020-06-25 16:26:13 +02:00
Matthew Honnibal
b3df6228dd Fix spaces reading 2020-06-25 15:20:00 +02:00
Matthew Honnibal
7d866c3056 Add test 2020-06-25 15:19:46 +02:00
Matthew Honnibal
c39401105b Set spaces on gold doc after conversion 2020-06-25 15:19:36 +02:00
Matthew Honnibal
c2fd1e4eb9 Fix merge 2020-06-25 03:58:32 +02:00
Matthew Honnibal
ae7041f182 Dont return Example from init_gold_batch 2020-06-25 03:55:59 +02:00
Matthew Honnibal
67c82dbea9 Return examples from init_gold_batch 2020-06-25 03:44:12 +02:00
Matthew Honnibal
4925c0be34 Return kept examples from init_gold_batch 2020-06-25 03:42:41 +02:00
Matthew Honnibal
b3625dc697 Fix unset variable 2020-06-24 18:03:55 +02:00
Matthew Honnibal
8bbf31a582 Output unlabelled spans from O biluo tags in iob_utils 2020-06-24 18:03:44 +02:00
Matthew Honnibal
7eb064854e Fix handling of NER data in Example 2020-06-24 18:03:24 +02:00
Matthew Honnibal
359e874766 Hack scorer to avoid evaluating non-nered data 2020-06-24 18:02:59 +02:00
Matthew Honnibal
70bd7fb611 Fix test 2020-06-24 17:21:53 +02:00
Matthew Honnibal
bca1674c30 Fix convert for single file 2020-06-24 16:04:23 +02:00
Matthew Honnibal
2d5f5cb5fb Add NER data test 2020-06-24 15:49:25 +02:00
Matthew Honnibal
14bb102a6b Update test 2020-06-24 15:35:17 +02:00
Matthew Honnibal
c09019aa8d Fix parser test 2020-06-24 15:31:43 +02:00
Matthew Honnibal
a78427d640 Update gold test for NER 2020-06-24 15:19:40 +02:00
Matthew Honnibal
5120113f39 Update onto-json default 2020-06-24 00:00:59 +02:00
Matthew Honnibal
306a591e1f Tweak efficiency of precomputable affine 2020-06-24 00:00:40 +02:00
Matthew Honnibal
d85d063231 Fix NER init_gold_batch 2020-06-23 23:58:55 +02:00
Matthew Honnibal
78e9e15e9e Fix conversion of NER data 2020-06-23 23:58:27 +02:00
Matthew Honnibal
b82431207d Simplify NER alignment 2020-06-23 23:57:54 +02:00
Matthew Honnibal
10eb44d76a Remove debugging in parser 2020-06-23 22:59:31 +02:00
Matthew Honnibal
ca608467da Filter out uannotated sentences in NER 2020-06-23 22:58:49 +02:00
Matthew Honnibal
420a986d15 Fix arc_eager oracle 2020-06-23 22:58:12 +02:00
Matthew Honnibal
a68d0e63f0 Support max_length in Corpus 2020-06-23 22:57:40 +02:00
svlandeg
28ad71c187 bugfix excl Span.end in iob2docs 2020-06-23 17:20:41 +02:00
svlandeg
7c76a2b796 fix asserts 2020-06-23 17:09:37 +02:00
svlandeg
351ab3a3d4 pull merge_sent into iob2docs to avoid Doc creation for each line 2020-06-23 16:47:30 +02:00
Matthew Honnibal
d5212f7ba8 Improve efficiency of ArEager oracle 2020-06-23 15:55:12 +02:00
svlandeg
5cf3eeee0d bugfix: update states after creating golds to avoid out of bounds indexing 2020-06-23 14:49:31 +02:00
svlandeg
790b373901 fix var 2020-06-23 14:05:00 +02:00
svlandeg
0962ee7ce6 fix output_dir (converted to Path by typer) 2020-06-23 13:49:21 +02:00
Matthew Honnibal
537a5b9cef Fix import 2020-06-23 13:35:12 +02:00
Matthew Honnibal
6f7f2131cb Fix flake8 2020-06-23 13:26:38 +02:00
Matthew Honnibal
1682a60a20 Fix convert command 2020-06-23 13:19:37 +02:00
Matthew Honnibal
8722b65bce Fix json2docs converter 2020-06-23 13:19:26 +02:00
Matthew Honnibal
7376518af2 Fix gold_preproc 2020-06-23 12:01:29 +02:00
Matthew Honnibal
8f420f3978 Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow 2020-06-22 17:49:16 +02:00