Commit Graph

11823 Commits

Author SHA1 Message Date
Matthew Honnibal
ad50c8baca Add missing costs to NER oracle 2020-06-22 14:30:08 +02:00
Matthew Honnibal
f73fa77bb9 Update test 2020-06-22 14:29:52 +02:00
Matthew Honnibal
792bf9476c Update tests 2020-06-22 14:26:38 +02:00
Matthew Honnibal
6a75992af6 Format 2020-06-22 01:11:43 +02:00
Matthew Honnibal
455dc0d9e2 Fix eg.doc 2020-06-22 01:03:39 +02:00
Matthew Honnibal
79288e7110 Merge from remote 2020-06-22 00:58:18 +02:00
Matthew Honnibal
e634bae69e Fix Corpus 2020-06-22 00:54:38 +02:00
Matthew Honnibal
6e4d486b1e Debugging 2020-06-22 00:54:38 +02:00
Matthew Honnibal
e9860daf4b Update ArcEager oracle
Fix Break oracle
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3354758351 Remove Example.doc property
Remove Example.doc

Remove Example.doc

Remove Example.doc

Remove Example.doc
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3cc267d600 Add another arc-eager oracle test 2020-06-22 00:54:38 +02:00
Matthew Honnibal
ecf192aa70 Use get_aligned_parse in ArcEager 2020-06-22 00:54:38 +02:00
Matthew Honnibal
87f5348e17 Update nonproj 2020-06-22 00:54:38 +02:00
Matthew Honnibal
7d329cd1ac Add kwargs to Corpus.dev_dataset to match train_dataset 2020-06-22 00:54:38 +02:00
Matthew Honnibal
59098a5f62 Add get_aligned_parse method in Example
Fix Example.get_aligned_parse
2020-06-22 00:54:38 +02:00
Matthew Honnibal
6fa9a19e52 Update tests
Remove beam test

Update test

Unskip

Unskip tests
2020-06-22 00:54:38 +02:00
Matthew Honnibal
5ca4c19ef2 Work on parser oracle
Update arc_eager oracle

Restore ArcEager.get_cost function

Update transition system
2020-06-22 00:54:38 +02:00
Matthew Honnibal
75a5f2d499 Remove GoldCorpus
Update imports

Update after removing GoldCorpus

Fix module name of corpus

Fix mimport
2020-06-22 00:54:38 +02:00
Matthew Honnibal
50d4b21743 Xfail some tests
Skip tests that cause crashes

Skip test causing segfault
2020-06-22 00:54:38 +02:00
Matthew Honnibal
2efe01bf26 Fix parser declaration 2020-06-22 00:54:38 +02:00
Matthew Honnibal
29d39d8a34 Update header 2020-06-22 00:54:38 +02:00
Matthew Honnibal
456e27dc8b Start debugging arc_eager oracle 2020-06-22 00:54:38 +02:00
Matthew Honnibal
b60eede321 Fix parser model 2020-06-22 00:54:38 +02:00
Matthew Honnibal
17efd6bfec Update train.py 2020-06-22 00:54:38 +02:00
Matthew Honnibal
49145b9ec1 Update DocBin
Add missing strings when serializing
2020-06-22 00:54:35 +02:00
Matthew Honnibal
17226a60ac Draft Corpus class for DocBin
Update Corpus

Fix Corpus
2020-06-22 00:51:22 +02:00
Matthew Honnibal
6e7a7ab6da Work on train script 2020-06-22 00:48:09 +02:00
Matthew Honnibal
a5ebfb20f5 Serialize all attrs by default
Move converters under spacy.gold

Move things around

Fix naming

Fix name

Update converter to produce DocBin

Update converters

Make spacy convert output docbin

Fix import

Fix docbin

Fix import

Update converter

Remove jsonl converter

Add json2docs converter
2020-06-22 00:46:08 +02:00
Matthew Honnibal
5467cb4aae Allow DocBin to take list of Doc objects. 2020-06-22 00:46:08 +02:00
Matthew Honnibal
d422f30a18 Start updating converters 2020-06-22 00:46:12 +02:00
svlandeg
6d5bfd6f6a fix test checking for variants 2020-06-22 00:46:08 +02:00
svlandeg
a427ca9355 clean up 2020-06-22 00:46:08 +02:00
svlandeg
5477bf054f add links to to_dict 2020-06-22 00:46:08 +02:00
Matthew Honnibal
39117de4f9 Fix compile in ArcEager 2020-06-22 00:46:08 +02:00
Matthew Honnibal
e2279eab1c Make doc.from_array several times faster 2020-06-22 00:46:08 +02:00
Matthew Honnibal
de32515bf8 Allocate Doc before starting to add words 2020-06-22 00:46:08 +02:00
Matthew Honnibal
6670c44390 Unskip tests 2020-06-21 01:17:52 +02:00
Matthew Honnibal
90d9f04e0b Unskip 2020-06-21 01:16:33 +02:00
Matthew Honnibal
2b180ea033 Update test 2020-06-21 01:15:41 +02:00
Matthew Honnibal
192b94f0a1 Remove beam test 2020-06-21 01:15:12 +02:00
Matthew Honnibal
9db66ddd48 Update test_arc_eager_oracle 2020-06-21 01:12:28 +02:00
Matthew Honnibal
7544c21f5b Update transition system 2020-06-21 01:12:05 +02:00
Matthew Honnibal
318a046fb0 Restore ArcEager.get_cost function 2020-06-21 01:11:08 +02:00
Matthew Honnibal
e90341810c Update arc_eager oracle 2020-06-21 01:04:02 +02:00
Matthew Honnibal
c58deb3546 Work on parser oracle 2020-06-21 01:01:09 +02:00
Matthew Honnibal
914924a68b Fix mimport 2020-06-20 22:22:40 +02:00
Matthew Honnibal
2791c1c0dc Fix module name of corpus 2020-06-20 22:22:14 +02:00
Matthew Honnibal
4bbc277758 Update after removing GoldCorpus 2020-06-20 22:21:24 +02:00
Matthew Honnibal
64d00520e2 Update imports 2020-06-20 22:21:08 +02:00
Matthew Honnibal
cfd024536d Remove GoldCorpus 2020-06-20 22:13:37 +02:00