11991 Commits

Author SHA1 Message Date
Matthew Honnibal
031673dc35 Update test 2020-06-22 16:08:01 +02:00
Matthew Honnibal
1d4322eb04 Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow 2020-06-22 16:01:13 +02:00
Matthew Honnibal
4cd1c74385 Update test 2020-06-22 16:00:55 +02:00
Matthew Honnibal
53931be9a1 Replace unseen labels for parser 2020-06-22 16:00:45 +02:00
Matthew Honnibal
c65f0ed8f6 Xfail some tests 2020-06-22 15:54:04 +02:00
Matthew Honnibal
bc481d8338 Remove print 2020-06-22 15:35:55 +02:00
Matthew Honnibal
2de72b30fe Remove prints 2020-06-22 15:34:55 +02:00
Matthew Honnibal
5a2d37c18f Xfail tests 2020-06-22 15:34:46 +02:00
Matthew Honnibal
e92be79ffc Clean up debugging 2020-06-22 15:34:34 +02:00
svlandeg
1720c58287 Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow 2020-06-22 15:15:29 +02:00
svlandeg
bf819ba302 Merge remote-tracking branch 'upstream/develop' into whatif/arrow
# Conflicts:
#	spacy/cli/train.py
#	spacy/gold.pyx
#	spacy/ml/models/multi_task.py
#	spacy/ml/models/simple_ner.py
#	spacy/ml/models/textcat.py
#	spacy/ml/models/tok2vec.py
#	spacy/pipeline/pipes.pyx
#	spacy/pipeline/simple_ner.py
#	spacy/scorer.py
#	spacy/tests/parser/test_add_label.py
#	spacy/tests/parser/test_nn_beam.py
#	spacy/tests/pipeline/test_morphologizer.py
#	spacy/tests/test_scorer.py
#	spacy/tests/test_util.py
#	spacy/util.py
2020-06-22 15:15:20 +02:00
Matthew Honnibal
b250f6b62f Update test 2020-06-22 14:59:05 +02:00
Matthew Honnibal
72ab21166d Work on Example.get_aligned_ner method 2020-06-22 14:55:33 +02:00
Matthew Honnibal
497fef4b5f Update test 2020-06-22 14:53:23 +02:00
Matthew Honnibal
ad50c8baca Add missing costs to NER oracle 2020-06-22 14:30:08 +02:00
Matthew Honnibal
f73fa77bb9 Update test 2020-06-22 14:29:52 +02:00
svlandeg
5e71919322 avoid writing temp dir in json2docs, fixing 4402 test 2020-06-22 14:27:35 +02:00
Matthew Honnibal
792bf9476c Update tests 2020-06-22 14:26:38 +02:00
svlandeg
ffddff03b8 fix add_label test 2020-06-22 11:58:59 +02:00
svlandeg
8ba8defa78 fix test_make_orth_variants 2020-06-22 11:52:17 +02:00
Ines Montani
34d59b494f Merge pull request #5619 from explosion/master-tmp 2020-06-22 02:36:08 -07:00
svlandeg
bb87e8c2b1 fix test_roundtrip_docs_to_docbin 2020-06-22 10:48:04 +02:00
svlandeg
0b3985d307 limit arg for Corpus 2020-06-22 10:22:26 +02:00
svlandeg
0d64c435b0 small fixes 2020-06-22 10:05:12 +02:00
Matthew Honnibal
6a75992af6 Format 2020-06-22 01:11:43 +02:00
Matthew Honnibal
455dc0d9e2 Fix eg.doc 2020-06-22 01:03:39 +02:00
Matthew Honnibal
79288e7110 Merge from remote 2020-06-22 00:58:18 +02:00
Matthew Honnibal
e634bae69e Fix Corpus 2020-06-22 00:54:38 +02:00
Matthew Honnibal
6e4d486b1e Debugging 2020-06-22 00:54:38 +02:00
Matthew Honnibal
e9860daf4b Update ArcEager oracle
Fix Break oracle
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3354758351 Remove Example.doc property
Remove Example.doc

Remove Example.doc

Remove Example.doc

Remove Example.doc
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3cc267d600 Add another arc-eager oracle test 2020-06-22 00:54:38 +02:00
Matthew Honnibal
ecf192aa70 Use get_aligned_parse in ArcEager 2020-06-22 00:54:38 +02:00
Matthew Honnibal
87f5348e17 Update nonproj 2020-06-22 00:54:38 +02:00
Matthew Honnibal
7d329cd1ac Add kwargs to Corpus.dev_dataset to match train_dataset 2020-06-22 00:54:38 +02:00
Matthew Honnibal
59098a5f62 Add get_aligned_parse method in Example
Fix Example.get_aligned_parse
2020-06-22 00:54:38 +02:00
Matthew Honnibal
6fa9a19e52 Update tests
Remove beam test

Update test

Unskip

Unskip tests
2020-06-22 00:54:38 +02:00
Matthew Honnibal
5ca4c19ef2 Work on parser oracle
Update arc_eager oracle

Restore ArcEager.get_cost function

Update transition system
2020-06-22 00:54:38 +02:00
Matthew Honnibal
75a5f2d499 Remove GoldCorpus
Update imports

Update after removing GoldCorpus

Fix module name of corpus

Fix mimport
2020-06-22 00:54:38 +02:00
Matthew Honnibal
50d4b21743 Xfail some tests
Skip tests that cause crashes

Skip test causing segfault
2020-06-22 00:54:38 +02:00
Matthew Honnibal
2efe01bf26 Fix parser declaration 2020-06-22 00:54:38 +02:00
Matthew Honnibal
29d39d8a34 Update header 2020-06-22 00:54:38 +02:00
Matthew Honnibal
456e27dc8b Start debugging arc_eager oracle 2020-06-22 00:54:38 +02:00
Matthew Honnibal
b60eede321 Fix parser model 2020-06-22 00:54:38 +02:00
Matthew Honnibal
17efd6bfec Update train.py 2020-06-22 00:54:38 +02:00
Matthew Honnibal
49145b9ec1 Update DocBin
Add missing strings when serializing
2020-06-22 00:54:35 +02:00
Matthew Honnibal
17226a60ac Draft Corpus class for DocBin
Update Corpus

Fix Corpus
2020-06-22 00:51:22 +02:00
Matthew Honnibal
6e7a7ab6da Work on train script 2020-06-22 00:48:09 +02:00
Matthew Honnibal
a5ebfb20f5 Serialize all attrs by default
Move converters under spacy.gold

Move things around

Fix naming

Fix name

Update converter to produce DocBin

Update converters

Make spacy convert output docbin

Fix import

Fix docbin

Fix import

Update converter

Remove jsonl converter

Add json2docs converter
2020-06-22 00:46:08 +02:00
Matthew Honnibal
5467cb4aae Allow DocBin to take list of Doc objects. 2020-06-22 00:46:08 +02:00