Commit Graph

11994 Commits

Author SHA1 Message Date
Matthew Honnibal
2d34d2f24a Support gold_preproc in Corpus 2020-06-22 17:47:12 +02:00
Matthew Honnibal
afe6ee4548 Fix Corpus 2020-06-22 16:28:47 +02:00
Matthew Honnibal
fedfabec80 Xfail test 2020-06-22 16:11:42 +02:00
Matthew Honnibal
031673dc35 Update test 2020-06-22 16:08:01 +02:00
Matthew Honnibal
1d4322eb04 Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow 2020-06-22 16:01:13 +02:00
Matthew Honnibal
4cd1c74385 Update test 2020-06-22 16:00:55 +02:00
Matthew Honnibal
53931be9a1 Replace unseen labels for parser 2020-06-22 16:00:45 +02:00
Matthew Honnibal
c65f0ed8f6 Xfail some tests 2020-06-22 15:54:04 +02:00
Matthew Honnibal
bc481d8338 Remove print 2020-06-22 15:35:55 +02:00
Matthew Honnibal
2de72b30fe Remove prints 2020-06-22 15:34:55 +02:00
Matthew Honnibal
5a2d37c18f Xfail tests 2020-06-22 15:34:46 +02:00
Matthew Honnibal
e92be79ffc Clean up debugging 2020-06-22 15:34:34 +02:00
svlandeg
1720c58287 Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow 2020-06-22 15:15:29 +02:00
svlandeg
bf819ba302 Merge remote-tracking branch 'upstream/develop' into whatif/arrow
# Conflicts:
#	spacy/cli/train.py
#	spacy/gold.pyx
#	spacy/ml/models/multi_task.py
#	spacy/ml/models/simple_ner.py
#	spacy/ml/models/textcat.py
#	spacy/ml/models/tok2vec.py
#	spacy/pipeline/pipes.pyx
#	spacy/pipeline/simple_ner.py
#	spacy/scorer.py
#	spacy/tests/parser/test_add_label.py
#	spacy/tests/parser/test_nn_beam.py
#	spacy/tests/pipeline/test_morphologizer.py
#	spacy/tests/test_scorer.py
#	spacy/tests/test_util.py
#	spacy/util.py
2020-06-22 15:15:20 +02:00
Matthew Honnibal
b250f6b62f Update test 2020-06-22 14:59:05 +02:00
Matthew Honnibal
72ab21166d Work on Example.get_aligned_ner method 2020-06-22 14:55:33 +02:00
Matthew Honnibal
497fef4b5f Update test 2020-06-22 14:53:23 +02:00
Matthew Honnibal
ad50c8baca Add missing costs to NER oracle 2020-06-22 14:30:08 +02:00
Matthew Honnibal
f73fa77bb9 Update test 2020-06-22 14:29:52 +02:00
svlandeg
5e71919322 avoid writing temp dir in json2docs, fixing 4402 test 2020-06-22 14:27:35 +02:00
Matthew Honnibal
792bf9476c Update tests 2020-06-22 14:26:38 +02:00
svlandeg
ffddff03b8 fix add_label test 2020-06-22 11:58:59 +02:00
svlandeg
8ba8defa78 fix test_make_orth_variants 2020-06-22 11:52:17 +02:00
Ines Montani
34d59b494f Merge pull request #5619 from explosion/master-tmp 2020-06-22 02:36:08 -07:00
svlandeg
bb87e8c2b1 fix test_roundtrip_docs_to_docbin 2020-06-22 10:48:04 +02:00
svlandeg
0b3985d307 limit arg for Corpus 2020-06-22 10:22:26 +02:00
svlandeg
0d64c435b0 small fixes 2020-06-22 10:05:12 +02:00
Matthew Honnibal
6a75992af6 Format 2020-06-22 01:11:43 +02:00
Matthew Honnibal
455dc0d9e2 Fix eg.doc 2020-06-22 01:03:39 +02:00
Matthew Honnibal
79288e7110 Merge from remote 2020-06-22 00:58:18 +02:00
Matthew Honnibal
e634bae69e Fix Corpus 2020-06-22 00:54:38 +02:00
Matthew Honnibal
6e4d486b1e Debugging 2020-06-22 00:54:38 +02:00
Matthew Honnibal
e9860daf4b Update ArcEager oracle
Fix Break oracle
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3354758351 Remove Example.doc property
Remove Example.doc

Remove Example.doc

Remove Example.doc

Remove Example.doc
2020-06-22 00:54:38 +02:00
Matthew Honnibal
3cc267d600 Add another arc-eager oracle test 2020-06-22 00:54:38 +02:00
Matthew Honnibal
ecf192aa70 Use get_aligned_parse in ArcEager 2020-06-22 00:54:38 +02:00
Matthew Honnibal
87f5348e17 Update nonproj 2020-06-22 00:54:38 +02:00
Matthew Honnibal
7d329cd1ac Add kwargs to Corpus.dev_dataset to match train_dataset 2020-06-22 00:54:38 +02:00
Matthew Honnibal
59098a5f62 Add get_aligned_parse method in Example
Fix Example.get_aligned_parse
2020-06-22 00:54:38 +02:00
Matthew Honnibal
6fa9a19e52 Update tests
Remove beam test

Update test

Unskip

Unskip tests
2020-06-22 00:54:38 +02:00
Matthew Honnibal
5ca4c19ef2 Work on parser oracle
Update arc_eager oracle

Restore ArcEager.get_cost function

Update transition system
2020-06-22 00:54:38 +02:00
Matthew Honnibal
75a5f2d499 Remove GoldCorpus
Update imports

Update after removing GoldCorpus

Fix module name of corpus

Fix mimport
2020-06-22 00:54:38 +02:00
Matthew Honnibal
50d4b21743 Xfail some tests
Skip tests that cause crashes

Skip test causing segfault
2020-06-22 00:54:38 +02:00
Matthew Honnibal
2efe01bf26 Fix parser declaration 2020-06-22 00:54:38 +02:00
Matthew Honnibal
29d39d8a34 Update header 2020-06-22 00:54:38 +02:00
Matthew Honnibal
456e27dc8b Start debugging arc_eager oracle 2020-06-22 00:54:38 +02:00
Matthew Honnibal
b60eede321 Fix parser model 2020-06-22 00:54:38 +02:00
Matthew Honnibal
17efd6bfec Update train.py 2020-06-22 00:54:38 +02:00
Matthew Honnibal
49145b9ec1 Update DocBin
Add missing strings when serializing
2020-06-22 00:54:35 +02:00
Matthew Honnibal
17226a60ac Draft Corpus class for DocBin
Update Corpus

Fix Corpus
2020-06-22 00:51:22 +02:00