svlandeg
bf819ba302
Merge remote-tracking branch 'upstream/develop' into whatif/arrow
...
# Conflicts:
# spacy/cli/train.py
# spacy/gold.pyx
# spacy/ml/models/multi_task.py
# spacy/ml/models/simple_ner.py
# spacy/ml/models/textcat.py
# spacy/ml/models/tok2vec.py
# spacy/pipeline/pipes.pyx
# spacy/pipeline/simple_ner.py
# spacy/scorer.py
# spacy/tests/parser/test_add_label.py
# spacy/tests/parser/test_nn_beam.py
# spacy/tests/pipeline/test_morphologizer.py
# spacy/tests/test_scorer.py
# spacy/tests/test_util.py
# spacy/util.py
2020-06-22 15:15:20 +02:00
svlandeg
5e71919322
avoid writing temp dir in json2docs, fixing 4402 test
2020-06-22 14:27:35 +02:00
svlandeg
ffddff03b8
fix add_label test
2020-06-22 11:58:59 +02:00
svlandeg
8ba8defa78
fix test_make_orth_variants
2020-06-22 11:52:17 +02:00
svlandeg
bb87e8c2b1
fix test_roundtrip_docs_to_docbin
2020-06-22 10:48:04 +02:00
svlandeg
0d64c435b0
small fixes
2020-06-22 10:05:12 +02:00
Matthew Honnibal
6a75992af6
Format
2020-06-22 01:11:43 +02:00
Matthew Honnibal
3cc267d600
Add another arc-eager oracle test
2020-06-22 00:54:38 +02:00
Matthew Honnibal
6fa9a19e52
Update tests
...
Remove beam test
Update test
Unskip
Unskip tests
2020-06-22 00:54:38 +02:00
Matthew Honnibal
75a5f2d499
Remove GoldCorpus
...
Update imports
Update after removing GoldCorpus
Fix module name of corpus
Fix mimport
2020-06-22 00:54:38 +02:00
Matthew Honnibal
50d4b21743
Xfail some tests
...
Skip tests that cause crashes
Skip test causing segfault
2020-06-22 00:54:38 +02:00
svlandeg
6d5bfd6f6a
fix test checking for variants
2020-06-22 00:46:08 +02:00
svlandeg
a427ca9355
clean up
2020-06-22 00:46:08 +02:00
svlandeg
5477bf054f
add links to to_dict
2020-06-22 00:46:08 +02:00
Ines Montani
ef5f548fb0
Tidy up and auto-format
2020-06-21 22:38:04 +02:00
Ines Montani
40bb918a4c
Remove unicode declarations and tidy up
2020-06-21 22:34:10 +02:00
svlandeg
689600e17d
add additional test back in (it works now)
2020-06-20 23:23:57 +02:00
svlandeg
12dc8ab208
remove redundant code from master in EntityLinker
2020-06-20 23:07:42 +02:00
svlandeg
6179774278
fix test_build_dependencies by ignoring new libs
2020-06-20 22:49:37 +02:00
Ines Montani
5424b70e51
Remove v2 test
2020-06-20 16:18:53 +02:00
Ines Montani
63c22969f4
Update test_issue5230.py
2020-06-20 16:17:48 +02:00
Ines Montani
296b5d633b
Remove references to Python 2 / is_python2
2020-06-20 16:11:13 +02:00
Ines Montani
52728d8fa3
Merge branch 'develop' into master-tmp
2020-06-20 15:52:00 +02:00
Ines Montani
f91e9e8c84
Remove F841 [ci skip]
2020-06-20 14:47:17 +02:00
Ines Montani
8283df80e9
Tidy up and auto-format
2020-06-20 14:15:04 +02:00
svlandeg
6ca6d7d6b4
test for split sentences with various alignment issues, works
2020-06-18 20:01:02 +02:00
svlandeg
0b6d45eae1
various small fixes
2020-06-18 15:55:00 +02:00
svlandeg
01f9ae774c
small fixes
2020-06-18 14:01:19 +02:00
svlandeg
0c6f1f3891
fix BiluoPushDown parsing entities
2020-06-18 13:00:03 +02:00
svlandeg
cd790aaa2a
fix parser tests to work with example (most still failing)
2020-06-18 11:19:22 +02:00
svlandeg
9f43ba839a
throw informative error when running the components with the wrong type of objects
2020-06-18 10:36:05 +02:00
svlandeg
0f123af35e
ensure test keeps working with non-linked entities
2020-06-17 21:13:38 +02:00
svlandeg
6d73e139b0
fix entity linker
2020-06-17 21:12:25 +02:00
svlandeg
10d396977e
add support for MORPH in to/from_array, fix morphologizer overfitting test
2020-06-17 17:48:07 +02:00
svlandeg
1a151b10d6
correct silly typo
2020-06-17 14:48:14 +02:00
svlandeg
f7ad8e8c83
various fixes in scripts - needs to be further tested
2020-06-17 12:05:58 +02:00
svlandeg
4ed399c848
minibatch utiltiy can deal with strings, docs or examples
2020-06-16 21:35:55 +02:00
svlandeg
8b66c11ff2
add spaces to json output format
2020-06-16 19:30:03 +02:00
svlandeg
ba80ad7efd
fixed some tests + WIP roundtrip unit test
2020-06-16 18:26:50 +02:00
svlandeg
43d41d6bb6
allow None as BILUO annotation
2020-06-16 15:30:05 +02:00
svlandeg
44a0f9c2c8
test_gold_biluo_different_tokenization works
2020-06-16 15:21:20 +02:00
svlandeg
1c35b8efcd
fix spaces
2020-06-16 12:08:25 +02:00
svlandeg
0702a1d3fb
fix test for misaligned
2020-06-15 23:10:47 +02:00
svlandeg
a28f8f369e
Fix many-to-one IOB codes
2020-06-15 23:06:22 +02:00
svlandeg
12886b787b
fixing NER one-to-many alignment
2020-06-15 22:44:17 +02:00
svlandeg
68986a252e
additional tests for new get_aligned function
2020-06-15 17:42:40 +02:00
svlandeg
41d29983a7
start testing get_aligned
2020-06-15 17:16:01 +02:00
svlandeg
fd5f199feb
fixing language and scoring tests
2020-06-15 15:02:05 +02:00
Adriane Boyd
c482f20778
Fix and add warnings related to spacy-lookups-data ( #5588 )
...
* Fix warning message for lemmatization tables
* Add a warning when the `lexeme_norm` table is empty. (Given the
relatively lang-specific loading for `Lookups`, it seemed like too much
overhead to dynamically extract the list of languages, so for now it's
hard-coded.)
2020-06-15 14:56:04 +02:00
Matthew Honnibal
98ca14f577
Remove GoldParse
...
WIP on removing goldparse
Get ArcEager compiling after GoldParse excise
Update setup.py
Get spacy.syntax compiling after removing GoldParse
Rename NewExample -> Example and clean up
Clean html files
Start updating tests
Update Morphologizer
2020-06-14 19:53:30 +02:00