Commit Graph

11992 Commits

Author SHA1 Message Date
Matthew Honnibal
5467cb4aae Allow DocBin to take list of Doc objects. 2020-06-22 00:46:08 +02:00
Matthew Honnibal
d422f30a18 Start updating converters 2020-06-22 00:46:12 +02:00
svlandeg
6d5bfd6f6a fix test checking for variants 2020-06-22 00:46:08 +02:00
svlandeg
a427ca9355 clean up 2020-06-22 00:46:08 +02:00
svlandeg
5477bf054f add links to to_dict 2020-06-22 00:46:08 +02:00
Matthew Honnibal
39117de4f9 Fix compile in ArcEager 2020-06-22 00:46:08 +02:00
Matthew Honnibal
e2279eab1c Make doc.from_array several times faster 2020-06-22 00:46:08 +02:00
Matthew Honnibal
de32515bf8 Allocate Doc before starting to add words 2020-06-22 00:46:08 +02:00
Ines Montani
ef5f548fb0 Tidy up and auto-format 2020-06-21 22:38:04 +02:00
Ines Montani
f77e0bc028 Merge branch 'develop' into master-tmp 2020-06-21 22:34:15 +02:00
Ines Montani
40bb918a4c Remove unicode declarations and tidy up 2020-06-21 22:34:10 +02:00
Matthew Honnibal
6670c44390 Unskip tests 2020-06-21 01:17:52 +02:00
Matthew Honnibal
90d9f04e0b Unskip 2020-06-21 01:16:33 +02:00
Matthew Honnibal
2b180ea033 Update test 2020-06-21 01:15:41 +02:00
Matthew Honnibal
192b94f0a1 Remove beam test 2020-06-21 01:15:12 +02:00
Matthew Honnibal
9db66ddd48 Update test_arc_eager_oracle 2020-06-21 01:12:28 +02:00
Matthew Honnibal
7544c21f5b Update transition system 2020-06-21 01:12:05 +02:00
Matthew Honnibal
318a046fb0 Restore ArcEager.get_cost function 2020-06-21 01:11:08 +02:00
Matthew Honnibal
e90341810c Update arc_eager oracle 2020-06-21 01:04:02 +02:00
Matthew Honnibal
c58deb3546 Work on parser oracle 2020-06-21 01:01:09 +02:00
svlandeg
689600e17d add additional test back in (it works now) 2020-06-20 23:23:57 +02:00
svlandeg
2f6062a8a4 add line that got removed from EntityLinker 2020-06-20 23:14:45 +02:00
svlandeg
12dc8ab208 remove redundant code from master in EntityLinker 2020-06-20 23:07:42 +02:00
svlandeg
6179774278 fix test_build_dependencies by ignoring new libs 2020-06-20 22:49:37 +02:00
svlandeg
256d4c27c8 fix tagger begin_training being called without examples 2020-06-20 22:38:00 +02:00
Matthew Honnibal
914924a68b Fix mimport 2020-06-20 22:22:40 +02:00
Matthew Honnibal
2791c1c0dc Fix module name of corpus 2020-06-20 22:22:14 +02:00
Matthew Honnibal
4bbc277758 Update after removing GoldCorpus 2020-06-20 22:21:24 +02:00
Matthew Honnibal
64d00520e2 Update imports 2020-06-20 22:21:08 +02:00
Matthew Honnibal
cfd024536d Remove GoldCorpus 2020-06-20 22:13:37 +02:00
Matthew Honnibal
fd83551eb5 Skip test causing segfault 2020-06-20 22:11:27 +02:00
svlandeg
5cb812e0ab fix NER warn empty lookups (cf PR #5588) 2020-06-20 22:04:18 +02:00
Matthew Honnibal
095710e40e Skip tests that cause crashes 2020-06-20 22:02:32 +02:00
Matthew Honnibal
0b23fd3891 Xfail some tests 2020-06-20 21:52:57 +02:00
Matthew Honnibal
6af99f2f2d Fix parser declaration 2020-06-20 21:50:17 +02:00
Matthew Honnibal
52edb24f07 Update header 2020-06-20 21:50:06 +02:00
Matthew Honnibal
0c10831b14 Start debugging arc_eager oracle 2020-06-20 21:49:46 +02:00
Matthew Honnibal
2bcb5881d7 Fix parser model 2020-06-20 21:49:31 +02:00
Matthew Honnibal
396dd60b3a Fix Corpus 2020-06-20 21:49:15 +02:00
Matthew Honnibal
450c6fe39c Update train.py 2020-06-20 21:49:06 +02:00
svlandeg
c9242e9bf4 fix entity linker (cf PR #5548) 2020-06-20 21:47:23 +02:00
svlandeg
dc069e90b3 fix token.morph_ for v.3 (cf PR #5517) 2020-06-20 21:13:11 +02:00
Matthew Honnibal
6d821b2e55 Make doc.from_array several times faster 2020-06-20 20:17:13 +02:00
Matthew Honnibal
fa86aa581d Allocate Doc before starting to add words 2020-06-20 20:15:21 +02:00
Matthew Honnibal
652f31d3ee Update DocBin 2020-06-20 20:12:54 +02:00
Matthew Honnibal
0a8b6631a2 Update Corpus 2020-06-20 20:12:31 +02:00
Matthew Honnibal
11fa0658f7 Work on train script 2020-06-20 20:12:19 +02:00
Ines Montani
988d2a4eda
Add --code-path option to train CLI (#5618) 2020-06-20 18:43:12 +02:00
Matthew Honnibal
0de361cd00 Draft Corpus class for DocBin 2020-06-20 18:31:07 +02:00
Ines Montani
5424b70e51 Remove v2 test 2020-06-20 16:18:53 +02:00