Matthew Honnibal
|
6e7a7ab6da
|
Work on train script
|
2020-06-22 00:48:09 +02:00 |
|
Matthew Honnibal
|
a5ebfb20f5
|
Serialize all attrs by default
Move converters under spacy.gold
Move things around
Fix naming
Fix name
Update converter to produce DocBin
Update converters
Make spacy convert output docbin
Fix import
Fix docbin
Fix import
Update converter
Remove jsonl converter
Add json2docs converter
|
2020-06-22 00:46:08 +02:00 |
|
Matthew Honnibal
|
5467cb4aae
|
Allow DocBin to take list of Doc objects.
|
2020-06-22 00:46:08 +02:00 |
|
Matthew Honnibal
|
d422f30a18
|
Start updating converters
|
2020-06-22 00:46:12 +02:00 |
|
svlandeg
|
6d5bfd6f6a
|
fix test checking for variants
|
2020-06-22 00:46:08 +02:00 |
|
svlandeg
|
a427ca9355
|
clean up
|
2020-06-22 00:46:08 +02:00 |
|
svlandeg
|
5477bf054f
|
add links to to_dict
|
2020-06-22 00:46:08 +02:00 |
|
Matthew Honnibal
|
39117de4f9
|
Fix compile in ArcEager
|
2020-06-22 00:46:08 +02:00 |
|
Matthew Honnibal
|
e2279eab1c
|
Make doc.from_array several times faster
|
2020-06-22 00:46:08 +02:00 |
|
Matthew Honnibal
|
de32515bf8
|
Allocate Doc before starting to add words
|
2020-06-22 00:46:08 +02:00 |
|
Ines Montani
|
ef5f548fb0
|
Tidy up and auto-format
|
2020-06-21 22:38:04 +02:00 |
|
Ines Montani
|
f77e0bc028
|
Merge branch 'develop' into master-tmp
|
2020-06-21 22:34:15 +02:00 |
|
Ines Montani
|
40bb918a4c
|
Remove unicode declarations and tidy up
|
2020-06-21 22:34:10 +02:00 |
|
Matthew Honnibal
|
6670c44390
|
Unskip tests
|
2020-06-21 01:17:52 +02:00 |
|
Matthew Honnibal
|
90d9f04e0b
|
Unskip
|
2020-06-21 01:16:33 +02:00 |
|
Matthew Honnibal
|
2b180ea033
|
Update test
|
2020-06-21 01:15:41 +02:00 |
|
Matthew Honnibal
|
192b94f0a1
|
Remove beam test
|
2020-06-21 01:15:12 +02:00 |
|
Matthew Honnibal
|
9db66ddd48
|
Update test_arc_eager_oracle
|
2020-06-21 01:12:28 +02:00 |
|
Matthew Honnibal
|
7544c21f5b
|
Update transition system
|
2020-06-21 01:12:05 +02:00 |
|
Matthew Honnibal
|
318a046fb0
|
Restore ArcEager.get_cost function
|
2020-06-21 01:11:08 +02:00 |
|
Matthew Honnibal
|
e90341810c
|
Update arc_eager oracle
|
2020-06-21 01:04:02 +02:00 |
|
Matthew Honnibal
|
c58deb3546
|
Work on parser oracle
|
2020-06-21 01:01:09 +02:00 |
|
svlandeg
|
689600e17d
|
add additional test back in (it works now)
|
2020-06-20 23:23:57 +02:00 |
|
svlandeg
|
2f6062a8a4
|
add line that got removed from EntityLinker
|
2020-06-20 23:14:45 +02:00 |
|
svlandeg
|
12dc8ab208
|
remove redundant code from master in EntityLinker
|
2020-06-20 23:07:42 +02:00 |
|
svlandeg
|
6179774278
|
fix test_build_dependencies by ignoring new libs
|
2020-06-20 22:49:37 +02:00 |
|
svlandeg
|
256d4c27c8
|
fix tagger begin_training being called without examples
|
2020-06-20 22:38:00 +02:00 |
|
Matthew Honnibal
|
914924a68b
|
Fix mimport
|
2020-06-20 22:22:40 +02:00 |
|
Matthew Honnibal
|
2791c1c0dc
|
Fix module name of corpus
|
2020-06-20 22:22:14 +02:00 |
|
Matthew Honnibal
|
4bbc277758
|
Update after removing GoldCorpus
|
2020-06-20 22:21:24 +02:00 |
|
Matthew Honnibal
|
64d00520e2
|
Update imports
|
2020-06-20 22:21:08 +02:00 |
|
Matthew Honnibal
|
cfd024536d
|
Remove GoldCorpus
|
2020-06-20 22:13:37 +02:00 |
|
Matthew Honnibal
|
fd83551eb5
|
Skip test causing segfault
|
2020-06-20 22:11:27 +02:00 |
|
svlandeg
|
5cb812e0ab
|
fix NER warn empty lookups (cf PR #5588)
|
2020-06-20 22:04:18 +02:00 |
|
Matthew Honnibal
|
095710e40e
|
Skip tests that cause crashes
|
2020-06-20 22:02:32 +02:00 |
|
Matthew Honnibal
|
0b23fd3891
|
Xfail some tests
|
2020-06-20 21:52:57 +02:00 |
|
Matthew Honnibal
|
6af99f2f2d
|
Fix parser declaration
|
2020-06-20 21:50:17 +02:00 |
|
Matthew Honnibal
|
52edb24f07
|
Update header
|
2020-06-20 21:50:06 +02:00 |
|
Matthew Honnibal
|
0c10831b14
|
Start debugging arc_eager oracle
|
2020-06-20 21:49:46 +02:00 |
|
Matthew Honnibal
|
2bcb5881d7
|
Fix parser model
|
2020-06-20 21:49:31 +02:00 |
|
Matthew Honnibal
|
396dd60b3a
|
Fix Corpus
|
2020-06-20 21:49:15 +02:00 |
|
Matthew Honnibal
|
450c6fe39c
|
Update train.py
|
2020-06-20 21:49:06 +02:00 |
|
svlandeg
|
c9242e9bf4
|
fix entity linker (cf PR #5548)
|
2020-06-20 21:47:23 +02:00 |
|
svlandeg
|
dc069e90b3
|
fix token.morph_ for v.3 (cf PR #5517)
|
2020-06-20 21:13:11 +02:00 |
|
Matthew Honnibal
|
6d821b2e55
|
Make doc.from_array several times faster
|
2020-06-20 20:17:13 +02:00 |
|
Matthew Honnibal
|
fa86aa581d
|
Allocate Doc before starting to add words
|
2020-06-20 20:15:21 +02:00 |
|
Matthew Honnibal
|
652f31d3ee
|
Update DocBin
|
2020-06-20 20:12:54 +02:00 |
|
Matthew Honnibal
|
0a8b6631a2
|
Update Corpus
|
2020-06-20 20:12:31 +02:00 |
|
Matthew Honnibal
|
11fa0658f7
|
Work on train script
|
2020-06-20 20:12:19 +02:00 |
|
Ines Montani
|
988d2a4eda
|
Add --code-path option to train CLI (#5618)
|
2020-06-20 18:43:12 +02:00 |
|