Jim Geovedi
|
58d8078971
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-25 09:21:49 +08:00 |
|
Matthew Honnibal
|
6ceb0f0518
|
Allow Lexeme.rank to be set
|
2017-08-24 21:43:00 +02:00 |
|
Matthew Honnibal
|
44a1fa80d3
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-23 13:02:16 +02:00 |
|
ines
|
bb1abbeba5
|
Only link model if download was successful
|
2017-08-23 12:36:31 +02:00 |
|
Matthew Honnibal
|
bb2541ffd3
|
Fix PROB attr for OOV words
|
2017-08-23 12:11:52 +02:00 |
|
Matthew Honnibal
|
1c5c256e58
|
Fix fine_tune when optimizer is None
|
2017-08-23 10:51:33 +02:00 |
|
Matthew Honnibal
|
9c580ad28a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-22 17:02:04 -05:00 |
|
Matthew Honnibal
|
a4633fff6f
|
Restore use of batch norm in model
|
2017-08-22 17:01:58 -05:00 |
|
Matthew Honnibal
|
03b5b9727a
|
Fix Doc.vector for empty doc objects
|
2017-08-22 19:52:19 +02:00 |
|
Matthew Honnibal
|
0551b7b03a
|
Fix doc.vector
|
2017-08-22 19:46:52 +02:00 |
|
Matthew Honnibal
|
83f8e98450
|
Fix retrieval of OOV vectors
|
2017-08-22 19:46:35 +02:00 |
|
Matthew Honnibal
|
df2745eb08
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-22 19:00:43 +02:00 |
|
Matthew Honnibal
|
5b329acbf2
|
Fix vectors_length property in vocab
|
2017-08-22 19:00:27 +02:00 |
|
Matthew Honnibal
|
1fe605dfe5
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-21 19:18:31 -05:00 |
|
Matthew Honnibal
|
18b64e79ec
|
Fix fine tuning
|
2017-08-21 19:18:26 -05:00 |
|
Matthew Honnibal
|
682346dd66
|
Restore optimized hidden_depth=0 for parser
|
2017-08-21 19:18:04 -05:00 |
|
Matthew Honnibal
|
a21d8f3f0b
|
Add predict paths to _ml models
|
2017-08-21 23:23:45 +02:00 |
|
Matthew Honnibal
|
cec76801dc
|
Add profile command to CLI
|
2017-08-21 23:23:05 +02:00 |
|
Matthew Honnibal
|
7be5f30f17
|
Add profile function
|
2017-08-21 23:22:49 +02:00 |
|
ines
|
a68dc891ea
|
Port over changes from #1281
|
2017-08-21 23:19:18 +02:00 |
|
Matthew Honnibal
|
5e50a65252
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-21 14:15:46 -05:00 |
|
Matthew Honnibal
|
80acbc5f1f
|
Fix fine-tune weight mixture
|
2017-08-21 14:15:29 -05:00 |
|
ines
|
d15775c3ad
|
Fix typos and commands in alpha docs
|
2017-08-21 13:40:11 +02:00 |
|
Gyorgy Orosz
|
b3576bfc86
|
Added vector loading to model cli
|
2017-08-20 23:16:12 +02:00 |
|
Matthew Honnibal
|
c10f63bf10
|
Initialize fine tuning to 0.5
|
2017-08-20 15:59:48 -05:00 |
|
Matthew Honnibal
|
62878e50db
|
Fix misalignment caused by filtering inputs at wrong point in parser
|
2017-08-20 15:59:28 -05:00 |
|
Matthew Honnibal
|
78a5f842e9
|
Fix update when update_shared=False
|
2017-08-20 15:58:34 -05:00 |
|
Matthew Honnibal
|
7a6edeea68
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 12:55:39 -05:00 |
|
Matthew Honnibal
|
f2f9229964
|
Fix name of update_shared flag
|
2017-08-20 18:19:06 +02:00 |
|
Matthew Honnibal
|
8a59718fd6
|
Fix fine-tuning
|
2017-08-20 18:17:35 +02:00 |
|
Matthew Honnibal
|
80a5146ec2
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 11:07:08 -05:00 |
|
Matthew Honnibal
|
84bb543e4d
|
Add gold_preproc flag to cli/train
|
2017-08-20 11:07:00 -05:00 |
|
Matthew Honnibal
|
3fe0d76e6d
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-20 14:50:01 +02:00 |
|
Matthew Honnibal
|
c1d3ff517a
|
Track loss in tagger
|
2017-08-20 14:42:23 +02:00 |
|
Matthew Honnibal
|
8875590081
|
Add optimizer in Language.update if sgd=None
|
2017-08-20 14:42:07 +02:00 |
|
Matthew Honnibal
|
84b7ed49e4
|
Ensure updates aren't made if no gold available
|
2017-08-20 14:41:38 +02:00 |
|
Ines Montani
|
c2bbd393af
|
Merge pull request #1276 from oroszgy/model_cli_v2
Ported model cli from v1
|
2017-08-20 11:52:59 +02:00 |
|
Jim Geovedi
|
f77443ab68
|
reworked
|
2017-08-20 13:43:21 +07:00 |
|
Jim Geovedi
|
fbc62a09c7
|
added {pre,suf,in}fix tests
|
2017-08-20 13:43:00 +07:00 |
|
Jim Geovedi
|
713d7c0aa0
|
added indonesian lang test
|
2017-08-20 12:17:14 +07:00 |
|
Jim Geovedi
|
b7d83f37c8
|
indonesian abbr.
|
2017-08-20 12:16:50 +07:00 |
|
Jim Geovedi
|
7193c47f0b
|
direct lookup
|
2017-08-20 11:57:52 +07:00 |
|
Jim Geovedi
|
fdf802d505
|
added examples
|
2017-08-20 11:57:10 +07:00 |
|
Jim Geovedi
|
fa544e6c9a
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-20 11:49:40 +07:00 |
|
Matthew Honnibal
|
42fa84075f
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-19 22:42:50 +02:00 |
|
Matthew Honnibal
|
aefef6fd28
|
Prevent strings from being lost during from_disk and from_bytes
|
2017-08-19 22:42:17 +02:00 |
|
ines
|
281e7e58b3
|
Don't escape forward slashes on ujson.dumps
|
2017-08-19 22:32:16 +02:00 |
|
ines
|
2d126a00ae
|
Fix typo
|
2017-08-19 22:32:07 +02:00 |
|
Matthew Honnibal
|
41c2218c53
|
Fix test for vectors
|
2017-08-19 22:09:12 +02:00 |
|
Matthew Honnibal
|
b8e1603cc4
|
Fix load fail for missing vectors
|
2017-08-19 22:07:00 +02:00 |
|
Matthew Honnibal
|
a3c51a0355
|
Fix creation of pipeline
|
2017-08-19 21:58:57 +02:00 |
|
Gyorgy Orosz
|
e5344b83a3
|
Ported model cli from v1
|
2017-08-19 21:45:23 +02:00 |
|
Matthew Honnibal
|
6a94648373
|
Fix serialization
|
2017-08-19 21:27:35 +02:00 |
|
Matthew Honnibal
|
1157294434
|
Improve vector handling
|
2017-08-19 20:35:33 +02:00 |
|
Matthew Honnibal
|
ef87562741
|
Restore vectors test utils
|
2017-08-19 20:35:16 +02:00 |
|
Matthew Honnibal
|
1391f9da37
|
Restore vectors tests
|
2017-08-19 20:34:58 +02:00 |
|
Matthew Honnibal
|
8cfeeb4884
|
Increment version
|
2017-08-19 19:52:58 +02:00 |
|
Matthew Honnibal
|
93fb8b64e9
|
Fix vector loading
|
2017-08-19 19:52:25 +02:00 |
|
Matthew Honnibal
|
49a615e7d9
|
Create Vectors object in Vocab
|
2017-08-19 18:50:16 +02:00 |
|
Matthew Honnibal
|
3d049af563
|
Improve vectors to/from disk
|
2017-08-19 18:42:11 +02:00 |
|
Matthew Honnibal
|
d55d6e1cfa
|
Fix comparison of Token from different docs. Closes #1257
|
2017-08-19 16:39:32 +02:00 |
|
Matthew Honnibal
|
9b6a5df15e
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-19 16:24:57 +02:00 |
|
Matthew Honnibal
|
4fda02c7e6
|
Add test for new Span.to_array method
|
2017-08-19 16:24:38 +02:00 |
|
Matthew Honnibal
|
dea229c634
|
Fix Span.to_array method
|
2017-08-19 16:24:28 +02:00 |
|
Matthew Honnibal
|
c606b4a42c
|
Add test for Doc.char_span
|
2017-08-19 16:18:23 +02:00 |
|
Matthew Honnibal
|
8b7ac77c23
|
Allow span label to be string in Doc.char_span
|
2017-08-19 16:18:09 +02:00 |
|
Matthew Honnibal
|
7c47e38c12
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-19 09:03:15 -05:00 |
|
Matthew Honnibal
|
ab28f911b4
|
Fix parser learning rates
|
2017-08-19 09:02:57 -05:00 |
|
ines
|
1fe5e1a4d1
|
Add language example sentences (see #1107)
da, de, en, es, fr, he, it, nb, pl, pt, sv
|
2017-08-19 12:22:29 +02:00 |
|
Matthew Honnibal
|
97aabafb5f
|
Document as_tuples keyword arg of Language.pipe
|
2017-08-19 12:21:33 +02:00 |
|
Matthew Honnibal
|
80236116a6
|
Add Doc.char_span method, to get a span by character offset
|
2017-08-19 12:21:09 +02:00 |
|
Matthew Honnibal
|
482bba1722
|
Add Span.to_array method
|
2017-08-19 12:20:45 +02:00 |
|
Matthew Honnibal
|
19c495f451
|
Fix vectors deserialization
|
2017-08-19 04:33:03 +02:00 |
|
Matthew Honnibal
|
42d47c1e5c
|
Fix tagger serialization
|
2017-08-19 04:16:32 +02:00 |
|
Matthew Honnibal
|
2da96a0ec7
|
Fix beam test
|
2017-08-19 04:15:46 +02:00 |
|
Matthew Honnibal
|
a7309a217d
|
Update tagger serialization
|
2017-08-18 23:12:05 +02:00 |
|
Matthew Honnibal
|
bae59bf92f
|
Remove BiLSTM import
|
2017-08-18 22:46:59 +02:00 |
|
Matthew Honnibal
|
c307a0ffb8
|
Restore patches from nn-beam-parser to spacy/syntax
|
2017-08-18 22:38:59 +02:00 |
|
Matthew Honnibal
|
fe90dfc390
|
Restore changes from nn-beam-parser to spacy/_ml
|
2017-08-18 22:38:28 +02:00 |
|
Matthew Honnibal
|
de7e8703e3
|
Restore tests for beam parser
|
2017-08-18 22:27:42 +02:00 |
|
Matthew Honnibal
|
11c31d285c
|
Restore changes from nn-beam-parser
|
2017-08-18 22:26:12 +02:00 |
|
Matthew Honnibal
|
ce321b0322
|
Restore changes from nn-beam-parser to spacy/_ml
|
2017-08-18 22:24:46 +02:00 |
|
Matthew Honnibal
|
5f81d700ff
|
Restore patches from nn-beam-parser to spacy/syntax
|
2017-08-18 22:23:03 +02:00 |
|
Matthew Honnibal
|
ec482580b5
|
Restore changes to pipeline.pyx from nn-beam-parser branch
|
2017-08-18 22:02:35 +02:00 |
|
Matthew Honnibal
|
931509d96a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-18 21:57:15 +02:00 |
|
Matthew Honnibal
|
ed95009b5c
|
Fix data loading on Python 2
|
2017-08-18 21:57:06 +02:00 |
|
Matthew Honnibal
|
baf36d0588
|
Add compat function for importlib.util
|
2017-08-18 21:56:47 +02:00 |
|
Matthew Honnibal
|
263366729e
|
Don't import BiLSTM
|
2017-08-18 21:56:31 +02:00 |
|
Matthew Honnibal
|
28162290b3
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-18 14:55:40 -05:00 |
|
Matthew Honnibal
|
85794c1167
|
Restore state of _ml.py
|
2017-08-18 14:55:23 -05:00 |
|
Matthew Honnibal
|
d456d2efe1
|
Fix conflicts in nn_parser
|
2017-08-18 20:55:58 +02:00 |
|
Matthew Honnibal
|
1cec1efca7
|
Fix merge conflicts in nn_parser from beam stuff
|
2017-08-18 20:50:49 +02:00 |
|
Matthew Honnibal
|
69bcacdc09
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-08-18 20:47:13 +02:00 |
|
Matthew Honnibal
|
2993b54fff
|
Load vectors in vocab
|
2017-08-18 20:46:56 +02:00 |
|
Matthew Honnibal
|
a1ec41298c
|
Restore CFile loader
|
2017-08-18 20:46:16 +02:00 |
|
Matthew Honnibal
|
ed4fb991dc
|
Work on vectors loading
|
2017-08-18 20:45:48 +02:00 |
|
Matthew Honnibal
|
426f84937f
|
Resolve conflicts when merging new beam parsing stuff
|
2017-08-18 13:38:32 -05:00 |
|
Matthew Honnibal
|
5181e8bedb
|
Fix merge conflict in _ml
|
2017-08-18 13:35:51 -05:00 |
|
Matthew Honnibal
|
f75420ae79
|
Unhack beam parsing, moving it under options instead of global flags
|
2017-08-18 13:31:15 -05:00 |
|
Jim Geovedi
|
7ae45bffcf
|
Merge remote-tracking branch 'upstream/develop' into indonesian
|
2017-08-18 10:14:46 +07:00 |
|