| Author | Commit | Message | Date |
|---|---|---|---|
| svlandeg | 76184374e2 | test corner cases | 2019-07-22 13:39:32 +02:00 |
| svlandeg | 9f8c1e71a2 | fix for Issue #4000 | 2019-07-22 13:34:12 +02:00 |
| svlandeg | dae8a21282 | rename entity frequency | 2019-07-19 17:40:28 +02:00 |
| svlandeg | f75d1299a7 | formatting | 2019-07-19 14:52:45 +02:00 |
| svlandeg | 41fb5204ba | output tensors as part of predict | 2019-07-19 14:47:36 +02:00 |
| svlandeg | 21176517a7 | have gold.links correspond exactly to doc.ents | 2019-07-19 12:36:15 +02:00 |
| svlandeg | e1213eaf6a | use original gold object in get_loss function | 2019-07-18 13:35:10 +02:00 |
| svlandeg | ec55d2fccd | filter training data beforehand (+black formatting) | 2019-07-18 10:22:24 +02:00 |
| svlandeg | d833d4c358 | fixes in kb and gold | 2019-07-17 17:18:26 +02:00 |
| svlandeg | 4086c6ff60 | get vector functionality + unit test | 2019-07-17 12:17:02 +02:00 |
| svlandeg | a63d15a142 | code cleanup | 2019-07-15 17:36:43 +02:00 |
| svlandeg | cdc589d344 | small fix | 2019-07-15 12:04:45 +02:00 |
| svlandeg | 60f299374f | set default context width | 2019-07-15 12:03:09 +02:00 |
| svlandeg | 6e809e9b8b | proper error for missing cfg arguments | 2019-07-15 11:42:50 +02:00 |
| svlandeg | 6026958957 | tokenizer doc fix | 2019-07-15 11:19:34 +02:00 |
| Ines Montani | c0e29f7029 | Merge pull request #3957 from sorenlind/danish-tokenizer-slash<br>Make Danish tokenizer split on forward slash | 2019-07-12 18:19:22 +02:00 |
| Matthew Honnibal | ef666656b3 | Fix attrs alignment | 2019-07-12 17:59:47 +02:00 |
| Matthew Honnibal | c345c042b0 | Fix symbol alignment | 2019-07-12 17:48:38 +02:00 |
| Ines Montani | 7281026879 | Increment version [ci skip] | 2019-07-12 17:40:00 +02:00 |
| Søren Lind Kristiansen | 26aee70d95 | Make Danish tokenizer split on forward slash | 2019-07-12 15:20:42 +02:00 |
| Ines Montani | 02e12b0852 | Update landing with IRL videos [ci skip] | 2019-07-12 13:36:47 +02:00 |
| Matthew Honnibal | 3bc4d618f9 | Set version to v2.1.5 | 2019-07-12 13:26:12 +02:00 |
| Sofie Van Landeghem | ed774cb953 | Fixing ngram bug (#3953)<br>* minimal failing example for Issue #3661<br>* referenced Issue #3661 instead of Issue #3611<br>* cleanup | 2019-07-12 10:01:35 +02:00 |
| Ines Montani | 123929b58b | Update Thinc version pin | 2019-07-12 00:15:35 +02:00 |
| Ines Montani | cda9fc3dae | Update Thinc version pin | 2019-07-11 15:53:13 +02:00 |
| Matthew Honnibal | 09dc01a426 | Fix #3853, and add warning | 2019-07-11 14:46:47 +02:00 |
| Matthew Honnibal | 7369949d2e | Add warning for #3853 | 2019-07-11 14:46:47 +02:00 |
| Ines Montani | 673c864a06 | Fix doc.count_by functionality (#3950)<br>Fix doc.count_by functionality | 2019-07-11 13:44:00 +02:00 |
| Ines Montani | 2426f4d44c | Fix default punctuation rules for splitting Hindi text (#3948)<br>Fix default punctuation rules for splitting Hindi text<br>Co-authored-by: yash <patadiayash@gmail.com><br>Co-authored-by: Ines Montani <ines@ines.io> | 2019-07-11 13:36:28 +02:00 |
| svlandeg | 349107daa3 | cleanup | 2019-07-11 13:09:22 +02:00 |
| svlandeg | 0f0f07318a | counter instead of preshcounter | 2019-07-11 13:05:53 +02:00 |
| Matthew Honnibal | b40b4c2c31 | 💫 Fix issue #3839: Incorrect entity IDs from Matcher with operators (#3949)<br>* Add regression test for issue #3541<br>* Add comment on bugfix<br>* Remove incorrect test<br>* Un-xfail test | 2019-07-11 12:55:11 +02:00 |
| Matthew Honnibal | e19f4ee719 | Add warning message re Issue #3853 | 2019-07-11 12:50:38 +02:00 |
| Ines Montani | 197cfd7ebc | Merge branch 'master' into pr/3948 | 2019-07-11 12:18:31 +02:00 |
| Ines Montani | d166756607 | Fix test | 2019-07-11 12:16:43 +02:00 |
| Ines Montani | 0b8406a05c | Tidy up and auto-format | 2019-07-11 12:02:25 +02:00 |
| yash | 6751af3e78 | Merge branch 'master' of https://github.com/yash1994/spaCy | 2019-07-11 15:26:57 +05:30 |
| yash | ae2d52e323 | Add default encoding utf-8 for test file | 2019-07-11 15:26:27 +05:30 |
| Ines Montani | 33ca0a036a | Merge branch 'master' into pr/3948 | 2019-07-11 11:55:54 +02:00 |
| Matthew Honnibal | 0491a8e7c8 | Reformat | 2019-07-11 11:49:36 +02:00 |
| Matthew Honnibal | bd3c3f342b | Fix _serialize | 2019-07-11 11:48:55 +02:00 |
| yash | 815f8d13dd | Fix default punctuation rules for hindi text (#3625 explosion) | 2019-07-11 15:00:51 +05:30 |
| yash | d5311b3c42 | Add test file for issue (#3625) and spacy contributor agreement | 2019-07-11 14:53:14 +05:30 |
| svlandeg | e080412385 | tracked the bug down to PreshCounter.inc - still unclear what goes wrong | 2019-07-11 01:53:06 +02:00 |
| svlandeg | a89fecce97 | failing unit test for issue #3869 | 2019-07-11 00:43:55 +02:00 |
| Matthew Honnibal | a388888074 | Merge branch 'master' of https://github.com/explosion/spaCy | 2019-07-10 22:54:17 +02:00 |
| Matthew Honnibal | c6cb782758 | Set version to 2.1.5.dev0 | 2019-07-10 22:54:09 +02:00 |
| Sofie Van Landeghem | c4c21cb428 | more friendly textcat errors (#3946)<br>* more friendly textcat errors with require_model and require_labels<br>* update thinc version with recent bugfix | 2019-07-10 19:39:38 +02:00 |
| Matthew Honnibal | b94c5443d9 | Rename Binder->DocBox, and improve it. | 2019-07-10 19:37:20 +02:00 |
| Matthew Honnibal | 3d18600c05 | Return True from doc.is_... when no ambiguity<br>* Make doc.is_sentenced return True if len(doc) < 2.<br>* Make doc.is_nered return True if len(doc) == 0, for consistency.<br>Closes #3934 | 2019-07-10 19:21:42 +02:00 |