Matthew Honnibal | 50817dc9ad | Improve parser oracle around sentence breaks. | 2018-02-22 19:22:26 +01:00
Matthew Honnibal | 001e2ec6d6 | Refactor CoNLL training script | 2018-02-22 16:00:34 +01:00
Matthew Honnibal | 6a27a4f77c | Set accelerating batch size in CONLL train script | 2018-02-21 21:02:41 +01:00
Matthew Honnibal | 661873ee4c | Randomize the rebatch size in parser | 2018-02-21 21:02:07 +01:00
Matthew Honnibal | 0872cf611d | Don't lower-case lemmas of proper nouns | 2018-02-21 16:01:16 +01:00
Matthew Honnibal | a0ddb803fd | Make error when no label found more helpful | 2018-02-21 16:00:59 +01:00
Matthew Honnibal | ea2fc5d45f | Improve length and freq cutoffs in parser | 2018-02-21 16:00:38 +01:00
Matthew Honnibal | e5757d4bf0 | Add labels property to parser | 2018-02-21 16:00:00 +01:00
Matthew Honnibal | 4dc0fc9954 | Replace labels that didn't make freq cutoff | 2018-02-21 15:59:22 +01:00
Matthew Honnibal | eff4ae809a | Fix nonproj label filter | 2018-02-21 15:59:04 +01:00
Matthew Honnibal | 97164b1763 | Fix conllu script | 2018-02-21 14:46:54 +01:00
Matthew Honnibal | 24fb2c246f | Add script to do conllu training | 2018-02-21 13:53:59 +01:00
Matthew Honnibal | e624405cda | Temporarily remove cutoff when filtering labels in nonproj | 2018-02-21 13:53:40 +01:00
Matthew Honnibal | f466f0186e | Use new alignment implementation in GoldParse | 2018-02-20 21:16:35 +01:00
Matthew Honnibal | c0734ba526 | Make alignment work with strings | 2018-02-20 17:51:49 +01:00
Matthew Honnibal | 8180c84a98 | Add tests for new Levenshtein alignment | 2018-02-20 17:32:25 +01:00
Matthew Honnibal | f46bf2a7e9 | Build _align.pyx | 2018-02-20 17:32:13 +01:00
Matthew Honnibal | 930c980570 | Add improved Levenshtein alignment implementation | 2018-02-20 17:31:56 +01:00
Matthew Honnibal | 2bccad8815 | Fix incorrect matcher test | 2018-02-18 14:56:12 +01:00
Matthew Honnibal | 530172d57a | Merge branch 'master' of https://github.com/explosion/spaCy into feature/better-faster-matcher | 2018-02-18 14:40:42 +01:00
Matthew Honnibal | c9eeceba00 | Merge branch 'master' of https://github.com/explosion/spaCy | 2018-02-18 14:18:06 +01:00
Matthew Honnibal | cf0e320f2b | Add doc.is_sentenced attribute, re #1959 | 2018-02-18 14:16:55 +01:00
ines | 29106ec740 | Add "new" tag to is_currency [ci skip] | 2018-02-18 14:16:26 +01:00
ines | ca2fcad5a3 | Add v2.1 tag to new arguments [ci skip] | 2018-02-18 14:15:18 +01:00
ines | 64f97adef1 | Document new Matcher.pipe keyword args [ci skip]. See 1cf774bdc1 | 2018-02-18 14:13:58 +01:00
Matthew Honnibal | 1e5aeb4eec | Merge pull request #1987 from thomasopsomer/span-sent: Make span.sent work when only manual / custom sbd | 2018-02-18 14:05:37 +01:00
Matthew Honnibal | 1cf774bdc1 | Add output options return_matches and as_tuples to Matcher | 2018-02-18 14:00:45 +01:00
Matthew Honnibal | dd9b0945af | Fix inconsistencies in the symbols table | 2018-02-18 13:51:31 +01:00
Matthew Honnibal | 66496ac8e1 | Set version to v2.1.0.dev0 | 2018-02-18 13:48:39 +01:00
Matthew Honnibal | eb3040ce46 | Merge pull request #1891 from fucking-signup/master: Fix issue #1889 | 2018-02-18 13:47:47 +01:00
Matthew Honnibal | 70cd94f866 | Remove matcher2 from setup.py | 2018-02-18 13:46:00 +01:00
Matthew Honnibal | 3d7285870b | Update matcher branch with v2.0.8 master | 2018-02-18 13:42:58 +01:00
ines | 61052df31f | Document is_currency | 2018-02-18 13:30:03 +01:00
ines | 6bba1db4cc | Drop six and related hacks as a dependency | 2018-02-18 13:29:56 +01:00
Matthew Honnibal | b30b09192a | Merge pull request #1665 from jimregan/animacy: typo in "inan", add "nhum" | 2018-02-18 13:26:53 +01:00
Matthew Honnibal | 1b3c98e01b | Set version to v2.0.8 | 2018-02-18 12:16:31 +01:00
Matthew Honnibal | f9f46e5a07 | Revert matcher fixes from GregDubbin | 2018-02-18 10:59:28 +01:00
Matthew Honnibal | 86405e4ad1 | Fix CLI for multitask objectives | 2018-02-18 10:59:11 +01:00
Matthew Honnibal | a34749b2bf | Add multitask objectives options to train CLI | 2018-02-17 22:03:54 +01:00
Matthew Honnibal | 8f06903e09 | Fix multitask objectives | 2018-02-17 18:41:36 +01:00
Matthew Honnibal | d1246c95fb | Fix model loading when using multitask objectives | 2018-02-17 18:11:36 +01:00
Matthew Honnibal | 262d0a3148 | Fix overwriting of lexical attributes when loading vectors during training | 2018-02-17 18:11:11 +01:00
Matthew Honnibal | c0caf7cf27 | Fix LANG symbol | 2018-02-17 18:10:50 +01:00
Matthew Honnibal | 0bf2f6be29 | Add missing symbol for LANG attr. Fixes inconsistent numeric ID | 2018-02-17 17:37:02 +01:00
Matthew Honnibal | 97a228a4ce | Increment to v2.0.8.dev0 | 2018-02-17 16:54:36 +01:00
Matthew Honnibal | f7dc64d2a3 | Merge branch 'master' of https://github.com/explosion/spaCy into feature/better-faster-matcher | 2018-02-17 16:47:35 +01:00
Matthew Honnibal | 95c1de90fd | Merge pull request #1988 from enerrio/issue-1959: Fix Issue #1959 | 2018-02-17 16:41:55 +01:00
ines | 612c79a4f5 | Update first matcher example and match_id (resolves #1989) | 2018-02-17 11:57:38 +01:00
Aaron Marquez | ea571e8325 | Merge branch 'master' into issue-1959 | 2018-02-16 15:14:09 -08:00
Matthew Honnibal | 7d5c720fc3 | Fix multitask objective when no pipeline provided | 2018-02-15 23:50:21 +01:00