Aaron Marquez
|
c7926f72eb
|
add contributor agreement for @enerrio
|
2018-02-15 12:43:04 -08:00 |
|
Matthew Honnibal
|
59b7cf9db8
|
Add get_beam_parse method in ArcEager, for Prodigy
|
2018-02-15 21:03:16 +01:00 |
|
Matthew Honnibal
|
3e541de440
|
Merge branch 'master' of https://github.com/explosion/spaCy
|
2018-02-15 21:02:55 +01:00 |
|
Thomas Opsomer
|
5d24a81c0b
|
add test for span.sent when doc not parsed
|
2018-02-15 16:59:16 +01:00 |
|
Thomas Opsomer
|
deab391cbf
|
correct check on sent_start & raise if no boundaries
|
2018-02-15 16:58:30 +01:00 |
|
Matthew Honnibal
|
afbd46adfb
|
Remove length cap in PhraseMatcher
|
2018-02-15 16:10:54 +01:00 |
|
Matthew Honnibal
|
4533c7408d
|
Update matcher tests
|
2018-02-15 15:39:47 +01:00 |
|
Matthew Honnibal
|
1c19605426
|
Move matcher2.pyx to matcher.pyx
|
2018-02-15 15:27:03 +01:00 |
|
Matthew Honnibal
|
9ebf2fe7c3
|
Make helper function to get longest matches
|
2018-02-15 15:26:15 +01:00 |
|
ines
|
ca56fb53d1
|
Add user survey to navigation [ci skip]
|
2018-02-15 12:14:30 +01:00 |
|
ines
|
cab5b775e7
|
Document ENT_TYPE matcher attribute [ci skip]
|
2018-02-15 12:14:19 +01:00 |
|
Matthew Honnibal
|
4cb861e080
|
Merge pull request #1968 from DuyguA/is_currency
New lexical feature is_currency
|
2018-02-15 12:13:36 +01:00 |
|
Matthew Honnibal
|
ec273857ef
|
Merge pull request #1982 from ursachec/issue-1706
Fix issue #1706
|
2018-02-15 12:09:04 +01:00 |
|
Ines Montani
|
2392ec9d8c
|
Merge pull request #1956 from pktippa/web-doc-patches
Website documentation changes Linguistic features, dependency parse
|
2018-02-15 12:06:26 +01:00 |
|
Thomas Opsomer
|
b902731313
|
Find span sentence when only sentence boundaries (no parser)
|
2018-02-14 22:18:54 +01:00 |
|
Matthew Honnibal
|
d19dc67886
|
Make get_action nogil, for efficiency
|
2018-02-14 12:16:36 +01:00 |
|
Matthew Honnibal
|
7885b92b45
|
Refactor matcher2, hopefully making it faster
|
2018-02-14 12:11:17 +01:00 |
|
Matthew Honnibal
|
00261eea27
|
Make tests refer to matcher2
|
2018-02-14 12:10:51 +01:00 |
|
Claudiu-Vlad Ursache
|
e28de12cbd
|
Ensure files opened in from_disk are closed
Fixes [issue 1706](https://github.com/explosion/spaCy/issues/1706).
|
2018-02-13 20:49:43 +01:00 |
|
Claudiu-Vlad Ursache
|
cdd4b3d05c
|
Add contributor agreement for @ursachec
|
2018-02-13 20:49:42 +01:00 |
|
Matthew Honnibal
|
262cbe356e
|
Remove caching, as doesn't seem to help for now.
|
2018-02-13 17:15:20 +01:00 |
|
Matthew Honnibal
|
f43d53f2c5
|
Remove print statement
|
2018-02-13 17:15:07 +01:00 |
|
Matthew Honnibal
|
c2b0910db4
|
Merge pull request #1979 from emulbreh/1974-random-seeds
Don't fix random seeds on import
|
2018-02-13 16:38:28 +01:00 |
|
Matthew Honnibal
|
dcd8d89aef
|
Update test for 850, making it work with matcher2
|
2018-02-13 16:35:20 +01:00 |
|
Matthew Honnibal
|
9bdfa5cd4f
|
Remove re comparisons tests, as matcher behaves differently
|
2018-02-13 16:28:52 +01:00 |
|
Matthew Honnibal
|
6d7986b0f1
|
Fix matcher test
|
2018-02-13 16:28:06 +01:00 |
|
Matthew Honnibal
|
9efda9e9ab
|
Add PhraseMatcher in matcher2.pyx
|
2018-02-13 16:27:46 +01:00 |
|
Johannes Dollinger
|
012e874d09
|
Add contributor agreement for emulbreh
|
2018-02-13 13:40:33 +01:00 |
|
Johannes Dollinger
|
bf94c13382
|
Don't fix random seeds on import
|
2018-02-13 12:42:23 +01:00 |
|
Matthew Honnibal
|
0004331895
|
Update notes on matcher2
|
2018-02-13 11:45:45 +01:00 |
|
Matthew Honnibal
|
b4cc39eb74
|
Fix zero-width quantifiers. Passes test_matcher
|
2018-02-13 11:45:32 +01:00 |
|
Matthew Honnibal
|
1b01685f47
|
Fix ZERO_PLUS operator
|
2018-02-12 12:28:03 +01:00 |
|
Matthew Honnibal
|
9115c3ba0a
|
Add TODO in notes
|
2018-02-12 12:06:48 +01:00 |
|
Matthew Honnibal
|
b00326a7fe
|
Move pattern_id out of TokenPattern
|
2018-02-12 12:05:54 +01:00 |
|
Matthew Honnibal
|
d34c732635
|
Add Python notes for rethinking matcher
|
2018-02-12 10:19:29 +01:00 |
|
Matthew Honnibal
|
d7c9b53120
|
Pass kwargs into pipeline components during begin_training
|
2018-02-12 10:18:39 +01:00 |
|
Matthew Honnibal
|
0d3262a9f3
|
Compile matcher2
|
2018-02-12 10:18:04 +01:00 |
|
Matthew Honnibal
|
fae5c0dc18
|
Work on matcher2
|
2018-02-12 10:17:43 +01:00 |
|
4altinok
|
ca8728035d
|
added new lex feat to token
|
2018-02-11 18:55:48 +01:00 |
|
4altinok
|
edd7202a06
|
added new symbol
|
2018-02-11 18:55:32 +01:00 |
|
4altinok
|
ed1ac2969e
|
added new lexical feat to lexeme
|
2018-02-11 18:51:48 +01:00 |
|
4altinok
|
94fb0b75e3
|
code for is_currency
|
2018-02-11 18:51:32 +01:00 |
|
4altinok
|
3deef1497a
|
removed 18 and replaced 18 with is_currency
|
2018-02-11 18:51:09 +01:00 |
|
4altinok
|
471d3c9e23
|
added lex test for is_currency
|
2018-02-11 18:50:50 +01:00 |
|
ines
|
c63e99da8a
|
Fix typo in glossary (resolves #1964)
Co-Authored-By: SThomasP <sthomasp@users.noreply.github.com>
|
2018-02-10 11:58:41 +01:00 |
|
Ines Montani
|
b21a096528
|
Merge pull request #1961 from oxinabox/patch-1
Make python 3.4 compat module loading (fix #1733)
|
2018-02-09 17:51:51 +01:00 |
|
Lyndon White
|
94ce43adf0
|
squashme
|
2018-02-09 23:19:11 +08:00 |
|
Lyndon White
|
5b1bc8d101
|
Sign contributors agreement
|
2018-02-09 23:14:29 +08:00 |
|
Lyndon White
|
6ee5dff51c
|
Make python 3.4 compat module loading (fix #1733)
|
2018-02-09 23:03:35 +08:00 |
|
Pradeep Kumar Tippa
|
416cd021ce
|
Added TAG from spacy symbols which used below
|
2018-02-09 19:16:59 +05:30 |
|