Matthew Honnibal
7885b92b45
Refactor matcher2, hopefully making it faster
2018-02-14 12:11:17 +01:00
Matthew Honnibal
00261eea27
Make tests refer to matcher2
2018-02-14 12:10:51 +01:00
Claudiu-Vlad Ursache
e28de12cbd
Ensure files opened in from_disk
are closed
...
Fixes [issue 1706](https://github.com/explosion/spaCy/issues/1706 ).
2018-02-13 20:49:43 +01:00
Claudiu-Vlad Ursache
cdd4b3d05c
Add contributor agreement for @ursachec
2018-02-13 20:49:42 +01:00
Matthew Honnibal
262cbe356e
Remove caching, as doesn't seem to help for now.
2018-02-13 17:15:20 +01:00
Matthew Honnibal
f43d53f2c5
Remove print statement
2018-02-13 17:15:07 +01:00
Matthew Honnibal
c2b0910db4
Merge pull request #1979 from emulbreh/1974-random-seeds
...
Don't fix random seeds on import
2018-02-13 16:38:28 +01:00
Matthew Honnibal
dcd8d89aef
Update test for 850, making it work with matcher2
2018-02-13 16:35:20 +01:00
Matthew Honnibal
9bdfa5cd4f
Remove re comparisons tests, as matcher behaves differently
2018-02-13 16:28:52 +01:00
Matthew Honnibal
6d7986b0f1
Fix matcher test
2018-02-13 16:28:06 +01:00
Matthew Honnibal
9efda9e9ab
Add PhraseMatcher in matcher2.pyx
2018-02-13 16:27:46 +01:00
Johannes Dollinger
012e874d09
Add contributor agreement for emulbreh
2018-02-13 13:40:33 +01:00
Johannes Dollinger
bf94c13382
Don't fix random seeds on import
2018-02-13 12:42:23 +01:00
Matthew Honnibal
0004331895
Update notes on matcher2
2018-02-13 11:45:45 +01:00
Matthew Honnibal
b4cc39eb74
Fix zero-width quantifiers. Passes test_matcher
2018-02-13 11:45:32 +01:00
Matthew Honnibal
1b01685f47
Fix ZERO_PLUS operator
2018-02-12 12:28:03 +01:00
Matthew Honnibal
9115c3ba0a
Add TODO in notes
2018-02-12 12:06:48 +01:00
Matthew Honnibal
b00326a7fe
Move pattern_id out of TokenPattern
2018-02-12 12:05:54 +01:00
Matthew Honnibal
d34c732635
Add Python notes for rethinking matcher
2018-02-12 10:19:29 +01:00
Matthew Honnibal
d7c9b53120
Pass kwargs into pipeline components during begin_training
2018-02-12 10:18:39 +01:00
Matthew Honnibal
0d3262a9f3
Compile matcher2
2018-02-12 10:18:04 +01:00
Matthew Honnibal
fae5c0dc18
Work on matcher2
2018-02-12 10:17:43 +01:00
4altinok
ca8728035d
added new lex feat to token
2018-02-11 18:55:48 +01:00
4altinok
edd7202a06
added new symbol
2018-02-11 18:55:32 +01:00
4altinok
ed1ac2969e
added new lexical feat to lexeme
2018-02-11 18:51:48 +01:00
4altinok
94fb0b75e3
code for is_currency
2018-02-11 18:51:32 +01:00
4altinok
3deef1497a
removed 18 and replaced 18 with is_currency
2018-02-11 18:51:09 +01:00
4altinok
471d3c9e23
added lex test for is_currency
2018-02-11 18:50:50 +01:00
ines
c63e99da8a
Fix typo in glossary ( resolves #1964 )
...
Co-Authored-By: SThomasP <sthomasp@users.noreply.github.com>
2018-02-10 11:58:41 +01:00
Ines Montani
b21a096528
Merge pull request #1961 from oxinabox/patch-1
...
Make python 3.4 compat module loading (fix #1733 )
2018-02-09 17:51:51 +01:00
Lyndon White
94ce43adf0
squashme
2018-02-09 23:19:11 +08:00
Lyndon White
5b1bc8d101
Sign contributors agreement
2018-02-09 23:14:29 +08:00
Lyndon White
6ee5dff51c
Make python 3.4 compat module loading ( fix #1733 )
2018-02-09 23:03:35 +08:00
Pradeep Kumar Tippa
416cd021ce
Added TAG from spacy symbols which used below
2018-02-09 19:16:59 +05:30
Pradeep Kumar Tippa
01cc9cd9c0
assert statement syntax fix in doc
2018-02-09 19:16:25 +05:30
Pradeep Kumar Tippa
a78062e466
Merge remote-tracking branch 'upstream/master' into web-doc-patches
2018-02-09 19:13:19 +05:30
ines
ab33e274f5
Add more details on symlink error & Windows solution ( resolves #1941 ) [ci skip]
2018-02-09 10:43:33 +01:00
ines
8eaa934382
Merge branch 'master' of https://github.com/explosion/spaCy
2018-02-09 10:23:36 +01:00
ines
e9f67be04d
Fix regex flag matcher example ( resolves #1950 )
2018-02-09 10:23:33 +01:00
ines
fc4ae04c55
Document LENGTH attribute in matcher
2018-02-09 10:23:03 +01:00
Pradeep Kumar Tippa
8a7467b26e
Merge remote-tracking branch 'upstream/master' into web-doc-patches
2018-02-09 13:54:26 +05:30
Matthew Honnibal
18e02e366e
Merge pull request #1957 from mdcclv/patch-1
...
update link to Honnibal and Johnson 2015
2018-02-09 07:26:47 +01:00
Orion Montoya
24af6375db
update link to Honnibal and Johnson 2015
...
aclweb.org is throwing a gateway timeout on the link as `https`+`aclweb.org`, but is fine with `https`+`www.aclweb.org` (also with `http`+`aclweb.org`, but let's keep it in `https`, shall we?
2018-02-08 10:49:09 -08:00
Matthew Honnibal
ab35ac4e6f
Merge branch 'master' of https://github.com/explosion/spaCy
2018-02-08 15:25:57 +01:00
Matthew Honnibal
e361b4f82b
Fix #1929 : Incorrect NER when pre-set sentence boundaries.
2018-02-08 15:25:41 +01:00
Pradeep Kumar Tippa
03113d6779
Fixing navigating parse tree doc under dependency parse
2018-02-08 19:34:15 +05:30
Pradeep Kumar Tippa
76cee1a1c3
Merge remote-tracking branch 'upstream/master' into web-doc-patches
2018-02-08 19:27:33 +05:30
ines
a3b965b29d
Remove UPPER from Matcher attributes docs ( resolves #1949 )
2018-02-08 11:29:27 +01:00
ines
696ae87b47
Fix whitespace
2018-02-08 11:28:54 +01:00
ines
26bc75134d
Fix typo
2018-02-08 11:28:44 +01:00