Matthew Honnibal
|
1ee6b468a9
|
* Adjust arc_eager oracle, so that recovering errors via non-monotonic actions gives negative cost. Need to test this with greedy parser.
|
2016-07-26 19:12:00 +02:00 |
|
Matthew Honnibal
|
0bf448461e
|
Work on beam parser, with max violation
|
2016-07-24 14:26:52 +02:00 |
|
Matthew Honnibal
|
a1281835a8
|
Clean up commented out code from beam parser.
|
2016-07-24 11:02:39 +02:00 |
|
Matthew Honnibal
|
476977ef62
|
Start work on max violation update. About to clean up commented out code.
|
2016-07-24 11:01:54 +02:00 |
|
Matthew Honnibal
|
8b4abc24e3
|
Fix beam parsing. Starting to work with early update.
|
2016-07-24 10:45:50 +02:00 |
|
Matthew Honnibal
|
407ed4652d
|
* Work on neural network beam
|
2016-07-24 10:44:59 +02:00 |
|
Matthew Honnibal
|
27176c3d2f
|
Fix beam parser. Starting to work
|
2016-07-24 01:14:56 +02:00 |
|
Matthew Honnibal
|
e2a9a68b66
|
* Work on beam parser
|
2016-07-23 06:07:09 +02:00 |
|
Matthew Honnibal
|
de7c6c48d8
|
Working NN, but very messy. Relies on BLIS.
|
2016-07-20 16:28:02 +02:00 |
|
Matthew Honnibal
|
7c2f1a673b
|
* Working neural net, but features hacky. Switching to extractor.
|
2016-05-26 19:06:10 +02:00 |
|
Matthew Honnibal
|
8036368d96
|
* Fix model saving
|
2016-05-23 12:01:46 +00:00 |
|
Matthew Honnibal
|
35214053fd
|
* Work around get_lex_attr bug introduced during German parsing
|
2016-05-23 10:53:00 +00:00 |
|
Matthew Honnibal
|
bc3c8d8adf
|
Fix lemma of "coping"
Fix Issue #389: Incorrect lemma for "coping"
|
2016-05-20 19:03:41 +10:00 |
|
Matthew Honnibal
|
cdc10e9a1c
|
* Fix Issue #375: noun phrase iteration results in index error if noun phrases are merged during the loop. Fix by accumulating the spans inside the noun_chunks property, allowing the Span index tricks to work.
|
2016-05-20 10:14:06 +02:00 |
|
Matthew Honnibal
|
13fad36e49
|
* Cosmetic change to english noun chunks iterator -- use enumerate instead of range loop
|
2016-05-20 10:11:05 +02:00 |
|
Matthew Honnibal
|
02276cc444
|
Merge branch 'master' of ssh://github.com/spacy-io/spaCy
|
2016-05-17 16:56:22 +02:00 |
|
Matthew Honnibal
|
4d7f5468bb
|
* Change Language class to use a .pipeline attribute, instead of having the pipeline hard coded
|
2016-05-17 16:55:42 +02:00 |
|
Matthew Honnibal
|
2d25339c47
|
Merge pull request #386 from daylen/master
Fix get_lang_class parsing (take 2)
|
2016-05-17 23:15:47 +10:00 |
|
Daylen Yang
|
5405e7dd73
|
Fix get_lang_class parsing (take 2)
|
2016-05-16 16:40:31 -07:00 |
|
Matthew Honnibal
|
88538b339e
|
Merge pull request #385 from spacy-io/revert-384-master
Revert "Fix get_lang_class parsing"
|
2016-05-17 08:04:44 +10:00 |
|
Matthew Honnibal
|
b240104f40
|
Revert "Fix get_lang_class parsing"
|
2016-05-17 08:04:26 +10:00 |
|
Matthew Honnibal
|
9bd3c316c9
|
Merge pull request #384 from daylen/master
Fix get_lang_class parsing
|
2016-05-17 07:52:22 +10:00 |
|
Daylen Yang
|
bffbe9b9d0
|
Merge pull request #1 from daylen/fixed_get_lang_class_parse
Fix get_lang_class parsing
|
2016-05-16 14:40:20 -07:00 |
|
Daylen Yang
|
1692c2df3c
|
Fix get_lang_class parsing
We want the get_lang_class to return "en" for both "en" and "en_glove_cc_300_1m_vectors". Changed the split rule to "_" so that this happens.
|
2016-05-16 14:38:20 -07:00 |
|
Matthew Honnibal
|
17137f5c0c
|
* Fix issue #372: mistake in Lexeme rich comparison
|
2016-05-12 12:58:57 +02:00 |
|
Matthew Honnibal
|
cc8bf62208
|
* Fix Issue #360: Tokenizer failed when the infix regex matched the start of the string while trying to tokenize multi-infix tokens.
|
2016-05-09 13:23:47 +02:00 |
|
Matthew Honnibal
|
eab2376547
|
* Allow longer ellipses to be treated as a single token, e.g. Hello......there
|
2016-05-09 13:22:53 +02:00 |
|
Matthew Honnibal
|
c61ee8f9fa
|
* Increment version
|
2016-05-09 13:20:00 +02:00 |
|
Matthew Honnibal
|
f6ef64f02c
|
* Update changelog in preparation for 0.101.0 release
|
2016-05-09 12:57:07 +02:00 |
|
Matthew Honnibal
|
5d86c30f0b
|
* Fix Issue #367: Missing has_vector property on Doc and Span objects
|
2016-05-09 12:36:14 +02:00 |
|
Wolfgang Seeker
|
7b78239436
|
add fix for German noun chunk iterator (issue #365)
|
2016-05-06 01:41:26 +02:00 |
|
Matthew Honnibal
|
8c0888d6cb
|
* Fix error in span.sent
|
2016-05-06 00:28:05 +02:00 |
|
Matthew Honnibal
|
bb94022975
|
* Fix Issue #365: Error introduced during noun phrase chunking, due to use of corrected PRON/PROPN/etc tags.
|
2016-05-06 00:21:05 +02:00 |
|
Matthew Honnibal
|
41342ca79b
|
Merge branch 'master' of ssh://github.com/spacy-io/spaCy
|
2016-05-06 00:17:58 +02:00 |
|
Matthew Honnibal
|
26095f9722
|
* Add span.sent property, re Issue #366
|
2016-05-06 00:17:38 +02:00 |
|
Wolfgang Seeker
|
dbf8f5f3ec
|
fix bug in StateC.set_break()
|
2016-05-05 15:15:34 +02:00 |
|
Wolfgang Seeker
|
3c44b5dc1a
|
call deprojectivization after parsing
|
2016-05-05 15:10:36 +02:00 |
|
Matthew Honnibal
|
472f576b82
|
* Deprojectivize German parses
|
2016-05-05 15:01:10 +02:00 |
|
Matthew Honnibal
|
9bbd6cf031
|
* Work on Chinese support
|
2016-05-05 11:39:12 +02:00 |
|
Matthew Honnibal
|
a6a25166ba
|
* Remove print from test
|
2016-05-05 11:10:59 +02:00 |
|
Matthew Honnibal
|
c4c55d9005
|
Merge branch 'master' of ssh://github.com/spacy-io/spaCy
|
2016-05-05 01:33:36 +02:00 |
|
Matthew Honnibal
|
e31df66d26
|
* Fix Issue #361: Lexemes didn't have rich comparison.
|
2016-05-05 01:32:26 +02:00 |
|
Matthew Honnibal
|
7441ca30ee
|
* Add tests for Issue #361: Lexeme rich comparison
|
2016-05-05 01:31:58 +02:00 |
|
Matthew Honnibal
|
02d0fe242c
|
Make latest release note the end of the readme
|
2016-05-05 00:26:16 +10:00 |
|
Matthew Honnibal
|
4f46c0f398
|
Fix code format in README.rst
|
2016-05-05 00:25:19 +10:00 |
|
Matthew Honnibal
|
886bf55bd9
|
Fix list formatting
|
2016-05-05 00:18:25 +10:00 |
|
Matthew Honnibal
|
1b8b888a57
|
Update readme with release notes for v0.100.8
|
2016-05-05 00:16:13 +10:00 |
|
Matthew Honnibal
|
72564213e3
|
* Add test for Issue #309
|
2016-05-04 16:00:28 +02:00 |
|
Matthew Honnibal
|
76f1d871da
|
Merge branch 'master' of ssh://github.com/spacy-io/spaCy
|
2016-05-04 15:54:00 +02:00 |
|
Matthew Honnibal
|
519366f677
|
* Fix Issue #351: Indices off when leading whitespace
|
2016-05-04 15:53:36 +02:00 |
|