Matthew Honnibal
9c73983bdd
* Add test for hyphenation problem in Issue #302
2016-03-29 14:27:13 +11:00
Matthew Honnibal
d249e2f7f3
* Improve error message in bin/parser/train.py
2016-03-29 13:04:33 +11:00
Matthew Honnibal
910a6c805f
* Add infix rule for double hyphens, re Issue #302
2016-03-29 13:03:44 +11:00
Matthew Honnibal
ad119c074f
* Fix incorrect whitespacing in Doc.text. This change is potentially breaking, to anyone who was relying on the previous incorrect semantics.
2016-03-29 13:02:42 +11:00
Matthew Honnibal
a862edc0e6
Merge pull request #296 from elyase/patch-2
...
make use of log_smooth_count
2016-03-19 06:50:30 +11:00
Yaser Martinez Palenzuela
3c210f45fa
make use of log_smooth_count
2016-03-17 12:19:52 +01:00
Matthew Honnibal
80134eb12d
Merge branch 'master' of https://github.com/spacy-io/spaCy
2016-03-15 19:14:50 +00:00
Matthew Honnibal
eaccbcda0f
Fix bug in pos_tag.py script
2016-03-16 06:04:14 +11:00
Henning Peters
8f870854c4
move bootstrap script to gist
2016-03-14 11:32:20 +01:00
Henning Peters
c12d3dd200
add __init__.py to empty package dirs
2016-03-14 11:28:03 +01:00
Henning Peters
54f3447b5f
cleanup
2016-03-14 01:46:33 +01:00
Henning Peters
8ef5b6e126
cleanup
2016-03-13 19:52:13 +01:00
Henning Peters
1fe29c6919
cleanup
2016-03-13 18:12:32 +01:00
Henning Peters
9f628688ce
cleanup
2016-03-12 14:31:39 +01:00
Henning Peters
49f499ca1c
cleanup
2016-03-12 14:30:24 +01:00
Henning Peters
5701686272
cleanup
2016-03-12 13:47:10 +01:00
Matthew Honnibal
b37571063a
Merge pull request #286 from gushecht/patch-2
...
added batch_size as keyword argument
2016-03-11 09:46:36 +11:00
Gus Hecht
feefe64ab2
added batch_size as keyword argument
...
There's probably a better default value....
2016-03-10 14:16:34 -08:00
Matthew Honnibal
1508528c8c
* Increment version
2016-03-08 15:58:45 +00:00
Matthew Honnibal
963fe5258e
* Add missing __contains__ method to vocab
2016-03-08 15:49:10 +00:00
Matthew Honnibal
478aa21cb0
* Remove broken __reduce__ method on vocab
2016-03-08 15:48:21 +00:00
Matthew Honnibal
20235bde00
Merge pull request #282 from henningpeters/switch_vectors
...
initial proposal for ability to switch vectors
2016-03-09 01:39:41 +11:00
Henning Peters
5b3b3ebc8e
upgrade to latest sputnik
2016-03-08 15:30:17 +01:00
Henning Peters
eb7ae61b1c
cleanup api
2016-03-08 12:59:18 +01:00
Henning Peters
b740f20191
hash_string() should not depend on python's internal unicode representation, also fixes https://github.com/spacy-io/sense2vec/issues/5 for py2
2016-03-06 09:19:27 +01:00
Henning Peters
aa4d964c14
cleanup api
2016-03-05 17:51:32 +01:00
Henning Peters
931c07a609
initial proposal for separate vector package
2016-03-04 11:09:06 +01:00
Wolfgang Seeker
7adbd7a785
replace Counter with normal dict
2016-03-03 21:36:27 +01:00
Wolfgang Seeker
1ae487a4f6
add backwards compatibility with python 2.6
2016-03-03 21:18:12 +01:00
Wolfgang Seeker
9d1e6de4a0
make a proper list from zip iterator
2016-03-03 19:51:01 +01:00
Wolfgang Seeker
49f9d1c085
change test_nonproj.py to not use zip inside numpy.asarray
2016-03-03 19:42:09 +01:00
Wolfgang Seeker
72b8df0684
turned PseudoProjectivity into a normal python class
2016-03-03 19:05:08 +01:00
Matthew Honnibal
fcaa0ad7ce
Merge pull request #280 from wbwseeker/german_parser
...
German parser
2016-03-04 03:27:42 +11:00
Wolfgang Seeker
690c5acabf
adjust train.py to train both english and german models
2016-03-03 15:21:00 +01:00
Matthew Honnibal
9d51e4d13c
Delete gather_freqs.py
...
This script was in a broken state, and should be unnecessary. The functionality is subsumed by `get_freqs.py`
2016-03-02 00:42:55 +11:00
Matthew Honnibal
ae2b479312
Merge pull request #278 from elyase/patch-1
...
replace codecs.open with io.open
2016-03-02 00:41:23 +11:00
Yaser Martinez Palenzuela
1a93d7f725
replace codecs.open with io.open
2016-03-01 14:10:11 +01:00
Wolfgang Seeker
3448cb40a4
integrated pseudo-projective parsing into parser
...
- nonproj.pyx holds a class PseudoProjectivity which currently holds
all functionality to implement Nivre & Nilsson 2005's pseudo-projective
parsing using the HEAD decoration scheme
- changed lefts/rights in Token to account for possible non-projective
structures
2016-03-01 10:09:08 +01:00
Henning Peters
ee4c4f6a77
add run section to buildbot.json
2016-02-26 23:04:33 +01:00
Henning Peters
d3a65ef261
fix wheel build/test
2016-02-26 20:47:35 +01:00
Wolfgang Seeker
56b7210e82
moved nonproj.py to syntax/nonproj.pyx
2016-02-25 15:08:49 +01:00
Henning Peters
304e27624f
run tests for wheels
2016-02-24 20:21:40 +01:00
Henning Peters
4d375afb91
run tests for wheels
2016-02-24 19:59:08 +01:00
Henning Peters
f3df736e0a
remove unidecode-related test
2016-02-24 18:22:22 +01:00
Matthew Honnibal
1ba31f6229
Merge pull request #275 from henningpeters/unidecode
...
remove text-unidecode dependency
2016-02-25 04:10:45 +11:00
Wolfgang Seeker
4b2297d5d4
add class PseudoProjective for pseudo-projective parsing
...
PseudoProjective() implements the algorithm from Nivre & Nilsson 2005
using their HEAD decoration scheme.
2016-02-24 11:26:25 +01:00
Henning Peters
12d58a7099
remove text-unidecode dependency
2016-02-24 08:01:59 +01:00
Henning Peters
63deae47fe
Update buildbot.json
2016-02-23 13:36:04 +01:00
Wolfgang Seeker
8d531c958b
replace tests for non-projectivity
...
- add functions to find non-projective edges
- add test file for non-projectivity functions
2016-02-22 14:40:40 +01:00
Henning Peters
dfd1a1d3a2
Update buildbot.json
2016-02-22 06:13:09 +01:00