Commit Graph

19 Commits

Author SHA1 Message Date
Matthew Honnibal
c772df2f0c * Fix StringIO for Python3 tests 2015-10-13 20:50:07 +11:00
Matthew Honnibal
719221a215 * Fix StringIO for Python 3 2015-10-13 20:20:25 +11:00
Matthew Honnibal
f6d74b14de * Merge 2015-10-13 05:25:49 +02:00
Matthew Honnibal
20fd36a0f7 * Very scrappy, likely buggy first-cut pickle implementation, to work on Issue #125: allow pickle for Apache Spark. The current implementation sends stuff to temp files, and does almost nothing to ensure all modifiable state is actually preserved. The Language() instance is a deep tree of extension objects, and if pickling during training, some of the C-data state is hard to preserve. 2015-10-13 13:44:41 +11:00
Matthew Honnibal
f8de403483 * Work on pickling Vocab instances. The current implementation is not correct, but it may serve to see whether this approach is workable. Pickling is necessary to address Issue #125 2015-10-13 13:44:41 +11:00
Matthew Honnibal
0cee928467 * Allow StringStore to be pickled, to start addressing Issue #125 2015-10-13 13:44:41 +11:00
Matthew Honnibal
d70e8cac2c * Fix empty values in attributes and parts of speech, so symbols align correctly with the StringStore 2015-10-13 13:44:40 +11:00
Matthew Honnibal
20e909d2bb * Fix empty values in attributes and parts of speech, so symbols align correctly with the StringStore 2015-10-10 18:27:03 +11:00
Matthew Honnibal
9ff288c7bb * Update tests, after removal of spacy.en.attrs 2015-10-09 13:37:25 +11:00
Matthew Honnibal
ce984f471c * Update tests for python3 2015-07-24 03:47:59 +02:00
Matthew Honnibal
1c9ea7b835 * Add tests for short string optimization 2015-07-20 12:05:45 +02:00
Matthew Honnibal
4dddc8a69b * Fix type declarations for attr_t. Remove unused id_t. 2015-07-18 22:39:57 +02:00
Matthew Honnibal
68374149ae * Move huffman encoding test to tests/serialize directory 2015-07-17 21:22:18 +02:00
Matthew Honnibal
db9dfd2e23 * Major refactor of serialization. Nearly complete now. 2015-07-17 01:27:54 +02:00
Matthew Honnibal
5b0a7190c9 * Round-trip for serialization finally working. Needs a lot of optimization. 2015-07-13 18:39:38 +02:00
Matthew Honnibal
0628e0e2a8 * Add tests for huffman encoding 2015-07-13 12:58:07 +02:00
Matthew Honnibal
e3af6af83c * Add tests/vocab/conftest.py 2015-06-07 18:02:47 +02:00
Matthew Honnibal
88041f69d1 * More work on reorganising tests, using conftest.py 2015-06-07 18:02:24 +02:00
Matthew Honnibal
d37dca72dd * Reorganize tests 2015-06-07 16:49:46 +02:00