Commit Graph

2186 Commits

Author SHA1 Message Date
Matthew Honnibal
b866f1443e Merge branch 'master' of https://github.com/honnibal/spaCy into attrs 2015-10-13 04:52:27 +02:00
Matthew Honnibal
6c2da06c18 * Package tag_map.json 2015-10-13 13:52:10 +11:00
Matthew Honnibal
e886e6a406 * Inc version 2015-10-13 13:46:17 +11:00
Matthew Honnibal
20fd36a0f7 * Very scrappy, likely buggy first-cut pickle implementation, to work on Issue #125: allow pickle for Apache Spark. The current implementation sends stuff to temp files, and does almost nothing to ensure all modifiable state is actually preserved. The Language() instance is a deep tree of extension objects, and if pickling during training, some of the C-data state is hard to preserve. 2015-10-13 13:44:41 +11:00
Matthew Honnibal
f8de403483 * Work on pickling Vocab instances. The current implementation is not correct, but it may serve to see whether this approach is workable. Pickling is necessary to address Issue #125 2015-10-13 13:44:41 +11:00
Matthew Honnibal
85e7944572 * Start trying to pickle Vocab 2015-10-13 13:44:41 +11:00
Matthew Honnibal
5ca57bd859 * Ensure Morphology can be pickled, to address Issue #125. 2015-10-13 13:44:41 +11:00
Matthew Honnibal
dfe0ad51ff * Add pickle test for lemmatizer 2015-10-13 13:44:41 +11:00
Matthew Honnibal
0cee928467 * Allow StringStore to be pickled, to start addressing Issue #125 2015-10-13 13:44:41 +11:00
Matthew Honnibal
41012907a8 * Fix variable name 2015-10-13 13:44:40 +11:00
Matthew Honnibal
e70368d157 * Use lower case strings for dependency label names in symbols enum 2015-10-13 13:44:40 +11:00
Matthew Honnibal
7b4af3d1e7 * Fix parts_of_speech now that symbols list has been reformed 2015-10-13 13:44:40 +11:00
Matthew Honnibal
37b909b6b6 * Use the symbols file in vocab instead of the symbols subfiles like attrs.pxd 2015-10-13 13:44:40 +11:00
Matthew Honnibal
ce65ec698c * Remove qualified naming in symbols 2015-10-13 13:44:40 +11:00
Matthew Honnibal
9f4be0adcd * Map NO_TAG to NIL in parts_of_speech.pxd 2015-10-13 13:44:40 +11:00
Matthew Honnibal
278e12f7e8 * Addmorphology symbols to morphology. May need to remove these as an enum. 2015-10-13 13:44:40 +11:00
Matthew Honnibal
d80067eda1 * Map empty string to NULL_ATTR in attrs 2015-10-13 13:44:40 +11:00
Matthew Honnibal
fd204d3cd5 * Map NIL to empty string in tag map 2015-10-13 13:44:40 +11:00
Matthew Honnibal
d70e8cac2c * Fix empty values in attributes and parts of speech, so symbols align correctly with the StringStore 2015-10-13 13:44:40 +11:00
Matthew Honnibal
ce3e306376 * Allow SPACY_DATA environment variable in website tests 2015-10-13 13:44:40 +11:00
Matthew Honnibal
a29c8ee23d * Add symbols to the vocab before reading the strings, so that they line up correctly 2015-10-13 13:44:39 +11:00
Matthew Honnibal
74c0853471 * Rename ATTR_IDS to attrs.IDS. Rename ATTR_NAMES to attrs.NAMES. Rename UNIV_POS_IDS to parts_of_speech.IDS 2015-10-13 13:44:39 +11:00
Matthew Honnibal
10a4a843ea * Enumerate all symbols in one file 2015-10-13 13:44:39 +11:00
Matthew Honnibal
5c24ad3f5c * Whitespace 2015-10-13 13:44:39 +11:00
Matthew Honnibal
85ce36ab11 * Refactor symbols, so that frequency rank can be derived from the orth id of a word. 2015-10-13 13:44:39 +11:00
maxirmx
bf963c3cce Merging Windows\Linux versions of setup.py
Python 3.0 compatibility fix
2015-10-13 02:11:21 +03:00
maxirmx
ccf6156261 Merging Windows\Linux versions of setup.py #2 2015-10-13 01:46:52 +03:00
maxirmx
7c5bfc5916 Merging Windows/Linux versions of setup.py 2015-10-13 01:31:59 +03:00
maxirmx
8a95c8b1f2 Appveyor clen build #4 2015-10-12 23:22:00 +03:00
maxirmx
3069929c25 Appveyor clean build #3 2015-10-12 23:18:42 +03:00
maxirmx
ec8e622cec Appveyor clean build #2 2015-10-12 23:05:47 +03:00
maxirmx
298769cdf3 Appveyor clean build
Cleaned temporary stuff
2015-10-12 23:01:37 +03:00
maxirmx
26b8dd47db Appveyor build
Yet another stdint.h download attempt
2015-10-12 01:10:30 +03:00
maxirmx
fc68d1c51c Appveyor build
+ setup.py
2015-10-12 01:04:21 +03:00
maxirmx
c14c6f5579 Appveyour build
stdint.h download fix
2015-10-12 00:58:03 +03:00
maxirmx
ded37538ce Appveyor build
stdint.h download fix
2015-10-12 00:51:33 +03:00
maxirmx
91ec215590 Appveyor build
stdint.h download fix
2015-10-12 00:50:34 +03:00
maxirmx
b73c7ab45e Appveyor build
+ stdint.h
2015-10-12 00:43:31 +03:00
maxirmx
c1ca220769 Appveyor build
+ push preshed v. 0.42
2015-10-11 23:58:41 +03:00
maxirmx
a0ffb856da Appveyor build
+ requirements.txt
2015-10-11 23:55:38 +03:00
maxirmx
8a91527952 Appveyor 2015-10-11 23:50:06 +03:00
maxirmx
deacdfd647 Appveyor - Removed noise 2015-10-11 23:49:02 +03:00
maxirmx
f5ebbbde80 Appveyour build
+ Setuptools
+ Cython
2015-10-11 23:47:12 +03:00
maxirmx
7587f6170f Added Pythhon3 environment configuration batch 2015-10-11 20:28:13 +03:00
maxirmx
5aeb8ac4e0 Added appveyor Python installation script 2015-10-11 20:25:42 +03:00
Matthew Honnibal
3b79d67462 * Fix assertion in test_basic_create 2015-10-12 00:48:18 +11:00
maxirmx
aab71668cd Merge remote-tracking branch 'refs/remotes/honnibal/master' 2015-10-11 16:47:49 +03:00
Matthew Honnibal
afec8cac20 * Add more tests to probe mingw32 failure 2015-10-11 22:40:04 +11:00
maxirmx
356181944a Merge remote-tracking branch 'refs/remotes/honnibal/master' 2015-10-11 13:38:03 +03:00
Matthew Honnibal
dba1daf597 * Add script to test loading different components 2015-10-11 19:46:53 +11:00