Matthew Honnibal
6076213c16
* Fix init_model script
2015-07-25 22:35:52 +02:00
Matthew Honnibal
1a99eb69da
Merge branch 'master' of https://github.com/honnibal/spaCy
2015-07-25 22:19:48 +02:00
Matthew Honnibal
ef448649b3
* Add read_freqs function in init_model
2015-07-25 22:16:36 +02:00
Matthew Honnibal
2e6a60eaec
Merge branch 'master' of https://github.com/honnibal/spaCy
2015-07-25 21:14:07 +02:00
Matthew Honnibal
105305b4aa
* Upd get_freqs script
2015-07-25 21:13:41 +02:00
Matthew Honnibal
616445e027
* Add simple script to collate frequencies from sorted file
2015-07-25 21:12:45 +02:00
Matthew Honnibal
823ef4a00b
* Remove profile declarations
2015-07-25 18:13:06 +02:00
Matthew Honnibal
f4809e562f
* Allow json to be used as a fallback if ujson is not available
2015-07-25 18:11:36 +02:00
Matthew Honnibal
9da06671cf
* Remove unused import
2015-07-25 18:11:16 +02:00
Matthew Honnibal
2060935cdb
* Remove explicit bytes type in doc.from_bytes, to accept bytearray
2015-07-24 04:54:13 +02:00
Matthew Honnibal
aa28e2e01d
* Release the GIL around parse function
2015-07-24 04:53:27 +02:00
Matthew Honnibal
287d90e792
* Use thinc 3.3
2015-07-24 04:52:50 +02:00
Matthew Honnibal
c52179f5fa
* Use print function in train.py, for py 2/3 compatibility
2015-07-24 04:52:35 +02:00
Matthew Honnibal
d62eb34b76
* More Py 2/3 compatibility in bit strings
2015-07-24 04:52:06 +02:00
Matthew Honnibal
6d0cdb1630
* Py 2/3 compatibility of serialize tests
2015-07-24 04:51:53 +02:00
Matthew Honnibal
0bb839d299
* Fix string coercion for Python 3
2015-07-24 03:49:30 +02:00
Matthew Honnibal
c4ff410fdb
* Fix bytes problems for Python3
2015-07-24 03:48:23 +02:00
Matthew Honnibal
ce984f471c
* Update tests for python3
2015-07-24 03:47:59 +02:00
Matthew Honnibal
1ab25e4dad
* Fix python3 type error
2015-07-24 02:45:34 +02:00
Matthew Honnibal
35a9fb437f
* Fix travis.yml
2015-07-24 01:43:27 +02:00
Matthew Honnibal
f35ff173b0
* Fix bits.pyx unicode error
2015-07-23 20:37:57 +02:00
Matthew Honnibal
1406e24327
* Fix unicode error for Python3
2015-07-23 19:36:21 +02:00
Matthew Honnibal
dbda6c27fa
* Fix python3 error
2015-07-23 14:52:30 +02:00
Matthew Honnibal
99387f9572
* Fix python3 error
2015-07-23 14:30:29 +02:00
Matthew Honnibal
b81ffe9032
* Fix typing on mode string in CFile
2015-07-23 13:24:43 +02:00
Matthew Honnibal
22028602a9
* Add unicode_literals declaration in vocab.pyx
2015-07-23 13:24:20 +02:00
Matthew Honnibal
6be3ee311c
Py3 compatibility tweak
2015-07-23 13:13:15 +02:00
Matthew Honnibal
d4407d8e2f
Py3 compatibility tweak
2015-07-23 09:45:15 +02:00
Matthew Honnibal
8d7e044e2c
* Add clusters file
2015-07-23 09:35:56 +02:00
Matthew Honnibal
5b41744270
* Check for directory presence before loading annotators
2015-07-23 09:27:37 +02:00
Matthew Honnibal
da4821fc14
* Add cluster words to probs in init_model
2015-07-23 09:27:07 +02:00
Matthew Honnibal
bee2e77983
* Update tests
2015-07-23 09:26:43 +02:00
Matthew Honnibal
4af2595d99
* Fix structure of wordnet directory for init_model
2015-07-23 06:35:38 +02:00
Matthew Honnibal
16617142b7
* Fix travis.yml
2015-07-23 06:15:01 +02:00
Matthew Honnibal
24b98bc1c8
* Fix travis.yml
2015-07-23 06:06:13 +02:00
Matthew Honnibal
1364d9c74e
* Fix travis.yml
2015-07-23 05:58:26 +02:00
Matthew Honnibal
2c1aae3cfb
* Fix travis.yml
2015-07-23 05:30:33 +02:00
Matthew Honnibal
16d0e1f598
* Fix travis.yml
2015-07-23 04:41:28 +02:00
Matthew Honnibal
0c3caae085
* Fix travis.yml
2015-07-23 04:13:54 +02:00
Matthew Honnibal
58e6adfacd
* Fix travis.yml
2015-07-23 04:05:51 +02:00
Matthew Honnibal
c4018d6827
* Fix travis.yml
2015-07-23 03:34:23 +02:00
Matthew Honnibal
2fb4e932a3
* Fix travis.yml
2015-07-23 03:16:49 +02:00
Matthew Honnibal
fc0e964a5c
* Fix travis.yml
2015-07-23 03:00:17 +02:00
Matthew Honnibal
af5bab55b3
* Update travis.yml
2015-07-23 02:52:06 +02:00
Matthew Honnibal
082ae5aaf7
* Fix travis.yml
2015-07-23 02:38:14 +02:00
Matthew Honnibal
247e592c2c
* Update travis.yml
2015-07-23 02:32:47 +02:00
Matthew Honnibal
83c0f0da22
* Remove lemmatizer from init_model
2015-07-23 02:32:34 +02:00
Matthew Honnibal
df01a88763
Merge branch 'refactor' (and serializaton)
...
Add Huffman-code serialization, and do a lot of
refactoring. Highlights include:
* Much more efficient StringStore
* Vocab maintains a by-orth mapping of Lexemes
* Avoid manually slicing Py_UNICODE buffers,
simplifying tokenizer and vocab C APIs
* Remove various bits of dead code
* Work on removing GIL around parser
* Work on bridge to Theano
Conflicts:
spacy/strings.pxd
spacy/strings.pyx
spacy/structs.pxd
2015-07-23 02:18:35 +02:00
Matthew Honnibal
4970ebe887
* Add test_io
2015-07-23 01:19:59 +02:00
Matthew Honnibal
4729200dfc
* Whitespace
2015-07-23 01:19:26 +02:00