Henning Peters
|
9d8966a2c0
|
Update test_tokenizer.py
|
2016-02-10 19:24:37 +01:00 |
|
Henning Peters
|
3b5f1e753b
|
py26 compatibility
|
2016-02-10 14:32:54 +01:00 |
|
Henning Peters
|
ee1f1ac300
|
mark test_sentence_space() as model test
|
2016-02-10 07:49:11 +01:00 |
|
Matthew Honnibal
|
5d96b3ef4f
|
* Increment version
|
2016-02-07 13:48:58 +01:00 |
|
Matthew Honnibal
|
1b83cb9dfa
|
* Fix Issue #251: Incorrect right edge calculation on left-clobber low in the tree
|
2016-02-07 00:00:42 +01:00 |
|
Matthew Honnibal
|
c6623889c1
|
* Add test for Issue #251: Incorrect right edges, caused by bad update to r_edge in del_arc, triggered from non-monotonic left-arc
|
2016-02-06 23:47:51 +01:00 |
|
Matthew Honnibal
|
a95974ad3f
|
* Fix oov probability
|
2016-02-06 15:13:55 +01:00 |
|
Matthew Honnibal
|
af8514cb0c
|
* Refine the way the is_parsed attribute is set by from_array
|
2016-02-06 14:44:35 +01:00 |
|
Matthew Honnibal
|
161b01d4c0
|
* Tweak usage example for multi-processing
|
2016-02-06 14:44:11 +01:00 |
|
Matthew Honnibal
|
7f24229f10
|
* Don't try to pickle the tokenizer
|
2016-02-06 14:09:05 +01:00 |
|
Matthew Honnibal
|
dcb401f3e1
|
* Remove broken Vocab pickling
|
2016-02-06 14:08:47 +01:00 |
|
Matthew Honnibal
|
e66d45bf66
|
* Restore previous patch to Span.root, as it seems it wasn't the cause of the problem.
|
2016-02-06 13:37:41 +01:00 |
|
Matthew Honnibal
|
4412a70dc5
|
* Initialize StateC._empty_token to 0, to avoid undefined behaviour.
|
2016-02-06 13:34:38 +01:00 |
|
Matthew Honnibal
|
1b41f868d2
|
* Check for errors in parser, and parallelise the left-over batch
|
2016-02-06 10:06:30 +01:00 |
|
Matthew Honnibal
|
031b00cb91
|
* Fix Span.root calculation
|
2016-02-05 20:12:09 +01:00 |
|
Matthew Honnibal
|
165ca28b80
|
* Set is_parsed flag in Parser.pipe
|
2016-02-05 19:51:44 +01:00 |
|
Matthew Honnibal
|
bdd579db0a
|
* Set is_parsed flag in Parser.pipe
|
2016-02-05 19:50:11 +01:00 |
|
Matthew Honnibal
|
7119e77fb6
|
* Fix Matcher.pipe
|
2016-02-05 19:46:02 +01:00 |
|
Matthew Honnibal
|
1cf0100bf6
|
* Add test for multithreading
|
2016-02-05 19:38:22 +01:00 |
|
Matthew Honnibal
|
b04c9aad71
|
* Fix off-by-one in Parser.pipe
|
2016-02-05 19:37:50 +01:00 |
|
Matthew Honnibal
|
e5c447e237
|
* Questionable fix to problem in Span.root
|
2016-02-05 19:18:35 +01:00 |
|
Matthew Honnibal
|
1ef84a0557
|
* Merge master into rethinc2
|
2016-02-05 12:55:59 +01:00 |
|
Matthew Honnibal
|
4cf34fc170
|
Merge branch 'rethinc2' of ssh://github.com/honnibal/spaCy into rethinc2
|
2016-02-05 12:48:28 +01:00 |
|
Matthew Honnibal
|
249dccbe95
|
* Fix Language.pipe
|
2016-02-05 12:47:57 +01:00 |
|
Matthew Honnibal
|
c0e63feccc
|
* xfail pickle tests
|
2016-02-05 12:46:58 +01:00 |
|
Matthew Honnibal
|
6aa92b70f1
|
* Fix merge problem in span
|
2016-02-05 12:46:11 +01:00 |
|
Matthew Honnibal
|
048dfe35aa
|
* cimport cython.parallel
|
2016-02-05 12:20:42 +01:00 |
|
Matthew Honnibal
|
af58f273b3
|
* Fix spacy.language.pipe
|
2016-02-05 12:20:29 +01:00 |
|
Matthew Honnibal
|
8a13cebdcc
|
* Update for modified thinc interface
|
2016-02-05 11:44:39 +01:00 |
|
Matthew Honnibal
|
48ce09687d
|
* Skip pickling the vocab in the tests
|
2016-02-04 15:51:19 +01:00 |
|
Matthew Honnibal
|
419edfab50
|
* Use generic flags for the new attributes until they're added
|
2016-02-04 15:50:54 +01:00 |
|
Matthew Honnibal
|
c4017a06d9
|
* Add placeholders for the new flags in attrs and symbols
|
2016-02-04 15:49:45 +01:00 |
|
Matthew Honnibal
|
e5c96c969f
|
* Wire up new attributes
|
2016-02-04 13:04:58 +01:00 |
|
Matthew Honnibal
|
9703ccc3de
|
* Remove unused import
|
2016-02-04 13:04:33 +01:00 |
|
Matthew Honnibal
|
11810be33e
|
* Add Python hooks for is_bracket/is_quote/is_left_punct/is_right_punct
|
2016-02-04 13:04:16 +01:00 |
|
Matthew Honnibal
|
fe611132f0
|
* Add stubs for is_bracket/is_quote/is_left_punct/is_right_punct functions
|
2016-02-04 13:03:04 +01:00 |
|
Matthew Honnibal
|
ee975d36d0
|
* Add stubs to test is_bracket/is_quote/is_left_punct/is_right_punct functions
|
2016-02-04 13:02:25 +01:00 |
|
Matthew Honnibal
|
f9e765cae7
|
* Add pipe() method to tokenizer
|
2016-02-03 02:32:37 +01:00 |
|
Matthew Honnibal
|
4cbad510ff
|
* Fix calculation of head for spans with punctuation.
|
2016-02-03 02:32:21 +01:00 |
|
Matthew Honnibal
|
84b247ef83
|
* Add a .pipe method, that takes a stream of input, operates on it, and streams the output. Internally, the stream may be buffered, to allow multi-threading.
|
2016-02-03 02:10:58 +01:00 |
|
Matthew Honnibal
|
fcfc17a164
|
Merge branch 'master' into rethinc2
|
2016-02-02 23:05:34 +01:00 |
|
Matthew Honnibal
|
f204daf27b
|
* Add error warning that a gold tag is unrecognised
|
2016-02-02 22:59:59 +01:00 |
|
Matthew Honnibal
|
99b8906100
|
* Accept punct_labels as an argument to the scorer
|
2016-02-02 22:59:06 +01:00 |
|
Matthew Honnibal
|
59123443e2
|
* Check for presence/absence of the different models in Language.end_training
|
2016-02-02 22:49:55 +01:00 |
|
Matthew Honnibal
|
9e9d4c8706
|
* Fix stupid error in Language.batch
|
2016-02-01 09:49:32 +01:00 |
|
Matthew Honnibal
|
e3db39dd21
|
* Fix compiler warning about signed/unsigned comparison
|
2016-02-01 09:08:07 +01:00 |
|
Matthew Honnibal
|
98fbdf2856
|
* Add Language.batch() method, to support multi-threaded jobs
|
2016-02-01 09:01:13 +01:00 |
|
Matthew Honnibal
|
b3802562d6
|
Merge branch 'rethinc2' of https://github.com/honnibal/spaCy into rethinc2
|
2016-02-01 08:59:24 +01:00 |
|
Matthew Honnibal
|
4b08a3fafd
|
* Fix merge conflict
|
2016-02-01 08:58:18 +01:00 |
|
Matthew Honnibal
|
5188f6d9d8
|
* Fix parseC function
|
2016-02-01 08:48:48 +01:00 |
|