Commit Graph

261 Commits

Author SHA1 Message Date
Matthew Honnibal
5bc0e83f9a * Reimplement matching in Cython, instead of Python. 2015-08-05 01:05:54 +02:00
Matthew Honnibal
4c87a696b3 * Add draft dfa matcher, in Python. Passing tests. 2015-08-04 15:55:28 +02:00
Matthew Honnibal
d17a15ae66 * Add test to check parse is being deserialized properly 2015-07-28 21:04:00 +02:00
Matthew Honnibal
9034f8a1cf * Update test_docs 2015-07-27 22:15:19 +02:00
Matthew Honnibal
6deb1e84b6 * Upd serialization tests 2015-07-27 21:25:48 +02:00
Matthew Honnibal
0f4d0d51ab * Test is_oov property 2015-07-27 01:50:34 +02:00
Matthew Honnibal
b997b1122b * Mark test_io as requiring the model 2015-07-26 21:36:22 +02:00
Matthew Honnibal
5c9b8d05e4 * Upd test_docs 2015-07-26 17:41:13 +02:00
Matthew Honnibal
609f729cc5 * Fix infix test 2015-07-26 17:32:55 +02:00
Matthew Honnibal
460b4c3207 * Add more infix tests 2015-07-26 17:30:34 +02:00
Matthew Honnibal
48a4d15264 * Test token properties 2015-07-26 16:37:39 +02:00
Matthew Honnibal
847c08e411 * Unhack serialization api tests 2015-07-26 16:23:41 +02:00
Matthew Honnibal
3a4c2a3276 * Update doctests 2015-07-26 13:04:18 +02:00
Matthew Honnibal
2b2032d1a0 * Update doctests 2015-07-26 12:57:59 +02:00
Matthew Honnibal
6d0cdb1630 * Py 2/3 compatibility of serialize tests 2015-07-24 04:51:53 +02:00
Matthew Honnibal
ce984f471c * Update tests for python3 2015-07-24 03:47:59 +02:00
Matthew Honnibal
bee2e77983 * Update tests 2015-07-23 09:26:43 +02:00
Matthew Honnibal
4970ebe887 * Add test_io 2015-07-23 01:19:59 +02:00
Matthew Honnibal
3a5299faec * Upd tests 2015-07-23 01:19:11 +02:00
Matthew Honnibal
20c2db08b7 * Add flags to pytest to tests requiring models, vectors or slow functions to be toggled. 2015-07-23 01:19:03 +02:00
Matthew Honnibal
1c9ea7b835 * Add tests for short string optimization 2015-07-20 12:05:45 +02:00
Matthew Honnibal
f13d5dae91 * Update test_packer 2015-07-20 01:38:29 +02:00
Matthew Honnibal
fb7202a173 * Update test_codecs 2015-07-20 01:38:15 +02:00
Matthew Honnibal
879ef9fa3e * Update tests for huffman codec 2015-07-19 17:59:51 +02:00
Matthew Honnibal
317cbbc015 * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. 2015-07-19 15:18:17 +02:00
Matthew Honnibal
0973e2f107 * Update serializer tests 2015-07-18 22:46:40 +02:00
Matthew Honnibal
4dddc8a69b * Fix type declarations for attr_t. Remove unused id_t. 2015-07-18 22:39:57 +02:00
Matthew Honnibal
f7f0ad1a78 * Fix tests 2015-07-17 21:31:44 +02:00
Matthew Honnibal
68374149ae * Move huffman encoding test to tests/serialize directory 2015-07-17 21:22:18 +02:00
Matthew Honnibal
e950f5a408 * Tests for serializer 2015-07-17 21:21:10 +02:00
Matthew Honnibal
db9dfd2e23 * Major refactor of serialization. Nearly complete now. 2015-07-17 01:27:54 +02:00
Matthew Honnibal
e1c702e498 * Upd tests after refactor 2015-07-14 00:08:50 +02:00
Matthew Honnibal
98382bd7a0 * Update tests after refactor 2015-07-13 22:30:01 +02:00
Matthew Honnibal
5b0a7190c9 * Round-trip for serialization finally working. Needs a lot of optimization. 2015-07-13 18:39:38 +02:00
Matthew Honnibal
0628e0e2a8 * Add tests for huffman encoding 2015-07-13 12:58:07 +02:00
Matthew Honnibal
53d1f5b2eb * Rename Span.head to Span.root. 2015-07-09 17:30:58 +02:00
Matthew Honnibal
7d2964f673 * Test that whitespace is not assigned a tag 2015-07-09 13:31:40 +02:00
Matthew Honnibal
f95da0bd52 * Allow tests to read model dir from SPACY_DATA environment variable 2015-07-09 12:18:02 +02:00
Matthew Honnibal
fe9118a528 * Add test for strip_bad_periods reading in read_conll.parse 2015-06-18 16:36:04 +02:00
Matthew Honnibal
bd4f5f89cb * Add note about failed tokenization 2015-06-08 16:17:07 +02:00
Matthew Honnibal
7195c07897 * Add explicit tokenizer test 2015-06-07 21:29:17 +02:00
Matthew Honnibal
957e6eaa8a * Fixes to prag_sbd tests 2015-06-07 21:19:17 +02:00
Matthew Honnibal
7a9e9ab9b2 * Update prag_sbd challenge set 2015-06-07 21:13:55 +02:00
Matthew Honnibal
e85d1a3808 * Fix test_conjuncts 2015-06-07 19:31:16 +02:00
Matthew Honnibal
1cfa326f6e * Comment out test_conjuncts 2015-06-07 19:08:04 +02:00
Matthew Honnibal
638e07939d * Avoid laoding vectors in test_token_references 2015-06-07 19:03:16 +02:00
Matthew Honnibal
50768241b3 * Fix test_docs.py 2015-06-07 19:02:43 +02:00
Matthew Honnibal
d83255db17 * Fix ner test 2015-06-07 18:57:42 +02:00
Matthew Honnibal
c6dc151fc3 * Fix spans/test_merge.py 2015-06-07 18:46:16 +02:00
Matthew Honnibal
2676240cbb * Fix spans/test_merge.py 2015-06-07 18:45:19 +02:00
Matthew Honnibal
9abb0dd4fd * Fix spans/test_merge.py 2015-06-07 18:44:18 +02:00
Matthew Honnibal
8a4c9c33f1 * Fix test_token_references test 2015-06-07 18:33:04 +02:00
Matthew Honnibal
5f44adc659 * Add tests/spans/conftest.py 2015-06-07 18:07:59 +02:00
Matthew Honnibal
dd587b7477 * Fix tests 2015-06-07 18:07:32 +02:00
Matthew Honnibal
e3af6af83c * Add tests/vocab/conftest.py 2015-06-07 18:02:47 +02:00
Matthew Honnibal
88041f69d1 * More work on reorganising tests, using conftest.py 2015-06-07 18:02:24 +02:00
Matthew Honnibal
674ee5dde7 * Add conftest.py to tests/, to allow session-global pipeline. This allows much faster tests. 2015-06-07 17:53:14 +02:00
Matthew Honnibal
877abb0e5b * Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds. 2015-06-07 17:24:49 +02:00
Matthew Honnibal
1d5f20fdda * Move nlp variable from global scope 2015-06-07 16:55:11 +02:00
Matthew Honnibal
d37dca72dd * Reorganize tests 2015-06-07 16:49:46 +02:00
Matthew Honnibal
e2578fbb90 * Avoid parsing and tagging in test_emoticons 2015-06-06 05:59:20 +02:00
Matthew Honnibal
4126ef3b8c * Restore hyphenation test to test_infix 2015-06-06 05:57:36 +02:00
Matthew Honnibal
b7fd77779a * Add some tests for reading NER data 2015-05-27 17:37:03 +02:00
Matthew Honnibal
89c3364041 * Update tests, preventing the parser from being loaded if possible 2015-05-25 01:02:03 +02:00
Matthew Honnibal
a9c70c9447 * Add tests for ontonotes sgml extraction 2015-05-24 21:52:12 +02:00
Matthew Honnibal
f460a8d2b6 * Comment out failing test in test_conjuncts 2015-05-24 21:51:41 +02:00
Matthew Honnibal
13a8595a4b * Add tests for Levenshtein alignment of training data 2015-05-24 21:46:11 +02:00
Matthew Honnibal
bdaddc4103 * Add PTB file read tests 2015-05-20 17:29:27 +02:00
Matthew Honnibal
bdb56497b5 * Add test for right_edge and left_edge 2015-05-12 20:27:55 +02:00
Matthew Honnibal
fb8d50b3d5 Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-04-30 12:45:15 +02:00
Matthew Honnibal
918b820472 * Add testing file for issues such as raised in #57 2015-04-28 20:46:29 +02:00
Jordan Suchow
38ed265b7d Tweak line spacing 2015-04-19 13:01:38 -07:00
Jordan Suchow
3a8d9b37a6 Remove trailing whitespace 2015-04-19 13:01:38 -07:00
Jordan Suchow
5f0f940a1f Remove unused imports 2015-04-19 01:05:22 -07:00
Matthew Honnibal
f7ffd94e6a * Add Token.conjuncts property 2015-04-17 01:40:53 +02:00
Matthew Honnibal
4757899370 * Fix times test 2015-04-16 04:50:40 +02:00
Matthew Honnibal
2ef170a991 * Fix Issue #54: Error merging multi-word token when there's a mid-token match. 2015-04-16 04:28:06 +02:00
Matthew Honnibal
0a62c32c14 * Add test for simple NER case 2015-04-13 21:33:54 +02:00
Matthew Honnibal
3faaad0271 * Remove bad import from test_span 2015-04-12 22:42:14 +02:00
Matthew Honnibal
ed1907b4df * Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail. 2015-04-12 04:46:40 +02:00
Matthew Honnibal
085574ccc1 * Add test for Issue #44 2015-04-07 06:05:18 +02:00
Matthew Honnibal
6640386b25 * Fix Issue #43: TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way. 2015-04-07 06:00:57 +02:00
Matthew Honnibal
6674d719a5 * Test for Issue #43: TAG attribute not working in array export 2015-04-07 05:53:50 +02:00
Matthew Honnibal
3b5ea3731a * Add tests for Span stuff 2015-04-07 04:52:25 +02:00
Matthew Honnibal
c2b9a61ee2 * Upd merge test 2015-04-07 04:51:31 +02:00
Matthew Honnibal
3f1e17bd3c * Add tests for new merge() method 2015-03-30 01:37:57 +02:00
Matthew Honnibal
ee927fbbb4 * Fix test_morph_exceptions 2015-03-26 16:44:46 +01:00
Matthew Honnibal
27d9df49e7 * Upd sbd tests 2015-03-26 16:44:45 +01:00
Matthew Honnibal
d81b7be6a2 * Merge train.py 2015-03-26 16:44:41 +01:00
Matthew Honnibal
0962ffc095 * Fix issue #37: missing check_flag attribute from Token class 2015-03-26 15:06:26 +01:00
Matthew Honnibal
a61dacb4e5 * Add tests for new subtree method 2015-03-03 05:41:00 -05:00
Matthew Honnibal
47a4371fea * Upd tokenizer with i.e. tests 2015-02-18 06:37:04 -05:00
Matthew Honnibal
9dbc31d72c * Add test from NSchrading 2015-02-16 11:49:31 -05:00
Matthew Honnibal
7572e31f5e * Pass ownership of C data to Token instances if Tokens object is being garbage-collected, but Token instances are staying alive. 2015-02-11 18:05:06 -05:00
leofidus
363473aeed Add rokenizer test for zero length string 2015-02-10 08:20:32 -05:00
Matthew Honnibal
99f0a315f9 * Add test for Issue 24 2015-02-08 18:30:46 -05:00
Matthew Honnibal
3e8c87af1a * Extend parse tree navigation tests 2015-02-07 18:28:45 -05:00
Matthew Honnibal
aadc57ab00 * Add tests for tokens api 2015-02-07 13:14:07 -05:00
Matthew Honnibal
6b68607b1f * Add some tests for the code in the index.html docstrings 2015-02-07 08:52:13 -05:00
Matthew Honnibal
7d9d8c1489 * Fix test_tag_names again 2015-02-01 16:25:03 +11:00