Commit Graph

280 Commits

Author SHA1 Message Date
Matthew Honnibal
83d1a1e512 * Fix lemmatizer tests 2015-09-08 15:39:43 +02:00
Matthew Honnibal
fc8f7b123d * Mark a matcher test as requiring the model 2015-09-06 20:19:51 +02:00
Matthew Honnibal
86c888667f * Merge in changes from de branch 2015-09-06 19:49:28 +02:00
Matthew Honnibal
dbf8dce109 Merge branch 'gaz' of ssh://github.com/honnibal/spaCy into gaz 2015-09-06 18:44:14 +02:00
Matthew Honnibal
571b6eda88 * Upd tests 2015-09-06 05:40:10 +02:00
Matthew Honnibal
c07eea8563 * Comment out old doc tests for now 2015-08-26 19:23:04 +02:00
Matthew Honnibal
884251801e * Mark space tests as requiring model 2015-08-26 19:22:50 +02:00
Matthew Honnibal
ff9db9f3ae * Fix serializer tests for new attr scheme 2015-08-26 19:22:26 +02:00
Matthew Honnibal
658c4a3930 * Mark test_inital as requiring models 2015-08-26 19:22:06 +02:00
Matthew Honnibal
aa12b374c0 * Remove old doc tests 2015-08-22 22:12:55 +02:00
Matthew Honnibal
dcc8fadc7e * Add gazetteer-matcher 2015-08-22 22:10:43 +02:00
Matthew Honnibal
0ea08c4319 * Fix test partial parse 2015-08-08 23:45:36 +02:00
Matthew Honnibal
d4e79a5ca2 * Fix test initial parse 2015-08-08 23:41:36 +02:00
Matthew Honnibal
c044c0e18e * Fix partial actions test 2015-08-08 23:38:19 +02:00
Matthew Honnibal
63f86efa8b * Add test for specifying initial actions 2015-08-08 19:14:01 +02:00
Matthew Honnibal
8b8df851ca * Fix print statement in test_merge 2015-08-06 16:28:31 +02:00
Matthew Honnibal
2767979135 * Update matcher tests 2015-08-06 16:09:28 +02:00
Matthew Honnibal
faf75dfcb9 * Update matcher tests 2015-08-06 14:33:35 +02:00
Matthew Honnibal
9c1724ecae * Gazetteer stuff working, now need to wire up to API 2015-08-06 00:35:40 +02:00
Matthew Honnibal
5bc0e83f9a * Reimplement matching in Cython, instead of Python. 2015-08-05 01:05:54 +02:00
Matthew Honnibal
4c87a696b3 * Add draft dfa matcher, in Python. Passing tests. 2015-08-04 15:55:28 +02:00
Matthew Honnibal
d17a15ae66 * Add test to check parse is being deserialized properly 2015-07-28 21:04:00 +02:00
Matthew Honnibal
9034f8a1cf * Update test_docs 2015-07-27 22:15:19 +02:00
Matthew Honnibal
6deb1e84b6 * Upd serialization tests 2015-07-27 21:25:48 +02:00
Matthew Honnibal
0f4d0d51ab * Test is_oov property 2015-07-27 01:50:34 +02:00
Matthew Honnibal
b997b1122b * Mark test_io as requiring the model 2015-07-26 21:36:22 +02:00
Matthew Honnibal
5c9b8d05e4 * Upd test_docs 2015-07-26 17:41:13 +02:00
Matthew Honnibal
609f729cc5 * Fix infix test 2015-07-26 17:32:55 +02:00
Matthew Honnibal
460b4c3207 * Add more infix tests 2015-07-26 17:30:34 +02:00
Matthew Honnibal
48a4d15264 * Test token properties 2015-07-26 16:37:39 +02:00
Matthew Honnibal
847c08e411 * Unhack serialization api tests 2015-07-26 16:23:41 +02:00
Matthew Honnibal
3a4c2a3276 * Update doctests 2015-07-26 13:04:18 +02:00
Matthew Honnibal
2b2032d1a0 * Update doctests 2015-07-26 12:57:59 +02:00
Matthew Honnibal
6d0cdb1630 * Py 2/3 compatibility of serialize tests 2015-07-24 04:51:53 +02:00
Matthew Honnibal
ce984f471c * Update tests for python3 2015-07-24 03:47:59 +02:00
Matthew Honnibal
bee2e77983 * Update tests 2015-07-23 09:26:43 +02:00
Matthew Honnibal
4970ebe887 * Add test_io 2015-07-23 01:19:59 +02:00
Matthew Honnibal
3a5299faec * Upd tests 2015-07-23 01:19:11 +02:00
Matthew Honnibal
20c2db08b7 * Add flags to pytest to tests requiring models, vectors or slow functions to be toggled. 2015-07-23 01:19:03 +02:00
Matthew Honnibal
1c9ea7b835 * Add tests for short string optimization 2015-07-20 12:05:45 +02:00
Matthew Honnibal
f13d5dae91 * Update test_packer 2015-07-20 01:38:29 +02:00
Matthew Honnibal
fb7202a173 * Update test_codecs 2015-07-20 01:38:15 +02:00
Matthew Honnibal
879ef9fa3e * Update tests for huffman codec 2015-07-19 17:59:51 +02:00
Matthew Honnibal
317cbbc015 * Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time. 2015-07-19 15:18:17 +02:00
Matthew Honnibal
0973e2f107 * Update serializer tests 2015-07-18 22:46:40 +02:00
Matthew Honnibal
4dddc8a69b * Fix type declarations for attr_t. Remove unused id_t. 2015-07-18 22:39:57 +02:00
Matthew Honnibal
f7f0ad1a78 * Fix tests 2015-07-17 21:31:44 +02:00
Matthew Honnibal
68374149ae * Move huffman encoding test to tests/serialize directory 2015-07-17 21:22:18 +02:00
Matthew Honnibal
e950f5a408 * Tests for serializer 2015-07-17 21:21:10 +02:00
Matthew Honnibal
db9dfd2e23 * Major refactor of serialization. Nearly complete now. 2015-07-17 01:27:54 +02:00
Matthew Honnibal
e1c702e498 * Upd tests after refactor 2015-07-14 00:08:50 +02:00
Matthew Honnibal
98382bd7a0 * Update tests after refactor 2015-07-13 22:30:01 +02:00
Matthew Honnibal
5b0a7190c9 * Round-trip for serialization finally working. Needs a lot of optimization. 2015-07-13 18:39:38 +02:00
Matthew Honnibal
0628e0e2a8 * Add tests for huffman encoding 2015-07-13 12:58:07 +02:00
Matthew Honnibal
53d1f5b2eb * Rename Span.head to Span.root. 2015-07-09 17:30:58 +02:00
Matthew Honnibal
7d2964f673 * Test that whitespace is not assigned a tag 2015-07-09 13:31:40 +02:00
Matthew Honnibal
f95da0bd52 * Allow tests to read model dir from SPACY_DATA environment variable 2015-07-09 12:18:02 +02:00
Matthew Honnibal
fe9118a528 * Add test for strip_bad_periods reading in read_conll.parse 2015-06-18 16:36:04 +02:00
Matthew Honnibal
bd4f5f89cb * Add note about failed tokenization 2015-06-08 16:17:07 +02:00
Matthew Honnibal
7195c07897 * Add explicit tokenizer test 2015-06-07 21:29:17 +02:00
Matthew Honnibal
957e6eaa8a * Fixes to prag_sbd tests 2015-06-07 21:19:17 +02:00
Matthew Honnibal
7a9e9ab9b2 * Update prag_sbd challenge set 2015-06-07 21:13:55 +02:00
Matthew Honnibal
e85d1a3808 * Fix test_conjuncts 2015-06-07 19:31:16 +02:00
Matthew Honnibal
1cfa326f6e * Comment out test_conjuncts 2015-06-07 19:08:04 +02:00
Matthew Honnibal
638e07939d * Avoid laoding vectors in test_token_references 2015-06-07 19:03:16 +02:00
Matthew Honnibal
50768241b3 * Fix test_docs.py 2015-06-07 19:02:43 +02:00
Matthew Honnibal
d83255db17 * Fix ner test 2015-06-07 18:57:42 +02:00
Matthew Honnibal
c6dc151fc3 * Fix spans/test_merge.py 2015-06-07 18:46:16 +02:00
Matthew Honnibal
2676240cbb * Fix spans/test_merge.py 2015-06-07 18:45:19 +02:00
Matthew Honnibal
9abb0dd4fd * Fix spans/test_merge.py 2015-06-07 18:44:18 +02:00
Matthew Honnibal
8a4c9c33f1 * Fix test_token_references test 2015-06-07 18:33:04 +02:00
Matthew Honnibal
5f44adc659 * Add tests/spans/conftest.py 2015-06-07 18:07:59 +02:00
Matthew Honnibal
dd587b7477 * Fix tests 2015-06-07 18:07:32 +02:00
Matthew Honnibal
e3af6af83c * Add tests/vocab/conftest.py 2015-06-07 18:02:47 +02:00
Matthew Honnibal
88041f69d1 * More work on reorganising tests, using conftest.py 2015-06-07 18:02:24 +02:00
Matthew Honnibal
674ee5dde7 * Add conftest.py to tests/, to allow session-global pipeline. This allows much faster tests. 2015-06-07 17:53:14 +02:00
Matthew Honnibal
877abb0e5b * Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds. 2015-06-07 17:24:49 +02:00
Matthew Honnibal
1d5f20fdda * Move nlp variable from global scope 2015-06-07 16:55:11 +02:00
Matthew Honnibal
d37dca72dd * Reorganize tests 2015-06-07 16:49:46 +02:00
Matthew Honnibal
e2578fbb90 * Avoid parsing and tagging in test_emoticons 2015-06-06 05:59:20 +02:00
Matthew Honnibal
4126ef3b8c * Restore hyphenation test to test_infix 2015-06-06 05:57:36 +02:00
Matthew Honnibal
b7fd77779a * Add some tests for reading NER data 2015-05-27 17:37:03 +02:00
Matthew Honnibal
89c3364041 * Update tests, preventing the parser from being loaded if possible 2015-05-25 01:02:03 +02:00
Matthew Honnibal
a9c70c9447 * Add tests for ontonotes sgml extraction 2015-05-24 21:52:12 +02:00
Matthew Honnibal
f460a8d2b6 * Comment out failing test in test_conjuncts 2015-05-24 21:51:41 +02:00
Matthew Honnibal
13a8595a4b * Add tests for Levenshtein alignment of training data 2015-05-24 21:46:11 +02:00
Matthew Honnibal
bdaddc4103 * Add PTB file read tests 2015-05-20 17:29:27 +02:00
Matthew Honnibal
bdb56497b5 * Add test for right_edge and left_edge 2015-05-12 20:27:55 +02:00
Matthew Honnibal
fb8d50b3d5 Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-04-30 12:45:15 +02:00
Matthew Honnibal
918b820472 * Add testing file for issues such as raised in #57 2015-04-28 20:46:29 +02:00
Jordan Suchow
38ed265b7d Tweak line spacing 2015-04-19 13:01:38 -07:00
Jordan Suchow
3a8d9b37a6 Remove trailing whitespace 2015-04-19 13:01:38 -07:00
Jordan Suchow
5f0f940a1f Remove unused imports 2015-04-19 01:05:22 -07:00
Matthew Honnibal
f7ffd94e6a * Add Token.conjuncts property 2015-04-17 01:40:53 +02:00
Matthew Honnibal
4757899370 * Fix times test 2015-04-16 04:50:40 +02:00
Matthew Honnibal
2ef170a991 * Fix Issue #54: Error merging multi-word token when there's a mid-token match. 2015-04-16 04:28:06 +02:00
Matthew Honnibal
0a62c32c14 * Add test for simple NER case 2015-04-13 21:33:54 +02:00
Matthew Honnibal
3faaad0271 * Remove bad import from test_span 2015-04-12 22:42:14 +02:00
Matthew Honnibal
ed1907b4df * Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail. 2015-04-12 04:46:40 +02:00
Matthew Honnibal
085574ccc1 * Add test for Issue #44 2015-04-07 06:05:18 +02:00