Commit Graph

181 Commits

Author SHA1 Message Date
Matthew Honnibal
db9dfd2e23 * Major refactor of serialization. Nearly complete now. 2015-07-17 01:27:54 +02:00
Matthew Honnibal
e1c702e498 * Upd tests after refactor 2015-07-14 00:08:50 +02:00
Matthew Honnibal
98382bd7a0 * Update tests after refactor 2015-07-13 22:30:01 +02:00
Matthew Honnibal
5b0a7190c9 * Round-trip for serialization finally working. Needs a lot of optimization. 2015-07-13 18:39:38 +02:00
Matthew Honnibal
0628e0e2a8 * Add tests for huffman encoding 2015-07-13 12:58:07 +02:00
Matthew Honnibal
53d1f5b2eb * Rename Span.head to Span.root. 2015-07-09 17:30:58 +02:00
Matthew Honnibal
7d2964f673 * Test that whitespace is not assigned a tag 2015-07-09 13:31:40 +02:00
Matthew Honnibal
f95da0bd52 * Allow tests to read model dir from SPACY_DATA environment variable 2015-07-09 12:18:02 +02:00
Matthew Honnibal
fe9118a528 * Add test for strip_bad_periods reading in read_conll.parse 2015-06-18 16:36:04 +02:00
Matthew Honnibal
bd4f5f89cb * Add note about failed tokenization 2015-06-08 16:17:07 +02:00
Matthew Honnibal
7195c07897 * Add explicit tokenizer test 2015-06-07 21:29:17 +02:00
Matthew Honnibal
957e6eaa8a * Fixes to prag_sbd tests 2015-06-07 21:19:17 +02:00
Matthew Honnibal
7a9e9ab9b2 * Update prag_sbd challenge set 2015-06-07 21:13:55 +02:00
Matthew Honnibal
e85d1a3808 * Fix test_conjuncts 2015-06-07 19:31:16 +02:00
Matthew Honnibal
1cfa326f6e * Comment out test_conjuncts 2015-06-07 19:08:04 +02:00
Matthew Honnibal
638e07939d * Avoid laoding vectors in test_token_references 2015-06-07 19:03:16 +02:00
Matthew Honnibal
50768241b3 * Fix test_docs.py 2015-06-07 19:02:43 +02:00
Matthew Honnibal
d83255db17 * Fix ner test 2015-06-07 18:57:42 +02:00
Matthew Honnibal
c6dc151fc3 * Fix spans/test_merge.py 2015-06-07 18:46:16 +02:00
Matthew Honnibal
2676240cbb * Fix spans/test_merge.py 2015-06-07 18:45:19 +02:00
Matthew Honnibal
9abb0dd4fd * Fix spans/test_merge.py 2015-06-07 18:44:18 +02:00
Matthew Honnibal
8a4c9c33f1 * Fix test_token_references test 2015-06-07 18:33:04 +02:00
Matthew Honnibal
5f44adc659 * Add tests/spans/conftest.py 2015-06-07 18:07:59 +02:00
Matthew Honnibal
dd587b7477 * Fix tests 2015-06-07 18:07:32 +02:00
Matthew Honnibal
e3af6af83c * Add tests/vocab/conftest.py 2015-06-07 18:02:47 +02:00
Matthew Honnibal
88041f69d1 * More work on reorganising tests, using conftest.py 2015-06-07 18:02:24 +02:00
Matthew Honnibal
674ee5dde7 * Add conftest.py to tests/, to allow session-global pipeline. This allows much faster tests. 2015-06-07 17:53:14 +02:00
Matthew Honnibal
877abb0e5b * Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds. 2015-06-07 17:24:49 +02:00
Matthew Honnibal
1d5f20fdda * Move nlp variable from global scope 2015-06-07 16:55:11 +02:00
Matthew Honnibal
d37dca72dd * Reorganize tests 2015-06-07 16:49:46 +02:00
Matthew Honnibal
e2578fbb90 * Avoid parsing and tagging in test_emoticons 2015-06-06 05:59:20 +02:00
Matthew Honnibal
4126ef3b8c * Restore hyphenation test to test_infix 2015-06-06 05:57:36 +02:00
Matthew Honnibal
b7fd77779a * Add some tests for reading NER data 2015-05-27 17:37:03 +02:00
Matthew Honnibal
89c3364041 * Update tests, preventing the parser from being loaded if possible 2015-05-25 01:02:03 +02:00
Matthew Honnibal
a9c70c9447 * Add tests for ontonotes sgml extraction 2015-05-24 21:52:12 +02:00
Matthew Honnibal
f460a8d2b6 * Comment out failing test in test_conjuncts 2015-05-24 21:51:41 +02:00
Matthew Honnibal
13a8595a4b * Add tests for Levenshtein alignment of training data 2015-05-24 21:46:11 +02:00
Matthew Honnibal
bdaddc4103 * Add PTB file read tests 2015-05-20 17:29:27 +02:00
Matthew Honnibal
bdb56497b5 * Add test for right_edge and left_edge 2015-05-12 20:27:55 +02:00
Matthew Honnibal
fb8d50b3d5 Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-04-30 12:45:15 +02:00
Matthew Honnibal
918b820472 * Add testing file for issues such as raised in #57 2015-04-28 20:46:29 +02:00
Jordan Suchow
38ed265b7d Tweak line spacing 2015-04-19 13:01:38 -07:00
Jordan Suchow
3a8d9b37a6 Remove trailing whitespace 2015-04-19 13:01:38 -07:00
Jordan Suchow
5f0f940a1f Remove unused imports 2015-04-19 01:05:22 -07:00
Matthew Honnibal
f7ffd94e6a * Add Token.conjuncts property 2015-04-17 01:40:53 +02:00
Matthew Honnibal
4757899370 * Fix times test 2015-04-16 04:50:40 +02:00
Matthew Honnibal
2ef170a991 * Fix Issue #54: Error merging multi-word token when there's a mid-token match. 2015-04-16 04:28:06 +02:00
Matthew Honnibal
0a62c32c14 * Add test for simple NER case 2015-04-13 21:33:54 +02:00
Matthew Honnibal
3faaad0271 * Remove bad import from test_span 2015-04-12 22:42:14 +02:00
Matthew Honnibal
ed1907b4df * Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail. 2015-04-12 04:46:40 +02:00