spaCy

mirror of https://github.com/explosion/spaCy.git synced 2024-12-26 18:06:29 +03:00

Author	SHA1	Message	Date
Matthew Honnibal	3a5299faec	* Upd tests	2015-07-23 01:19:11 +02:00
Matthew Honnibal	20c2db08b7	* Add flags to pytest to tests requiring models, vectors or slow functions to be toggled.	2015-07-23 01:19:03 +02:00
Matthew Honnibal	1c9ea7b835	* Add tests for short string optimization	2015-07-20 12:05:45 +02:00
Matthew Honnibal	f13d5dae91	* Update test_packer	2015-07-20 01:38:29 +02:00
Matthew Honnibal	fb7202a173	* Update test_codecs	2015-07-20 01:38:15 +02:00
Matthew Honnibal	879ef9fa3e	* Update tests for huffman codec	2015-07-19 17:59:51 +02:00
Matthew Honnibal	317cbbc015	* Serialization round trip now working with decent API, but with rough spots in the organisation and requiring vocabulary to be fixed ahead of time.	2015-07-19 15:18:17 +02:00
Matthew Honnibal	0973e2f107	* Update serializer tests	2015-07-18 22:46:40 +02:00
Matthew Honnibal	4dddc8a69b	* Fix type declarations for attr_t. Remove unused id_t.	2015-07-18 22:39:57 +02:00
Matthew Honnibal	f7f0ad1a78	* Fix tests	2015-07-17 21:31:44 +02:00
Matthew Honnibal	68374149ae	* Move huffman encoding test to tests/serialize directory	2015-07-17 21:22:18 +02:00
Matthew Honnibal	e950f5a408	* Tests for serializer	2015-07-17 21:21:10 +02:00
Matthew Honnibal	db9dfd2e23	* Major refactor of serialization. Nearly complete now.	2015-07-17 01:27:54 +02:00
Matthew Honnibal	e1c702e498	* Upd tests after refactor	2015-07-14 00:08:50 +02:00
Matthew Honnibal	98382bd7a0	* Update tests after refactor	2015-07-13 22:30:01 +02:00
Matthew Honnibal	5b0a7190c9	* Round-trip for serialization finally working. Needs a lot of optimization.	2015-07-13 18:39:38 +02:00
Matthew Honnibal	0628e0e2a8	* Add tests for huffman encoding	2015-07-13 12:58:07 +02:00
Matthew Honnibal	53d1f5b2eb	* Rename Span.head to Span.root.	2015-07-09 17:30:58 +02:00
Matthew Honnibal	7d2964f673	* Test that whitespace is not assigned a tag	2015-07-09 13:31:40 +02:00
Matthew Honnibal	f95da0bd52	* Allow tests to read model dir from SPACY_DATA environment variable	2015-07-09 12:18:02 +02:00
Matthew Honnibal	fe9118a528	* Add test for strip_bad_periods reading in read_conll.parse	2015-06-18 16:36:04 +02:00
Matthew Honnibal	bd4f5f89cb	* Add note about failed tokenization	2015-06-08 16:17:07 +02:00
Matthew Honnibal	7195c07897	* Add explicit tokenizer test	2015-06-07 21:29:17 +02:00
Matthew Honnibal	957e6eaa8a	* Fixes to prag_sbd tests	2015-06-07 21:19:17 +02:00
Matthew Honnibal	7a9e9ab9b2	* Update prag_sbd challenge set	2015-06-07 21:13:55 +02:00
Matthew Honnibal	e85d1a3808	* Fix test_conjuncts	2015-06-07 19:31:16 +02:00
Matthew Honnibal	1cfa326f6e	* Comment out test_conjuncts	2015-06-07 19:08:04 +02:00
Matthew Honnibal	638e07939d	* Avoid laoding vectors in test_token_references	2015-06-07 19:03:16 +02:00
Matthew Honnibal	50768241b3	* Fix test_docs.py	2015-06-07 19:02:43 +02:00
Matthew Honnibal	d83255db17	* Fix ner test	2015-06-07 18:57:42 +02:00
Matthew Honnibal	c6dc151fc3	* Fix spans/test_merge.py	2015-06-07 18:46:16 +02:00
Matthew Honnibal	2676240cbb	* Fix spans/test_merge.py	2015-06-07 18:45:19 +02:00
Matthew Honnibal	9abb0dd4fd	* Fix spans/test_merge.py	2015-06-07 18:44:18 +02:00
Matthew Honnibal	8a4c9c33f1	* Fix test_token_references test	2015-06-07 18:33:04 +02:00
Matthew Honnibal	5f44adc659	* Add tests/spans/conftest.py	2015-06-07 18:07:59 +02:00
Matthew Honnibal	dd587b7477	* Fix tests	2015-06-07 18:07:32 +02:00
Matthew Honnibal	e3af6af83c	* Add tests/vocab/conftest.py	2015-06-07 18:02:47 +02:00
Matthew Honnibal	88041f69d1	* More work on reorganising tests, using conftest.py	2015-06-07 18:02:24 +02:00
Matthew Honnibal	674ee5dde7	* Add conftest.py to tests/, to allow session-global pipeline. This allows much faster tests.	2015-06-07 17:53:14 +02:00
Matthew Honnibal	877abb0e5b	* Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds.	2015-06-07 17:24:49 +02:00
Matthew Honnibal	1d5f20fdda	* Move nlp variable from global scope	2015-06-07 16:55:11 +02:00
Matthew Honnibal	d37dca72dd	* Reorganize tests	2015-06-07 16:49:46 +02:00
Matthew Honnibal	e2578fbb90	* Avoid parsing and tagging in test_emoticons	2015-06-06 05:59:20 +02:00
Matthew Honnibal	4126ef3b8c	* Restore hyphenation test to test_infix	2015-06-06 05:57:36 +02:00
Matthew Honnibal	b7fd77779a	* Add some tests for reading NER data	2015-05-27 17:37:03 +02:00
Matthew Honnibal	89c3364041	* Update tests, preventing the parser from being loaded if possible	2015-05-25 01:02:03 +02:00
Matthew Honnibal	a9c70c9447	* Add tests for ontonotes sgml extraction	2015-05-24 21:52:12 +02:00
Matthew Honnibal	f460a8d2b6	* Comment out failing test in test_conjuncts	2015-05-24 21:51:41 +02:00
Matthew Honnibal	13a8595a4b	* Add tests for Levenshtein alignment of training data	2015-05-24 21:46:11 +02:00
Matthew Honnibal	bdaddc4103	* Add PTB file read tests	2015-05-20 17:29:27 +02:00
Matthew Honnibal	bdb56497b5	* Add test for right_edge and left_edge	2015-05-12 20:27:55 +02:00
Matthew Honnibal	fb8d50b3d5	Merge branch 'master' of ssh://github.com/honnibal/spaCy	2015-04-30 12:45:15 +02:00
Matthew Honnibal	918b820472	* Add testing file for issues such as raised in #57	2015-04-28 20:46:29 +02:00
Jordan Suchow	38ed265b7d	Tweak line spacing	2015-04-19 13:01:38 -07:00
Jordan Suchow	3a8d9b37a6	Remove trailing whitespace	2015-04-19 13:01:38 -07:00
Jordan Suchow	5f0f940a1f	Remove unused imports	2015-04-19 01:05:22 -07:00
Matthew Honnibal	f7ffd94e6a	* Add Token.conjuncts property	2015-04-17 01:40:53 +02:00
Matthew Honnibal	4757899370	* Fix times test	2015-04-16 04:50:40 +02:00
Matthew Honnibal	2ef170a991	* Fix Issue #54 : Error merging multi-word token when there's a mid-token match.	2015-04-16 04:28:06 +02:00
Matthew Honnibal	0a62c32c14	* Add test for simple NER case	2015-04-13 21:33:54 +02:00
Matthew Honnibal	3faaad0271	* Remove bad import from test_span	2015-04-12 22:42:14 +02:00
Matthew Honnibal	ed1907b4df	* Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail.	2015-04-12 04:46:40 +02:00
Matthew Honnibal	085574ccc1	* Add test for Issue #44	2015-04-07 06:05:18 +02:00
Matthew Honnibal	6640386b25	* Fix Issue #43 : TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way.	2015-04-07 06:00:57 +02:00
Matthew Honnibal	6674d719a5	* Test for Issue #43 : TAG attribute not working in array export	2015-04-07 05:53:50 +02:00
Matthew Honnibal	3b5ea3731a	* Add tests for Span stuff	2015-04-07 04:52:25 +02:00
Matthew Honnibal	c2b9a61ee2	* Upd merge test	2015-04-07 04:51:31 +02:00
Matthew Honnibal	3f1e17bd3c	* Add tests for new merge() method	2015-03-30 01:37:57 +02:00
Matthew Honnibal	ee927fbbb4	* Fix test_morph_exceptions	2015-03-26 16:44:46 +01:00
Matthew Honnibal	27d9df49e7	* Upd sbd tests	2015-03-26 16:44:45 +01:00
Matthew Honnibal	d81b7be6a2	* Merge train.py	2015-03-26 16:44:41 +01:00
Matthew Honnibal	0962ffc095	* Fix issue #37 : missing check_flag attribute from Token class	2015-03-26 15:06:26 +01:00
Matthew Honnibal	a61dacb4e5	* Add tests for new subtree method	2015-03-03 05:41:00 -05:00
Matthew Honnibal	47a4371fea	* Upd tokenizer with i.e. tests	2015-02-18 06:37:04 -05:00
Matthew Honnibal	9dbc31d72c	* Add test from NSchrading	2015-02-16 11:49:31 -05:00
Matthew Honnibal	7572e31f5e	* Pass ownership of C data to Token instances if Tokens object is being garbage-collected, but Token instances are staying alive.	2015-02-11 18:05:06 -05:00
leofidus	363473aeed	Add rokenizer test for zero length string	2015-02-10 08:20:32 -05:00
Matthew Honnibal	99f0a315f9	* Add test for Issue 24	2015-02-08 18:30:46 -05:00
Matthew Honnibal	3e8c87af1a	* Extend parse tree navigation tests	2015-02-07 18:28:45 -05:00
Matthew Honnibal	aadc57ab00	* Add tests for tokens api	2015-02-07 13:14:07 -05:00
Matthew Honnibal	6b68607b1f	* Add some tests for the code in the index.html docstrings	2015-02-07 08:52:13 -05:00
Matthew Honnibal	7d9d8c1489	* Fix test_tag_names again	2015-02-01 16:25:03 +11:00
Matthew Honnibal	d1c9e7ff70	* Make test_tag_names version-portable	2015-02-01 15:40:20 +11:00
Matthew Honnibal	1f31774078	* Python3 fix for test_tag_names	2015-02-01 13:14:20 +11:00
Matthew Honnibal	0fd4a71bed	* Add provisional tests for sentence boundary detection	2015-01-31 13:46:11 +11:00
Matthew Honnibal	2da694f65e	* Don't load parser in test_pre_punct	2015-01-30 20:11:47 +11:00
Matthew Honnibal	0b53fd7daa	* Add test for parse tree navigation	2015-01-30 18:02:58 +11:00
Matthew Honnibal	7d432b7e39	* Add tests for vector-space model	2015-01-30 16:45:45 +11:00
Matthew Honnibal	48b98e3fb5	* Add test for tag names	2015-01-30 16:45:11 +11:00
Matthew Honnibal	613a195f92	* Add test for indices	2015-01-30 16:44:29 +11:00
Matthew Honnibal	03cc2ee08e	* Add test for numpy array transport	2015-01-30 16:43:55 +11:00
Matthew Honnibal	b3f9b199cf	Merge branch 'punctparse'	2015-01-30 16:38:56 +11:00
Matthew Honnibal	d0e08a5b57	* Upd index tests	2015-01-30 12:35:13 +11:00
Matthew Honnibal	35a18250cc	* Upd tests, avoiding unnecessary processing to make testing faster	2015-01-30 10:41:55 +11:00
Matthew Honnibal	bda5347c5c	* Fix unicode in test	2015-01-25 19:04:23 +11:00
Matthew Honnibal	706305ee26	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
Matthew Honnibal	5ed8b2b98f	* Rename sic to orth	2015-01-23 02:08:25 +11:00
Matthew Honnibal	93d4bd6c2e	* Add test for ). in tokenizer	2015-01-22 22:25:18 +11:00
Matthew Honnibal	06e7456c65	* Upd tests	2015-01-17 17:33:23 +11:00
Matthew Honnibal	802867e96a	* Revise interface to Token. Strings now have attribute names like norm1_	2015-01-15 03:51:47 +11:00

1 2 3 4 5

243 Commits