spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-02-16 12:11:20 +03:00

Matthew Honnibal 4b2e5e59ed Add flush_cache method to tokenizer, to fix #1061		2017-07-22 15:06:50 +02:00

The tokenizer caches output for common chunks, for efficiency. This cache must be invalidated when the tokenizer rules change, e.g. when a new special-case rule is introduced. A stale cache is what was causing #1061. When the cache is flushed, we free the intermediate token chunks. I think this is safe, but if we start getting segfaults, this patch is to blame. The resolution would be to simply not free those bits of memory. They'll be freed when the tokenizer exits anyway.
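
The caching behaviour described in this commit message can be illustrated with a small, self-contained sketch. This is not spaCy's tokenizer (which is written in Cython and manages memory by hand); `CachingTokenizer`, its splitting rule, and its attribute names are hypothetical and exist only to show why cached chunks go stale when a special-case rule is added, and how flushing the cache addresses that.

```python
import re


class CachingTokenizer:
    """Toy tokenizer with special cases and a per-chunk cache.

    Hypothetical illustration only; not spaCy's Tokenizer.
    """

    def __init__(self):
        self._special_cases = {}  # chunk -> fixed list of tokens
        self._cache = {}          # chunk -> previously computed tokens

    def add_special_case(self, chunk, tokens):
        self._special_cases[chunk] = list(tokens)
        # Cached output may predate this rule, so discard all of it.
        self.flush_cache()

    def flush_cache(self):
        self._cache.clear()

    def _split_chunk(self, chunk):
        if chunk in self._special_cases:
            return list(self._special_cases[chunk])
        # Default rule (an assumption for this sketch): split runs of
        # word characters from individual punctuation characters.
        return re.findall(r"\w+|[^\w\s]", chunk)

    def __call__(self, text):
        tokens = []
        for chunk in text.split():
            if chunk not in self._cache:
                self._cache[chunk] = self._split_chunk(chunk)
            tokens.extend(self._cache[chunk])
        return tokens


tokenizer = CachingTokenizer()
before = tokenizer("see U.K. now")            # "U.K." is split and cached
tokenizer.add_special_case("U.K.", ["U.K."])  # new rule flushes the cache
after = tokenizer("see U.K. now")             # "U.K." is re-tokenized
```

If `add_special_case` did not call `flush_cache`, the second call would return the cached split of `U.K.` and ignore the new rule, which is the kind of stale result the commit message attributes to #1061.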
__init__.py	Add __init__.py file for regression tests	2016-11-01 13:45:06 +01:00
test_issue54.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue118.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue242.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue309.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue351.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue360.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue361.py	Move test for #361 to regression tests	2017-01-12 16:51:12 +01:00
test_issue401.py	Pass regression test for #401 (resolves #401)	2017-03-18 17:06:49 +01:00
test_issue429.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue514.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue587.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue588.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue589.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue590.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue595.py	Fix regression test	2017-03-25 22:35:07 +01:00
test_issue599.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue600.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue605.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue615.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue617.py	Tidy up and rename regression tests and remove unnecessary imports	2017-01-12 22:00:37 +01:00
test_issue686.py	Add title-case version of morph rules (resolves #686)	2017-03-18 17:27:11 +01:00
test_issue693.py	Fix noun_chunk rules around coordination	2017-04-07 17:06:40 +02:00
test_issue704.py	Add regression test for #704 and test new model (resolves #704)	2017-03-18 16:04:14 +01:00
test_issue717.py	Fix #717: Set correct lemma for contracted verbs	2017-03-18 16:16:10 +01:00
test_issue719.py	Fix #719: Lemmatizer can no longer output empty string	2017-03-18 16:02:06 +01:00
test_issue736.py	Fix formatting	2017-01-12 22:56:20 +01:00
test_issue740.py	Fix formatting	2017-01-16 13:09:52 +01:00
test_issue743.py	Add missing unicode declaration	2017-03-13 12:08:19 +01:00
test_issue744.py	Exclude "were" and "Were" from tokenizer exceptions and add regression test (resolves #744)	2017-01-16 13:10:38 +01:00
test_issue758.py	Fix unicode header in tests	2017-04-24 10:09:01 +02:00
test_issue759.py	Add regression test for #759	2017-01-20 15:11:48 +01:00
test_issue768.py	Fix test after updating the French tokenizer stuff	2017-02-27 11:20:47 +01:00
test_issue775.py	Add more test cases to #775 regression test to cover #847	2017-02-18 14:10:44 +01:00
test_issue781.py	Fix tests	2017-03-16 18:53:47 -05:00
test_issue792.py	Fix handling of trailing whitespace	2017-03-08 15:01:40 +01:00
test_issue801.py	Update regression test for #801 to match current expected behaviour	2017-02-02 16:23:05 +01:00
test_issue805.py	Use consistent unicode declarations	2017-03-12 13:07:28 +01:00
test_issue834.py	Reformat test and use text_file fixture	2017-02-16 23:49:19 +01:00
test_issue850.py	Whitespace	2017-03-07 17:16:26 +01:00
test_issue852.py	Use consistent unicode declarations	2017-03-12 13:07:28 +01:00
test_issue859.py	Use consistent unicode declarations	2017-03-12 13:07:28 +01:00
test_issue886.py	Update docstring in #886 regression test	2017-03-13 12:00:38 +01:00
test_issue891.py	Add / to tokenizer infixes (resolves #891)	2017-04-07 17:30:44 +02:00
test_issue903.py	Prevent lemmatization of base nouns	2017-03-25 21:51:12 +01:00
test_issue910.py	Remove xfail on Test #910	2017-04-23 16:28:55 +02:00
test_issue912.py	Block lemmatization of base-form adjectives	2017-03-25 21:29:57 +01:00
test_issue957.py	Add unicode declaration on new regression test	2017-04-07 18:09:23 +02:00
test_issue995.py	Fix unicode header in tests	2017-04-24 10:09:01 +02:00
test_issue999.py	Try to make test999 less flakey	2017-04-26 18:42:06 +02:00
test_issue1031.py	Fix gaps in Lexeme API. Closes #1031	2017-07-22 13:53:48 +02:00
test_issue1061.py	Add flush_cache method to tokenizer, to fix #1061	2017-07-22 15:06:50 +02:00
test_issue1207.py	Rename regression test	2017-07-22 14:14:35 +02:00