spaCy/spacy/tests/regression
Matthew Honnibal 4b2e5e59ed Add flush_cache method to tokenizer, to fix #1061
The tokenizer caches output for common chunks, for efficiency. This
cache is be invalidated when the tokenizer rules change, e.g. when a new
special-case rule is introduced. That's what was causing #1061.

When the cache is flushed, we free the intermediate token chunks.
I *think* this is safe --- but if we start getting segfaults, this patch
is to blame. The resolution would be to simply not free those bits of
memory. They'll be freed when the tokenizer exits anyway.
2017-07-22 15:06:50 +02:00
..
__init__.py Add __init__.py file for regression tests 2016-11-01 13:45:06 +01:00
test_issue54.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue118.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue242.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue309.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue351.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue360.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue361.py Move test for #361 to regression tests 2017-01-12 16:51:12 +01:00
test_issue401.py Pass regression test for #401 (resolves #401) 2017-03-18 17:06:49 +01:00
test_issue429.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue514.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue587.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue588.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue589.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue590.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue595.py Fix regression test 2017-03-25 22:35:07 +01:00
test_issue599.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue600.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue605.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue615.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue617.py Tidy up and rename regression tests and remove unnecessary imports 2017-01-12 22:00:37 +01:00
test_issue686.py Add title-case version of morph rules (resolves #686) 2017-03-18 17:27:11 +01:00
test_issue693.py Fix noun_chunk rules around coordination 2017-04-07 17:06:40 +02:00
test_issue704.py Add regression test for #704 and test new model (resolves #704) 2017-03-18 16:04:14 +01:00
test_issue717.py Fix #717: Set correct lemma for contracted verbs 2017-03-18 16:16:10 +01:00
test_issue719.py Fix #719: Lemmatizer can no longer output empty string 2017-03-18 16:02:06 +01:00
test_issue736.py Fix formatting 2017-01-12 22:56:20 +01:00
test_issue740.py Fix formatting 2017-01-16 13:09:52 +01:00
test_issue743.py Add missing unicode declaration 2017-03-13 12:08:19 +01:00
test_issue744.py Exclude "were" and "Were" from tokenizer exceptions and add regression test (resolves #744) 2017-01-16 13:10:38 +01:00
test_issue758.py Fix unicode header in tests 2017-04-24 10:09:01 +02:00
test_issue759.py Add regression test for #759 2017-01-20 15:11:48 +01:00
test_issue768.py Fix test after updating the French tokenizer stuff 2017-02-27 11:20:47 +01:00
test_issue775.py Add more test cases to #775 regression test to cover #847 2017-02-18 14:10:44 +01:00
test_issue781.py Fix tests 2017-03-16 18:53:47 -05:00
test_issue792.py Fix handling of trailing whitespace 2017-03-08 15:01:40 +01:00
test_issue801.py Update regression test for #801 to match current expected behaviour 2017-02-02 16:23:05 +01:00
test_issue805.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
test_issue834.py Reformat test and use text_file fixture 2017-02-16 23:49:19 +01:00
test_issue850.py Whitespace 2017-03-07 17:16:26 +01:00
test_issue852.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
test_issue859.py Use consistent unicode declarations 2017-03-12 13:07:28 +01:00
test_issue886.py Update docstring in #886 regression test 2017-03-13 12:00:38 +01:00
test_issue891.py Add / to tokenizer infixes (resolves #891) 2017-04-07 17:30:44 +02:00
test_issue903.py Prevent lemmatization of base nouns 2017-03-25 21:51:12 +01:00
test_issue910.py Remove xfail on Test #910 2017-04-23 16:28:55 +02:00
test_issue912.py Block lemmatization of base-form adjectives 2017-03-25 21:29:57 +01:00
test_issue957.py Add unicode declaration on new regression test 2017-04-07 18:09:23 +02:00
test_issue995.py Fix unicode header in tests 2017-04-24 10:09:01 +02:00
test_issue999.py Try to make test999 less flakey 2017-04-26 18:42:06 +02:00
test_issue1031.py Fix gaps in Lexeme API. Closes #1031 2017-07-22 13:53:48 +02:00
test_issue1061.py Add flush_cache method to tokenizer, to fix #1061 2017-07-22 15:06:50 +02:00
test_issue1207.py Rename regression test 2017-07-22 14:14:35 +02:00