spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-11-19 01:05:56 +03:00

History

adrianeboyd f2bfaa1b38 Filter subtoken matches in merge_subtokens() (#4539 ) The `Matcher` in `merge_subtokens()` returns all possible subsequences of `subtok`, so for sequences of two or more subtoks it's necessary to filter the matches so that the retokenizer is only merging the longest matches with no overlapping spans.		2019-10-28 15:40:28 +01:00
..
__init__.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_analysis.py	Component decorator and component analysis (#4517 )	2019-10-27 13:35:49 +01:00
test_entity_linker.py	Explicitly catch warning in test	2019-10-24 16:16:27 +02:00
test_entity_ruler.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_factories.py	Revert #4334	2019-09-29 17:32:12 +02:00
test_functions.py	Filter subtoken matches in merge_subtokens() (#4539 )	2019-10-28 15:40:28 +01:00
test_pipe_methods.py	Also support passing list to Language.disable_pipes (#4521 )	2019-10-25 16:19:08 +02:00
test_sentencizer.py	Extend unicode character block for Sinhala (#4378 )	2019-10-07 13:17:03 +02:00
test_textcat.py	Revert #4334	2019-09-29 17:32:12 +02:00