minor fix (while we've calculated conf.matchRatio for stable pages, we've put a constant value (0.900) for dynamic ones - so applying (ratio - conf.matchRatio) > DIFF_TOLERANCE to dynamic pages too would effectively just raise the threshold to 0.900 + DIFF_TOLERANCE (in our case 0.950), which leaves too narrow a range for a True result)

Miroslav Stampar 2010-11-09 23:21:21 +00:00
parent 5ebd5d935c
commit 47720a43dd


@@ -116,4 +116,7 @@ def comparison(page, headers=None, getSeqMatcher=False, pageLength=None):
    # If the url is not stable it returns sequence matcher between the
    # first untouched HTTP response page content and this content
    else:
        if kb.pageStable:
            return (ratio - conf.matchRatio) > DIFF_TOLERANCE
        else:
            return ratio > conf.matchRatio
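
The effect described in the commit message can be checked with a small standalone sketch. The values 0.900 and 0.05 are taken from the message (0.950 - 0.900), while the function names, parameter names, and the sample ratio of 0.93 are hypothetical and not part of the project code.

```python
# Illustrative sketch only: 0.900 and 0.05 come from the commit message,
# every name below is hypothetical.
DIFF_TOLERANCE = 0.05
DYNAMIC_MATCH_RATIO = 0.900


def is_true_old(ratio, match_ratio):
    """Check before this commit: the tolerance is applied to every page."""
    return (ratio - match_ratio) > DIFF_TOLERANCE


def is_true_new(ratio, match_ratio, page_stable):
    """Check after this commit: the tolerance is applied to stable pages only."""
    if page_stable:
        return (ratio - match_ratio) > DIFF_TOLERANCE
    return ratio > match_ratio


if __name__ == "__main__":
    ratio = 0.93  # assumed sample ratio for a dynamic page
    print("old check:", is_true_old(ratio, DYNAMIC_MATCH_RATIO))
    print("new check:", is_true_new(ratio, DYNAMIC_MATCH_RATIO, page_stable=False))
```

Under these assumptions a dynamic page with ratio 0.93 is rejected by the old check, because it would need to exceed 0.950, and accepted by the new one, because it only needs to exceed 0.900. Stable pages are handled the same way by both versions.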