updating google parsing regex (for the better, of course)

This commit is contained in:
Miroslav Stampar 2010-11-19 10:00:29 +00:00
parent d97e97d884
commit e8bef28337

View File

@ -45,7 +45,7 @@ class Google:
matches = []
regExpr = "li class=\042?g\042?\076.+?a href=\042(http[s]*://.+?)\042\sclass=\042?l\042?"
regExpr = r'h3 class="?r"?><a href="(http[s]*://.+?)"\sclass="?l"?'
matches = re.findall(regExpr, page, re.I | re.M)
return matches