Skip to content

Commit 5a2e975

Browse files
authored
Merge pull request #102 from taoyudong/master
Fix Google Image parser
2 parents 7e78a9a + 2580b27 commit 5a2e975

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

icrawler/builtin/google.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -159,7 +159,7 @@ def parse(self, response):
159159
#data = meta[31][0][12][2]
160160
#uris = [img[1][3][0] for img in data if img[0] == 1]
161161

162-
uris = re.findall(r'http.*?\.(?:jpg|png|bmp)', txt)
162+
uris = re.findall(r'http[^\[]*?\.(?:jpg|png|bmp)', txt)
163163
return [{'file_url': uri} for uri in uris]
164164

165165

0 commit comments

Comments
 (0)