so close to run, maybe... #28

davebar · 2015-02-16T14:48:18Z

Changed links for chrome, installation seems OK but after: python fbstalker1.py -user blabla
this is what I get

Traceback (most recent call last):
File "fbstalker.py", line 18, in
from pygraphml.GraphMLParser import *
ImportError: No module named GraphMLParser

When droped line, it has same error on next (Graph, Node, Edge)

Any idea?

fragmi · 2015-02-21T08:01:27Z

for me, I commented out the following in fbstalker:

fragmi · 2015-02-21T08:01:52Z

Added:
from pygraphml import *

Commented out:
#from pygraphml.GraphMLParser import *
#from pygraphml.Graph import *
#from pygraphml.Node import *
#from pygraphml.Edge import *

davebar · 2015-02-23T12:53:53Z

Changed from pygraphml.GraphMLParser import * to from pygraphml.graphmp_parser import *
and also folowing 3 lines: Graph to graph, Node to node, Edge to edge - now seems to work...

Now new errors:
-fbstalker run ok but I cant get Chrome (27.0.1453.110) to show pictures (images were blocked on this page - message from Chrome - cant change in settings!). Runing Chrome alone (not from fbstalker shows pics normal.
-Caching runs but writing to database file gets: photosOf (same as photosBy, photos Commented) list index out of range
-There is no maltego file (it shows maltego file created)

7109node · 2015-03-12T21:02:43Z

SO I have been working at this for a little while now. I have it running to where it will scrape the pages and complete the run with pictures etc, but when it goes to write the the SQL.db it records nothing. I also get the index out of range error. I believe this is being caused by a change in facebook's backend and the information being parsed out into larger fields than allotted for in the original script. I'm going to give it another look when I get some time.

davebar · 2015-03-16T08:44:07Z

Yeah, same clue at my side - sqlite3 problem - still workin on it, no progress so far...

Fluffinko · 2015-04-03T14:04:01Z

@davebar all parse definitions is obsolete and not used this way in facebook, so to make it work we need to rewrite

def parsePhotosOf(html):
    soup = BeautifulSoup(html)  
    photoPageLink = soup.findAll("a", {"class" : "_23q"})
    tempList = []
    for i in photoPageLink:
        html = str(i)
        soup1 = BeautifulSoup(html)
        pageName = soup1.findAll("img", {"class" : "img"})
        pageName1 = soup1.findAll("img", {"class" : "scaledImageFitWidth img"})
        pageName2 = soup1.findAll("img", {"class" : "_46-i img"})   
        for z in pageName2:
            if z['src'].endswith('.jpg'):
                url1 = i['href']
                r = re.compile('fbid=(.*?)&set=bc')
                m = r.search(url1)
                if m:
                    filename = 'fbid_'+ m.group(1)+'.html'
                    filename = filename.replace("profile.php?id=","")
                    if not os.path.lexists(filename):
                        #html1 = downloadPage(url1)
                        html1 = downloadFile(url1)
                        print "[*] Caching Photo Page: "+m.group(1)
                        text_file = open(filename, "w")
                        text_file.write(normalize(html1))
                        text_file.close()
                    else:
                        html1 = open(filename, 'r').read()
                soup2 = BeautifulSoup(html1)
                username2 = soup2.find("div", {"class" : "fbPhotoContributorName"})
                r = re.compile('a href="(.*?)"')
                m = r.search(str(username2))
                if m:   
                    username3 = m.group(1)
                    username3 = username3.replace("https://www.facebook.com/","")
                    username3 = username3.replace("profile.php?id=","")
                    print "[*] Extracting Data from Photo Page: "+username3
                    tempList.append([str(uid),z['alt'],z['src'],i['href'],username3])
        for y in pageName1:
            if y['src'].endswith('.jpg'):
                url1 = i['href']
                r = re.compile('fbid=(.*?)&set=bc')
                m = r.search(url1)
                if m:
                    filename = 'fbid_'+ m.group(1)+'.html'
                    filename = filename.replace("profile.php?id=","")
                    if not os.path.lexists(filename):
                        #html1 = downloadPage(url1)
                        html1 = downloadFile(url1)
                        print "[*] Caching Photo Page: "+m.group(1)
                        text_file = open(filename, "w")
                        text_file.write(normalize(html1))
                        text_file.close()
                    else:
                        html1 = open(filename, 'r').read()
                soup2 = BeautifulSoup(html1)
                username2 = soup2.find("div", {"class" : "fbPhotoContributorName"})
                r = re.compile('a href="(.*?)"')
                m = r.search(str(username2))
                if m:   
                    username3 = m.group(1)
                    username3 = username3.replace("https://www.facebook.com/","")
                    username3 = username3.replace("profile.php?id=","")
                    print "[*] Extracting Data from Photo Page: "+username3
                    tempList.append([str(uid),y['alt'],y['src'],i['href'],username3])
        for x in pageName:
            if x['src'].endswith('.jpg'):
                url1 = i['href']
                r = re.compile('fbid=(.*?)&set=bc')
                m = r.search(url1)
                if m:
                    filename = 'fbid_'+ m.group(1)+'.html'
                    filename = filename.replace("profile.php?id=","")
                    if not os.path.lexists(filename):
                        #html1 = downloadPage(url1)
                        html1 = downloadFile(url1)
                        print "[*] Caching Photo Page: "+m.group(1)
                        text_file = open(filename, "w")
                        text_file.write(normalize(html1))
                        text_file.close()
                    else:
                        html1 = open(filename, 'r').read()
                soup2 = BeautifulSoup(html1)
                username2 = soup2.find("div", {"class" : "fbPhotoContributorName"})
                r = re.compile('a href="(.*?)"')
                m = r.search(str(username2))
                if m:   
                    username3 = m.group(1)
                    username3 = username3.replace("https://www.facebook.com/","")
                    username3 = username3.replace("profile.php?id=","")
                    print "[*] Extracting Data from Photo Page: "+username3
                    tempList.append([str(uid),x['alt'],x['src'],i['href'],username3])
    return tempList

7109node · 2015-04-14T15:10:25Z

anymore progress on this?

Fluffinko · 2015-04-14T17:30:18Z

still no progress , working on it

DocKali · 2017-09-22T15:58:49Z

Something new, 2 years later?

Indeed, I can launch fbstalker no problem about that. It launches Chrome, connect to FB and then nothing else. I have always the same message : "Problem converting username to uid".

I try with and without Facebook token, I uninstall / reinstall Chrome and Chromedriver but the problem is still there.

Anyone has an idea?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

so close to run, maybe... #28

so close to run, maybe... #28

davebar commented Feb 16, 2015

fragmi commented Feb 21, 2015

fragmi commented Feb 21, 2015

davebar commented Feb 23, 2015

7109node commented Mar 12, 2015

davebar commented Mar 16, 2015

Fluffinko commented Apr 3, 2015

7109node commented Apr 14, 2015

Fluffinko commented Apr 14, 2015

DocKali commented Sep 22, 2017

so close to run, maybe... #28

so close to run, maybe... #28

Comments

davebar commented Feb 16, 2015

fragmi commented Feb 21, 2015

fragmi commented Feb 21, 2015

davebar commented Feb 23, 2015

7109node commented Mar 12, 2015

davebar commented Mar 16, 2015

Fluffinko commented Apr 3, 2015

7109node commented Apr 14, 2015

Fluffinko commented Apr 14, 2015

DocKali commented Sep 22, 2017