Option to only dump random walks to disk and skip training #5

Open
wants to merge 6 commits into
base: master

viveksck
Collaborator

Downgraded the required gensim version to 0.10.1, because 0.10.2 does not install via easy_install due to this bug: https://groups.google.com/forum/#!topic/gensim/NSOXuP4IE9Q

Vivek Kulkarni added 2 commits January 26, 2015 14:39
… required version of gensim 0.10.2 cannot be added because of a bug in gensim where easy_install gensim fails for 0.10.2. Refer https://groups.google.com/forum/#!topic/gensim/NSOXuP4IE9Q
@viveksck
Collaborator Author

In [1]: import gensim

In [2]: gensim.__version__
Out[2]: '0.10.1'

vvkulkarni@descartes:~/deepwalk$ deepwalk --input ./example_graphs/karate.adjlist --output karate.embeddings
Number of nodes: 34
Number of walks: 340
Data size (walks*length): 13600
Walking...
Training...

vvkulkarni@descartes:~/deepwalk$ ls -ltr karate.embeddings
-rw-rw-r-- 1 vvkulkarni vvkulkarni 20847 Jan 26 15:02 karate.embeddings

@aboSamoor
Collaborator

This is the solution I used in polyglot
https://github.com/aboSamoor/polyglot/blob/master/setup.py#L20-L22
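The linked lines are not reproduced here. One common pattern for declaring dependencies in setup.py, shown only as a sketch, is to read pinned requirements from a file and pass the resulting list to `install_requires`. The helper name `read_requirements` and the file name `requirements.txt` are assumptions for illustration, not necessarily what polyglot does.

```python
def read_requirements(path="requirements.txt"):
    """Return the non-empty, non-comment lines of a requirements file.

    Hypothetical helper: the name and default path are illustrative only.
    """
    with open(path) as handle:
        lines = (line.strip() for line in handle)
        # Skip blank lines and comment lines; keep pins such as "gensim==0.10.1".
        return [line for line in lines if line and not line.startswith("#")]
```

With a requirements file containing `gensim==0.10.1`, the returned list could be passed as `install_requires=read_requirements()` in the `setup()` call.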

@viveksck
Collaborator Author

viveksck commented May 5, 2016

Pushing changes that add an option to only dump the random walks to disk and skip training. This change is needed for extended work.

vvkulkarni@curie:/toolkits/viveks_deepwalk/deepwalk$ deepwalk --input example_graphs/karate.adjlist --output karate.embeddings --max-memory-data-size 0
Number of nodes: 34
Number of walks: 340
Data size (walks*length): 13600
Data size 13600 is larger than limit (max-memory-data-size: 0). Dumping walks to disk.
Walking...
Counting vertex frequency...
Training...
vvkulkarni@curie:/toolkits/viveks_deepwalk/deepwalk$ deepwalk --input example_graphs/karate.adjlist --output karate.embeddings --max-memory-data-size 0 --only-walk
Number of nodes: 34
Number of walks: 340
Data size (walks*length): 13600
Data size 13600 is larger than limit (max-memory-data-size: 0). Dumping walks to disk.
Walking...
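
The control flow visible in the two runs above can be sketched as follows. This is a minimal illustration, not the code in this PR: the flags `--max-memory-data-size` and `--only-walk` come from the commands above, while `build_parser`, `plan`, the stage names, and the default values are assumptions chosen so that the karate graph reproduces the logged numbers (34 nodes, 340 walks, data size 13600). It also assumes that `--only-walk` takes effect only when walks are dumped to disk.

```python
import argparse


def build_parser():
    """Hypothetical parser; defaults are assumptions that match the logs."""
    parser = argparse.ArgumentParser(prog="deepwalk-sketch")
    parser.add_argument("--number-walks", type=int, default=10)
    parser.add_argument("--walk-length", type=int, default=40)
    parser.add_argument("--max-memory-data-size", type=int, default=1000000000)
    parser.add_argument("--only-walk", action="store_true")
    return parser


def plan(num_nodes, args):
    """Return (num_walks, data_size, stages) for a graph with num_nodes nodes."""
    num_walks = num_nodes * args.number_walks
    data_size = num_walks * args.walk_length  # the "walks*length" figure

    if data_size <= args.max_memory_data_size:
        # Walks fit in memory: walk, then train directly.
        return num_walks, data_size, ["walk", "train"]

    # Walks exceed the limit: dump them to disk instead.
    stages = ["walk_to_disk"]
    if args.only_walk:
        # Assumed behaviour: stop once the walks are on disk.
        return num_walks, data_size, stages
    stages += ["count_vertex_frequency", "train"]
    return num_walks, data_size, stages
```

For example, `plan(34, build_parser().parse_args(["--max-memory-data-size", "0", "--only-walk"]))` yields a single `walk_to_disk` stage, mirroring the second run, where the output stops after `Walking...`.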

@viveksck changed the title from "Adding dependencies to be installed in setup.py" to "Option to only dump random walks to disk and skip training" on May 5, 2016