spyysalo/wvlib

wvlib - word vector library

Work in progress, not currently recommended for any use.

Try the following:

Find 10 words closest to "protein" using word2vec vectors induced on the text8 demo data

echo protein | python nearest.py text8.tar.gz -n 10
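As a rough illustration of what a nearest-neighbor query over word vectors involves, the sketch below ranks words by cosine similarity to a query word. It is not the code in nearest.py: the `cosine` and `nearest` helpers and the tiny `toy_vectors` dictionary are hypothetical, and cosine similarity is assumed as the distance measure.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def nearest(vectors, query, n=10):
    # Rank every word other than the query by similarity to the query vector.
    q = vectors[query]
    scored = [(cosine(q, v), w) for w, v in vectors.items() if w != query]
    scored.sort(reverse=True)
    return [w for _, w in scored[:n]]

# Hypothetical toy vectors, made up for illustration (not taken from text8).
toy_vectors = {
    "protein": [0.9, 0.1, 0.0],
    "enzyme": [0.8, 0.2, 0.1],
    "kinase": [0.7, 0.3, 0.0],
    "paris": [0.0, 0.1, 0.9],
}

print(nearest(toy_vectors, "protein", n=2))
```

With real vectors the dictionary would hold tens of thousands of entries, so a vectorized implementation would normally replace the pure-Python loop.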

Find the word that has the same relationship to "japan" as "paris" has to "france"

echo 'france paris japan' | python analogy.py text8.tar.gz -q -n 1
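A common way to answer analogy queries of this kind is the vector offset method: compute paris - france + japan and return the word whose vector is closest to the result. The sketch below assumes that method and cosine similarity; whether analogy.py works exactly this way is not stated here, and the `analogy` helper and `toy_vectors` are hypothetical.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def analogy(vectors, a, b, c, n=1):
    # Target vector: b - a + c (e.g. paris - france + japan).
    target = [vb - va + vc
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    # The three query words are excluded from the candidates.
    scored = [(cosine(target, v), w)
              for w, v in vectors.items() if w not in (a, b, c)]
    scored.sort(reverse=True)
    return [w for _, w in scored[:n]]

# Hypothetical toy vectors, made up so that the offset works out exactly.
toy_vectors = {
    "france": [1.0, 0.0, 1.0, 0.0],
    "paris": [1.0, 0.0, 0.0, 1.0],
    "japan": [0.0, 1.0, 1.0, 0.0],
    "tokyo": [0.0, 1.0, 0.0, 1.0],
    "berlin": [0.6, 0.4, 0.0, 1.0],
}

print(analogy(toy_vectors, "france", "paris", "japan", n=1))
```

Excluding the query words matters in practice, since one of them is often the nearest neighbor of the offset vector.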

Evaluate the vectors on the binary classification task using words from McIntosh and Curran, "Reducing semantic drift with bagging and distributional similarity" (ACL 2009)

python evalclass.py text8.tar.gz word-classes/McIC-09/*.txt

Evaluate the vectors on the closed-class member retrieval task using the set of standard amino acids

python evalset.py text8.tar.gz word-sets/Ohta-bio-sets/standard-amino-acids.txt
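One plausible way to score a set-member retrieval task is sketched below: rank candidate words by similarity to the centroid of a few known members, then compute average precision against the held-out members. This is an assumption for illustration only. The protocol actually used by evalset.py is not described here, and the helper names, the seed/held-out split and the `toy_vectors` are all hypothetical.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def centroid(vecs):
    # Component-wise mean of a list of vectors.
    n = len(vecs)
    return [sum(col) / n for col in zip(*vecs)]

def average_precision(ranked, relevant):
    # Mean of the precision values at each rank where a relevant word appears.
    hits, total = 0, 0.0
    for rank, word in enumerate(ranked, start=1):
        if word in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

# Hypothetical toy vectors and split, made up for illustration.
toy_vectors = {
    "alanine": [0.9, 0.1],
    "glycine": [0.8, 0.2],
    "serine": [0.85, 0.1],
    "valine": [0.7, 0.3],
    "paris": [0.1, 0.9],
    "tokyo": [0.2, 0.8],
}
seeds = ["alanine", "glycine"]
held_out = {"serine", "valine"}

center = centroid([toy_vectors[w] for w in seeds])
candidates = [w for w in toy_vectors if w not in seeds]
ranked = sorted(candidates,
                key=lambda w: cosine(center, toy_vectors[w]),
                reverse=True)

print(ranked, average_precision(ranked, held_out))
```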

The rest of this README is TODO. See the individual scripts for documentation.