Latent Dirichlet Allocation

A library implementing the Latent Dirichlet Allocation (LDA) topic modelling algorithm in Python and C.

 __    ____  _____ 
|  |  |    \|  _  |
|  |__|  |  |     |
|_____|____/|__|__|

Usage

The best way to use liblda is through the command line:

./run.py --docs docs.txt --numT 40 --vocab vocab.txt --seed 3 --iter 400 --alpha 0.1 --beta 0.01 --save_probs --print_topics 10

where:

docs.txt contains one document per line,
vocab.txt contains the vocabulary (one word per line),
--save_probs writes out the estimated probability distributions phi (topic-word) and theta (document-topic).
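
As a rough sketch of preparing the two input files, the snippet below builds them from a small in-memory corpus. It assumes that each line of docs.txt holds lowercase, whitespace-separated word tokens and that the vocabulary is written in sorted order. The README does not specify either detail, and the helper name write_corpus is hypothetical, so check the loader in liblda before relying on this layout.

```python
from pathlib import Path


def write_corpus(docs, docs_path="docs.txt", vocab_path="vocab.txt"):
    """Write one document per line and one vocabulary word per line.

    Assumption (not confirmed by the README): documents are stored as
    lowercase, whitespace-separated tokens.
    """
    tokenized = [doc.lower().split() for doc in docs]
    # The vocabulary is the sorted set of all distinct tokens.
    vocab = sorted({word for tokens in tokenized for word in tokens})
    Path(docs_path).write_text(
        "\n".join(" ".join(tokens) for tokens in tokenized) + "\n"
    )
    Path(vocab_path).write_text("\n".join(vocab) + "\n")
    return vocab
```

Calling write_corpus(["The cat sat", "the dog"]) would produce a two-line docs.txt and a four-line vocab.txt in the current directory.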

Installation

Place the directory liblda somewhere in your Python path.
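
If you would rather not move the directory, one alternative is to extend the module search path at runtime. This is a minimal sketch: the path below is a placeholder, and it assumes the liblda directory is importable as a package from its parent directory.

```python
import sys

# Hypothetical location: the directory that CONTAINS liblda/, not liblda/ itself.
LIBLDA_PARENT = "/path/to/parent-of-liblda"

if LIBLDA_PARENT not in sys.path:
    sys.path.insert(0, LIBLDA_PARENT)

# After this, `import liblda` should resolve, provided the assumption above holds.
```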

Features

Inference uses Gibbs sampling. The sampler is implemented in C, where it is fairly efficient; all other functionality is written in Python, so it is easy to modify.
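
For readers unfamiliar with the method, here is a minimal pure-Python sketch of a textbook collapsed Gibbs sampler for LDA. It is not the C implementation shipped in liblda: the function name gibbs_lda, its signature, and the representation of documents as lists of integer word ids are all assumptions made for illustration. The alpha and beta arguments play the role of the Dirichlet hyperparameters passed on the command line, and the returned phi and theta are the topic-word and document-topic distributions.

```python
import numpy as np


def gibbs_lda(docs, num_words, num_topics, alpha=0.1, beta=0.01, iters=50, seed=3):
    """Collapsed Gibbs sampling for LDA; docs are lists of integer word ids."""
    rng = np.random.default_rng(seed)
    ndk = np.zeros((len(docs), num_topics))  # document-topic counts
    nkw = np.zeros((num_topics, num_words))  # topic-word counts
    nk = np.zeros(num_topics)                # tokens assigned to each topic

    # Random initial topic assignment for every token.
    z = [rng.integers(num_topics, size=len(doc)) for doc in docs]
    for d, doc in enumerate(docs):
        for w, k in zip(doc, z[d]):
            ndk[d, k] += 1
            nkw[k, w] += 1
            nk[k] += 1

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]
                # Remove the token's current assignment from the counts.
                ndk[d, k] -= 1
                nkw[k, w] -= 1
                nk[k] -= 1
                # Conditional over topics given all other assignments,
                # known only up to a normalising constant.
                p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + num_words * beta)
                k = rng.choice(num_topics, p=p / p.sum())
                # Record the newly sampled assignment.
                z[d][i] = k
                ndk[d, k] += 1
                nkw[k, w] += 1
                nk[k] += 1

    # Point estimates from the final sample; each row sums to one.
    phi = (nkw + beta) / (nk[:, None] + num_words * beta)
    theta = (ndk + alpha) / (ndk.sum(axis=1, keepdims=True) + num_topics * alpha)
    return phi, theta
```

The triple loop over iterations, documents, and tokens is the part that is slow in interpreted Python, which is why moving it to C pays off.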

Requirements

numpy (for arrays)
scipy (for weave)

Project status

The code base works, but is a bit of a mess right now. A rewrite in Cython has begun.

Author

Ivan Savov, first dot last at gmail
