candle-llava

An implementation of LLaVA using candle.

The code is based on https://github.com/haotian-liu/LLaVA, so models that use the llava-hf version of the config may behave differently.

The llava-hf models include tokenizer.json, so if you want a pure-Rust experience, I suggest using the llava-hf version.
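To illustrate the point about tokenizer.json, here is a minimal sketch of how a program could check whether a local model directory already ships that file. The names `TokenizerSource` and `locate_tokenizer` are hypothetical and are not taken from this repository; the only assumption is that the file sits at the top level of the model directory.

```rust
use std::path::{Path, PathBuf};

/// Where the tokenizer definition comes from (hypothetical type, for illustration).
#[derive(Debug, PartialEq)]
enum TokenizerSource {
    /// The model directory ships a ready-to-use tokenizer.json.
    Bundled(PathBuf),
    /// No tokenizer.json was found, so it has to be produced some other way.
    Missing,
}

/// Looks for `tokenizer.json` directly inside `model_dir`.
fn locate_tokenizer(model_dir: &Path) -> TokenizerSource {
    let candidate = model_dir.join("tokenizer.json");
    if candidate.is_file() {
        TokenizerSource::Bundled(candidate)
    } else {
        TokenizerSource::Missing
    }
}
```

Calling `locate_tokenizer` on a directory that contains tokenizer.json returns `TokenizerSource::Bundled` with the full path; any other directory yields `TokenizerSource::Missing`.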

model zoo

So far I have tested liuhaotian/llava-v1.6-vicuna-7b and llava-hf/llava-v1.6-vicuna-7b-hf. Memory usage has not been optimized yet and can probably be reduced.

eval

single-image

cargo run # default args: model liuhaotian/llava-v1.6-vicuna-7b, image images/llava_logo.png, prompt "is this a cat?"
cargo run -- --image-file "images/llava_v1_5_radar.jpg" --prompt "what does this picture show?"
cargo run -- --model-path "llava-hf/llava-v1.6-vicuna-7b-hf" # use the llava-hf model
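The commands above rely on three flags with defaults. As a rough sketch of that behaviour (not the actual argument handling in src, which I have not reproduced here), the following standard-library-only parser starts from the documented defaults and overrides them flag by flag. The `Args` struct and `parse_args` function are hypothetical names, and the default image path is assumed to live under images/ like the other sample image.

```rust
/// Command-line options with the defaults described in the commands above.
#[derive(Debug, PartialEq)]
struct Args {
    model_path: String,
    image_file: String,
    prompt: String,
}

impl Default for Args {
    fn default() -> Self {
        Args {
            model_path: "liuhaotian/llava-v1.6-vicuna-7b".to_string(),
            // Assumption: the logo sits next to the other sample image in images/.
            image_file: "images/llava_logo.png".to_string(),
            prompt: "is this a cat?".to_string(),
        }
    }
}

/// Parses `--flag value` pairs; any flag that is not given keeps its default.
fn parse_args<I: IntoIterator<Item = String>>(argv: I) -> Result<Args, String> {
    let mut args = Args::default();
    let mut it = argv.into_iter();
    while let Some(flag) = it.next() {
        // Every supported flag takes exactly one value.
        let value = it
            .next()
            .ok_or_else(|| format!("missing value for {}", flag))?;
        match flag.as_str() {
            "--model-path" => args.model_path = value,
            "--image-file" => args.image_file = value,
            "--prompt" => args.prompt = value,
            other => return Err(format!("unknown flag: {}", other)),
        }
    }
    Ok(args)
}
```

In a binary this would be fed with `std::env::args().skip(1)`. A real project would more likely use a crate such as clap; the hand-written loop is only there to keep the sketch dependency-free.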

task

Tokenizer Setup

conda create -n llava python=3.10
conda activate llava
pip install transformers protobuf

Download using a mirror (for users in China)

pip install -U huggingface_hub  
export HF_ENDPOINT=https://hf-mirror.com  
huggingface-cli download --resume-download liuhaotian/llava-v1.6-vicuna-7b
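The HF_ENDPOINT variable above tells huggingface_hub which host to download from. As a hedged sketch of the same idea in Rust, the helpers below pick an endpoint (falling back to https://huggingface.co) and build a file URL using the Hub's `resolve/main` path layout. The function names are hypothetical, `config.json` in the usage note is only an example filename, and whether this repository's own Rust code reads HF_ENDPOINT is not something this README confirms.

```rust
use std::env;

/// Endpoint used when no mirror is configured.
const DEFAULT_ENDPOINT: &str = "https://huggingface.co";

/// Picks the endpoint: a non-empty override wins, otherwise the default.
/// A trailing slash is removed so URLs can be joined consistently.
fn resolve_endpoint(override_url: Option<&str>) -> String {
    match override_url {
        Some(url) if !url.trim().is_empty() => {
            url.trim().trim_end_matches('/').to_string()
        }
        _ => DEFAULT_ENDPOINT.to_string(),
    }
}

/// Reads the HF_ENDPOINT environment variable, if it is set.
fn endpoint_from_env() -> String {
    resolve_endpoint(env::var("HF_ENDPOINT").ok().as_deref())
}

/// Builds the download URL for one file on the `main` revision of a repo.
fn file_url(endpoint: &str, repo_id: &str, filename: &str) -> String {
    format!("{}/{}/resolve/main/{}", endpoint, repo_id, filename)
}
```

For example, with HF_ENDPOINT set to https://hf-mirror.com, `file_url(&endpoint_from_env(), "liuhaotian/llava-v1.6-vicuna-7b", "config.json")` produces a URL on the mirror instead of on huggingface.co.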

Limitations

Tested only on the liuhaotian/llava-v1.6-vicuna-7b version.

chenwanqq/candle-llava
