Open
Description
Is this project up to date enough to use GGUF files or the Llama 3 architecture? The documentation examples use GGML via .bin files, which I assume was the previous file format. I'm specifically interested in the LoRA loading/unloading feature, which doesn't seem to be supported in llama.cpp by itself.