Multi-modal projections for Llama-Adapter #2420
efraimdahl started this conversation in Ideas
Just want to make sure I'm not missing anything: the current implementation of Llama-Adapter (AdaptionPrompt) does not seem to include any way to add additional input onto the adaption prefix. In the Llama-Adapter paper (implementation here), the authors include a section on unlocking multimodal reasoning by projecting CLIP embeddings onto the prefix. Are there other PEFT methods, or a configuration of AdaptionPrompt, that already implement this functionality?

Replies: 1 comment

- AFAICT, this is not possible. @yeoedward added the method to PEFT, maybe they can clarify.
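
For anyone exploring this outside of PEFT, here is a minimal sketch of the kind of projection the paper describes, written in plain PyTorch. Everything here is illustrative: the module name, the `clip_proj` layer, and the dimensions are assumptions, not part of PEFT's AdaptionPrompt API.

```python
import torch
import torch.nn as nn


class MultimodalAdaptionPrefix(nn.Module):
    """Illustrative sketch: fuse CLIP image features into a learned adaption prefix.

    Mirrors the idea from the Llama-Adapter paper of projecting visual embeddings
    onto the soft prefix; this is not something PEFT's AdaptionPrompt exposes.
    """

    def __init__(self, clip_dim: int, hidden_dim: int, prefix_len: int):
        super().__init__()
        # Learned soft prefix, one vector per adaption token.
        self.prefix = nn.Parameter(torch.zeros(prefix_len, hidden_dim))
        # Linear projection from the CLIP embedding space to the LLM hidden size.
        self.clip_proj = nn.Linear(clip_dim, hidden_dim)

    def forward(self, clip_features: torch.Tensor) -> torch.Tensor:
        # clip_features: (batch, clip_dim) global image embedding from a CLIP encoder.
        visual = self.clip_proj(clip_features)                 # (batch, hidden_dim)
        # Broadcast-add the projected visual feature onto every prefix token.
        return self.prefix.unsqueeze(0) + visual.unsqueeze(1)  # (batch, prefix_len, hidden_dim)


# Usage sketch: the returned prefix would be concatenated with the token embeddings
# (or injected into the adapted attention layers) of a frozen LLM.
prefix_module = MultimodalAdaptionPrefix(clip_dim=768, hidden_dim=4096, prefix_len=10)
image_emb = torch.randn(2, 768)  # stand-in for CLIP image features
print(prefix_module(image_emb).shape)  # torch.Size([2, 10, 4096])
```

Wiring the resulting prefix into the frozen model's forward pass would still have to be done by hand, which is consistent with the reply above that AdaptionPrompt does not currently provide a hook for this.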