Hi nanoVLM team,
Thank you for your excellent work on nanoVLM – it's a very impressive project.
I'm currently working on audio-language models, and one thing I've noticed is the lack of simple, well-structured pipelines for building them. I believe nanoVLM, with its clean and modular design, could serve as a strong foundation for supporting the audio modality in a lightweight and extensible way.
I would love to contribute by extending the framework to include audio modality support, leveraging the existing strengths of the nanoVLM architecture.
Please let me know if you'd be open to this contribution — or potentially collaborating on a new repository focused on lightweight audio-language modeling.
Activity
lusxvr commented on Jun 4, 2025
Exciting! This would definitely be an interesting addition; we are very interested in so-called Omni-Models, which can take multiple different modalities as input and output. Do you have a specific idea in mind for how to integrate audio into nanoVLM? I would suggest we brainstorm and plan a bit before jumping into the code.
tsdocode commented on Jun 5, 2025
Interested
carankt commented on Jun 12, 2025
I have an initial implementation of nanoVLM adapted for the audio domain. I was able to use the CLAP model as the feature extractor and make it work. Would love to be a part of the discussion and the brainstorming.
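For concreteness, here is a minimal sketch of what such a CLAP-based audio path could look like, assuming the Hugging Face transformers CLAP checkpoint and a single linear projector into the language model's embedding space. The class name, checkpoint, and dimensions below are illustrative assumptions, not the actual implementation mentioned above:

```python
# Minimal sketch (not the implementation referenced above): pooled CLAP audio embeddings
# projected into the language model's hidden size, mirroring nanoVLM's vision projector.
import torch
import torch.nn as nn
from transformers import ClapModel, ClapProcessor


class AudioProjector(nn.Module):
    """Maps pooled CLAP audio embeddings to the LM hidden size (dims are assumptions)."""

    def __init__(self, clap_dim: int = 512, lm_hidden_dim: int = 576):
        super().__init__()
        self.proj = nn.Linear(clap_dim, lm_hidden_dim)

    def forward(self, audio_embeds: torch.Tensor) -> torch.Tensor:
        return self.proj(audio_embeds)


processor = ClapProcessor.from_pretrained("laion/clap-htsat-unfused")
clap = ClapModel.from_pretrained("laion/clap-htsat-unfused")
projector = AudioProjector()

# One second of dummy audio at 48 kHz, CLAP's expected sampling rate.
waveform = torch.randn(48_000).numpy()
inputs = processor(audios=waveform, sampling_rate=48_000, return_tensors="pt")

with torch.no_grad():
    audio_embeds = clap.get_audio_features(**inputs)  # (batch, 512)

audio_tokens = projector(audio_embeds).unsqueeze(1)   # (batch, 1, lm_hidden_dim)
# audio_tokens could then be concatenated with the text token embeddings before the
# language model, analogous to how nanoVLM splices in image tokens.
```

One open design question is whether a single pooled CLAP embedding is enough, or whether a sequence of frame-level audio tokens (closer to nanoVLM's image patch tokens) would be needed for tasks beyond coarse audio understanding.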
aceliuchanghong commented on Jun 20, 2025
Interested +1
leo1oel commented on Jun 24, 2025
Interesting
WangHaoyuuu commented on Jun 25, 2025
Interesting