Matrix multiplication issue #74

@pulkitmehtaworkmetacube

Description

We are doing the following:

  1. Pick the 4-bit quantized model https://huggingface.co/unsloth/Llama-3.1-Nemotron-70B-Instruct-bnb-4bit
  2. Use it to fine-tune adapterH or a bottleneck adapter.

We reduced the bottleneck dimension from 256 to 64.
The base model loads, but fine-tuning the adapter fails with the following error:

mat1 and mat2 shapes cannot be multiplied (1024x8192 and 1x117440512)
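
One possible reading of these shapes, offered as a rough sketch rather than a confirmed diagnosis: the second operand looks like a packed 4-bit weight rather than a regular 2-D matrix. The check below assumes the failing layer is an MLP projection with an intermediate size of 28672 (the value used by Llama 3.1 70B) and that the 4-bit format stores two weights per byte. Neither assumption is stated in this issue.

```python
# Shapes copied from the error message in this issue.
activation_shape = (1024, 8192)        # mat1: tokens x hidden_size
packed_weight_shape = (1, 117440512)   # mat2: what the matmul received

hidden_size = activation_shape[1]

# Assumption: MLP intermediate size of Llama 3.1 70B.
intermediate_size = 28672

# Number of weights in an unquantized hidden_size x intermediate_size matrix.
full_elements = hidden_size * intermediate_size

# Assumption: 4-bit storage packs two weights into each byte.
packed_elements = full_elements // 2

matches_packed_layout = packed_elements == packed_weight_shape[1]
print(full_elements, packed_elements, matches_packed_layout)
```

If the numbers line up, the adapter code may be multiplying activations against the raw packed weight tensor instead of going through the quantized layer's own forward pass, which would dequantize the weight first. That would also suggest the bottleneck dimension is unrelated to the error.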

Please suggest how to resolve this.
