Skip to content

A queation about Algorithm 1 Vim Block Process in your paper #132

Open
@shawnnjupt

Description

@shawnnjupt

Image

in line 13, you write

Image

but in mamba ssm architecture , A0'=exp(delta*ParameterA) . so do VIM drop exp ?

but in your code ,i see mamba is used from Mamba source code that uses exp.

So ,which is the true architecture ?

Can anyone help me solve the problem ,thanks very much!!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions