Releases: feifeibear/long-context-attention
Releases · feifeibear/long-context-attention
v0.3 released on 27th August 2024!
upgrade flash_attn >= 2.6.0
v0.2 released on 24th June 2024!
- Ulysses supports T4 and V100.
- Updates some directory structures.
v0.1
Sequence parallel attention adopting a hybrid ulysses and ring attention approach.
Support GQA
Support QKV packed.