-
Notifications
You must be signed in to change notification settings - Fork 67
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Updating Wan Vae pytest, Export to 45p, minor issues
#783
opened Feb 9, 2026 by
tv-karthikeya
Loading…
removed duplication of
mdp_json_path in compilation command (#706)
#779
opened Feb 5, 2026 by
ochougul
Loading…
Fixing the issue of CCL support during the decoding phase of Disaggregated Serving
#776
opened Feb 5, 2026 by
vjanfaza
Loading…
WIP: Enabled fp16/bf16 based export and compile for some causalLMs.
#775
opened Feb 3, 2026 by
quic-dhirajku
•
Draft
feat(QEff: Attn): add KV & Q blocking strategies for causal LMs
enhancement
New feature or request
qeff.blocking
Accuracy Onboarding - This is a test template
wip
Work in progress
#767
opened Jan 29, 2026 by
abisravi777-hub
•
Draft
[Qeff.finetuning] Adding Full document for hf_based finetuning stack
#732
opened Jan 16, 2026 by
tchawada
Loading…
[QEff. Finetuning] Adding finetune_experiemental.py and related files
#731
opened Jan 16, 2026 by
quic-swatia
Loading…
Adding the support of dense models distilled from moe models with the same architecture
#728
opened Jan 16, 2026 by
vjanfaza
Loading…
Added changes to load and export Llama model in bfloat16/float16 precision
#707
opened Jan 7, 2026 by
quic-dhirajku
•
Draft
[QEff. Finetuning] Loading HF models partially to save testing compute
#704
opened Jan 6, 2026 by
quic-swatia
Loading…
Subfunction fix: changed invalid_index to INT32MAX Always
#700
opened Jan 5, 2026 by
abhishek-singh591
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.