Skip to content

Pull requests: nod-ai/shark-ai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Use int8 for handling float8_e4m3fn compatiblity
#1883 opened Jul 18, 2025 by KyleHerndon Loading…
Bump IREE requirement pins to 3.6.0rc20250718
#1874 opened Jul 18, 2025 by shark-pr-automator bot Loading…
[sharktank] Add FP4 quantized tensor split and cat
#1873 opened Jul 18, 2025 by sogartar Loading…
Refactor decoder with stateful tools
#1871 opened Jul 18, 2025 by rsuderman Loading…
Minor fix for token selector reservation
#1867 opened Jul 17, 2025 by rsuderman Loading…
[sharktank] Add toy Llama FP4 quantization
#1857 opened Jul 17, 2025 by sogartar Loading…
Add native scorer
#1842 opened Jul 15, 2025 by zeeshanhaque21 Draft
Fix view override for QuantizedTensor
#1835 opened Jul 15, 2025 by paulzzy Loading…
Lisal.mooncake update write back
#1827 opened Jul 15, 2025 by lisaliu1 Loading…
Bump aiohttp from 3.11.3 to 3.12.14 in /shortfin dependencies Pull requests that update a dependency file python Pull requests that update python code
#1824 opened Jul 15, 2025 by dependabot bot Loading…
Set required Python version to ">=3.11"
#1815 opened Jul 14, 2025 by marbre Loading…
Sharktank extend rotary mask
#1809 opened Jul 11, 2025 by stbaione Loading…
Add a paramter that enables QuaRot for GEMMs
#1779 opened Jul 9, 2025 by KyleHerndon Loading…
[tuner] add support for attention op
#1772 opened Jul 8, 2025 by bangtianliu Loading…
ProTip! What’s not been updated in a month: updated:<2025-06-19.