Pull requests: microsoft/onnxruntime-genai
Pull requests list
Default add_generation_prompt for Chat Templates to true in Python API
#1539 opened Jun 9, 2025 by sayanshaw24
Automatically install java maven artifact in the local maven repository
#1531 opened Jun 5, 2025 by asoldano
Clamp KV Cache Size to Sliding Window for NvTensorRtRtx EP
#1523 opened Jun 3, 2025 by BLSharda
Add Gemma3 Model support for NvTensorRtRtx execution provider
#1520 opened Jun 2, 2025 by anujj
[#1509] Use correct resource path for onnxruntime native library retrieval
#1510 opened May 27, 2025 by asoldano
Add Encode with Options for add_special_tokens=True use-case
#1504 opened May 22, 2025 by sayanshaw24
Model Builder: Add Post processing script to convert fp16/32 LM_HEAD to int8 and use tied embeddings
#1437 opened Apr 30, 2025 by sushraja-msft
add extra_options use_channel_wised_quantization to builder.py
#1362 opened Mar 31, 2025 by bopeng1234