Skip to content

Pull requests: microsoft/onnxruntime-genai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[model_qa.py] Warn when an unrecognized EP name falls back to CPU
#2252 opened Jul 1, 2026 by daijh Contributor Loading…
[model-builder] Cap max_length at 4096 to reduce KV cache memory usage
#2251 opened Jul 1, 2026 by daijh Contributor Loading…
Generate pyd type info
#2247 opened Jun 29, 2026 by themason2011 Loading…
webgpu: fix RecurrentState graph capture with shared buffer aliasing
#2244 opened Jun 26, 2026 by qjia7 Contributor Loading…
5 tasks done
Implement no_repeat_ngram_size for CPU search
#2242 opened Jun 25, 2026 by mustjab Loading…
[Doc] Cuda Plugin EP integration
#2236 opened Jun 22, 2026 by tianleiwu Contributor Draft
Update model builder for gpt-oss
#2234 opened Jun 19, 2026 by tianleiwu Contributor Loading…
3 of 4 tasks
Add address validation feature
#2223 opened Jun 11, 2026 by apsonawane Contributor Loading…
Add path traversal validation
#2222 opened Jun 11, 2026 by apsonawane Contributor Loading…
Qwen3.6 MTP
#2218 opened Jun 11, 2026 by tianleiwu Contributor Draft
Bump torch from 2.7.1+cpu to 2.12.0+cpu in /test/python/cpu/torch dependencies Pull requests that update a dependency file python Pull requests that update python code
#2213 opened Jun 11, 2026 by dependabot Bot Loading…
Bump torch from 2.7.1 to 2.12.0+cpu in /test/python/macos/torch dependencies Pull requests that update a dependency file python Pull requests that update python code
#2212 opened Jun 11, 2026 by dependabot Bot Loading…
[feat]: support qwen3 prompt embeds
#2211 opened Jun 10, 2026 by huisunCompiler Loading…
Prefer using CMake path variables where available
#2208 opened Jun 9, 2026 by jaeyoonjung Contributor Loading…
Add agent skill for creating new model
#2206 opened Jun 9, 2026 by tianleiwu Contributor Draft
Default to arm64 ONNX Runtime on Apple Silicon
#2196 opened Jun 6, 2026 by kkennethwu Loading…
Add AMDGPU execution provider support
#2194 opened Jun 3, 2026 by AMDmoore Loading…
Add quantized KV cache support for CPU provider
#2187 opened May 28, 2026 by tianleiwu Contributor Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.