Pull requests: ggml-org/llama.cpp

Pull requests list

server: add missing rerank and chat presets (#10932)
#18742 opened Jan 10, 2026 by ingyukoh
POC: group gate_exps and up_exps + fix mxfp4 alignment for PP boost
Labels: model, python
#18740 opened Jan 10, 2026 by am17an Draft
llama: add canaries to Markdown files
#18735 opened Jan 10, 2026 by JohannesGaessler
feat: add support for WeDLM architecture
Labels: python
#18731 opened Jan 10, 2026 by feedseawave
5 tasks done
opencl: add softplus op
Labels: ggml, OpenCL
#18726 opened Jan 9, 2026 by shaofeiqi
model: Add VAETKI support
Labels: examples, model, python
#18719 opened Jan 9, 2026 by dororodoroddo
5 tasks done
ggml: new backend for Virglrenderer API Remoting acceleration (v2)
Labels: build, ggml, python
#18718 opened Jan 9, 2026 by kpouget
vulkan: Check maxStorageBufferRange in supports_op
Labels: ggml, Vulkan
#18709 opened Jan 9, 2026 by jeffbolznv
fix text spacing in print_info
#18708 opened Jan 9, 2026 by ddh0
ggml-metal: Clean up files used for embedded build
Labels: Apple Metal, ggml
#18705 opened Jan 9, 2026 by DaAwesomeP
[WIP] ggml-opencl: op args init refactoring
Labels: ggml, OpenCL
#18701 opened Jan 8, 2026 by chraac Draft
Improving inference speed for the repack buffer type on NUMA architectures
Labels: ggml
#18698 opened Jan 8, 2026 by zzjianhui
ggml-cuda: extend concat support for more types
Labels: ggml, Nvidia GPU
#18690 opened Jan 8, 2026 by Lourdle
model: try to improve Qwen3 Next
Labels: model, python
#18683 opened Jan 8, 2026 by ngxson
vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id)
Labels: ggml, testing, Vulkan
#18678 opened Jan 7, 2026 by jeffbolznv
Autoparser - complete refactoring of parser architecture
Labels: documentation, examples, model, python, script, server, testing
#18675 opened Jan 7, 2026 by pwilkin Draft
MCP MVP
Labels: enhancement, examples, server/webui, server
#18655 opened Jan 7, 2026 by allozaur Draft
docs: update ops.md for CANN backend
Labels: documentation
#18654 opened Jan 7, 2026 by hipudding
CANN: support gated linear attn
Labels: Ascend NPU, ggml
#18653 opened Jan 7, 2026 by hipudding