Pull requests: THUDM/slime
Pull requests list
fix: propagate moe_token_dispatcher_type in bridge model provider
#1737
opened Mar 18, 2026 by
nanjiangwill
fix: resolve rope_theta from rope_parameters in DeepseekV32Bridge
#1734
opened Mar 17, 2026 by
stevewx
3 tasks done
chore: translate remaining Chinese comments to English
#1726
opened Mar 15, 2026 by
WangHong-yang
fix: http_utils. disable system proxy for internal SGLang httpx clients
#1714
opened Mar 12, 2026 by
DongzhuoranZhou
Add Mooncake Backend for Rollout Data Transfer
run-ci-megatron
#1709
opened Mar 11, 2026 by
zxpdemonio
6 tasks done
[WIP] fix(cp): wrap linear attention CP in custom autograd.Function
#1692
opened Mar 9, 2026 by
lilei199908
fix: normalize rewards per-group when sample counts are unequal
#1655
opened Mar 2, 2026 by
dubin555
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654
opened Mar 2, 2026 by
tourzhao
3 tasks
Fix the Rotary Position Embedding (RoPE) parameter passing in the GLM5 mode
#1650
opened Mar 2, 2026 by
hanxdmech-ship-it
[WIP] fix transformers api change at 5.2.0
run-ci-megatron
#1647
opened Feb 28, 2026 by
UbeCc
feat: add --lazy-multimodal-load to defer image process to rollout time
#1623
opened Feb 25, 2026 by
yzlnew