Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support qwen3.5 loss mask for multi-turn SFT
#1742 opened Mar 19, 2026 by huang3eng Loading…
(fix):not have encoder_only attr cause run failed
#1741 opened Mar 19, 2026 by wangyufak Loading…
fix: resolve rope_theta from rope_parameters in DeepseekV32Bridge
#1734 opened Mar 17, 2026 by stevewx Loading…
3 tasks done
[docker] fix qwen3_vl visual module loading
#1727 opened Mar 15, 2026 by ZHZisZZ Loading…
feat: add Qwen3.5-4B model support
#1721 opened Mar 13, 2026 by shihaohou Loading…
small fix on qwen3-235b-a22b launch script
#1719 opened Mar 12, 2026 by Zhuohao-Li Loading…
Add Mooncake Backend for Rollout Data Transfer run-ci-megatron
#1709 opened Mar 11, 2026 by zxpdemonio Loading…
6 tasks done
fix: auto-detect GPUs in qwen3-4b script
#1700 opened Mar 10, 2026 by ailuntz Loading…
fix: make ray actor gpu fractions configurable
#1699 opened Mar 10, 2026 by ailuntz Loading…
fix: accept unboxed math answers
#1698 opened Mar 10, 2026 by ailuntz Loading…
fix: default reward for aborted samples
#1697 opened Mar 10, 2026 by ailuntz Loading…
fix: handle missing sglang cuda-graph constant
#1696 opened Mar 10, 2026 by ailuntz Loading…
PipelineRL -- keep cache on weight update
#1694 opened Mar 9, 2026 by hari-hm Loading…
fix: quote $MOE_LAYER_FREQ
#1689 opened Mar 8, 2026 by lawrence-harmonic Loading…
internv3.5 support
#1660 opened Mar 3, 2026 by samaritan1998 Loading…
fix: normalize rewards per-group when sample counts are unequal
#1655 opened Mar 2, 2026 by dubin555 Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654 opened Mar 2, 2026 by tourzhao Loading…
3 tasks
Refactor code safety checks by removing patterns
#1643 opened Feb 28, 2026 by Rohan5commit Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.