Skip to content

[FSDP2/Megatron-FSDP/DCP] If model parameters are DTensors, optimizer states should also be DTensors.#2795

Open
cspades wants to merge 12 commits intoNVIDIA:mainfrom
cspades:cye/fused-adam-dcp
Open

[FSDP2/Megatron-FSDP/DCP] If model parameters are DTensors, optimizer states should also be DTensors.#2795
cspades wants to merge 12 commits intoNVIDIA:mainfrom
cspades:cye/fused-adam-dcp

Commits

Commits on Mar 31, 2026