Skip to content

Clarification on GPU count used in reward model training #21

@LAOS-Y

Description

@LAOS-Y

Are all the reward models trained with 32 GPUs in total, including models with single machine config, e.g., editscore_7B, editscore_qwen3_vl_4B_instruct, and editscore_qwen3_vl_8B_instruct?

Is all multi-machine reward training done with 2 nodes with 16 GPUs each?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions