Skip to content

Request to release 7B training hyperparameters #43

@Jerry-hyl

Description

@Jerry-hyl

Hi, thanks for releasing LIMO and the training configs for 32B!
In the paper you also reported experiments on 7B, but I couldn’t find the corresponding training parameters/configs in this repo.

Could you please share the training hyperparameters and config files for the 7B experiments (e.g., learning rate, batch size, optimizer settings, training steps, DeepSpeed config)? This would be very helpful for reproducibility and fair comparison.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions