Add gradient_accumulation_steps to pretrain/train API by tetelias · Pull Request #743 · lightly-ai/lightly-train

tetelias · 2026-05-25T13:11:31Z

What has changed and why?

Summary

Adds gradient_accumulation_steps to the pretrain/train API as a convenience alias for PyTorch Lightning's accumulate_grad_batches.

This makes the pretraining API more consistent with the task-specific training APIs while preserving the existing trainer_args escape hatch.

Changes

add gradient_accumulation_steps parameter
map to Trainer(accumulate_grad_batches=...)
add validation/conflict handling
add tests

Example

lightly_train.pretrain(
    ...,
    batch_size=8,
    gradient_accumulation_steps=4,
)

Equivalent to:

lightly_train.pretrain(
    ...,
    trainer_args={
        "accumulate_grad_batches": 4,
    },
)

Reasoning

This resolves #35

How has it been tested?

In tests/_commands/test_train_helpers.py test_get_trainer was updated, test_get_trainer_gradient_accumulation and test_get_trainer_gradient_accumulation_conflict were added.
All of:
tests
ruff check .
ruff format .
pre-commit run --all-files
pass without errors.

Did you update CHANGELOG.md?

Yes
Not needed (internal change)

Did you update the documentation?

Yes
Not needed (internal change without effects for user)

chatgpt-codex-connector · 2026-05-25T13:11:36Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

CLAassistant · 2026-05-25T13:11:46Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ liopeer
❌ tetelias

tetelias seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

liopeer · 2026-05-25T17:35:24Z

Thanks for the contribution @tetelias! Could you sign the CLA?

tetelias · 2026-05-27T10:54:10Z

@liopeer I signed CLA and corrected failure on one of tests. Do you need to restart workflows approval?

liopeer

Thanks, that's really high quality work. If all the checks succeed, this is ready to merge!

liopeer · 2026-06-16T08:48:43Z

/review

liopeer

LGTM!

liopeer · 2026-06-16T08:53:47Z

@tetelias It looks like there are still issues with the CLA. Can you quickly check again if it is really signed?

DLemming · 2026-06-25T08:31:39Z

One thought: should accumulate_grad_batches also be included when computing global_batch_size?

Right now it's:

global_batch_size = args.batch_size * args.devices

but the effective batch size is really:

effective_batch_size = args.batch_size * args.devices * args.acc_grad_batches

Since global_batch_size is used for automatic LR scaling, exposing accumulate_grad_batches without accounting for it means users get a larger effective batch size but the LR is still scaled for the smaller one. As lightly-train is a high-level, worry-free wrapper around lightly, as a user I would expect to be accounted for correct learning rate scaling when simulating larger effective batch sizes.

One caveat though, global_batch_size is reused for non-LR purposes (e.g. steps_per_epoch = dataset_size // self.global_batch_size in dinov2.py:675 or throughput logging). So multiplying globally will have side effects and it's probably safer to pass through gradient_accumulation_steps to the lr_scale line specifically.

tetelias added 3 commits May 25, 2026 15:20

Add gradient_accumulation_steps to pretrain API

ab2a6f1

update to CHANGELOG.md

d8728ab

update to docs

49b4e29

Merge branch 'main' into gradient-accumulation

6fd0497

tetelias and others added 2 commits May 26, 2026 10:03

corrected src/lightly_train/_cli.py

8bf02a5

Merge branch 'main' into gradient-accumulation

7010623

Merge branch 'main' into gradient-accumulation

9d57296

liopeer reviewed May 29, 2026

View reviewed changes

tetelias and others added 3 commits May 29, 2026 17:29

corrected mistake in _cli.py and formatting

6950fe7

Merge branch 'main' into gradient-accumulation

f056046

Merge branch 'main' into gradient-accumulation

3cea6fb

liopeer approved these changes Jun 16, 2026

View reviewed changes

add type to docstring

d40bbc6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add gradient_accumulation_steps to pretrain/train API#743

Add gradient_accumulation_steps to pretrain/train API#743
tetelias wants to merge 11 commits into
lightly-ai:mainfrom
tetelias:gradient-accumulation

tetelias commented May 25, 2026

Uh oh!

chatgpt-codex-connector Bot commented May 25, 2026

Uh oh!

CLAassistant commented May 25, 2026 •

edited

Loading

Uh oh!

liopeer commented May 25, 2026

Uh oh!

tetelias commented May 27, 2026

Uh oh!

liopeer left a comment

Uh oh!

liopeer commented Jun 16, 2026

Uh oh!

liopeer left a comment

Uh oh!

liopeer commented Jun 16, 2026

Uh oh!

DLemming commented Jun 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

tetelias commented May 25, 2026

What has changed and why?

Summary

Changes

Example

Reasoning

How has it been tested?

Did you update CHANGELOG.md?

Did you update the documentation?

Uh oh!

chatgpt-codex-connector Bot commented May 25, 2026

Uh oh!

CLAassistant commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

liopeer commented May 25, 2026

Uh oh!

tetelias commented May 27, 2026

Uh oh!

liopeer left a comment

Choose a reason for hiding this comment

Uh oh!

liopeer commented Jun 16, 2026

Uh oh!

liopeer left a comment

Choose a reason for hiding this comment

Uh oh!

liopeer commented Jun 16, 2026

Uh oh!

DLemming commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

CLAassistant commented May 25, 2026 •

edited

Loading

DLemming commented Jun 25, 2026 •

edited

Loading