
Conversation

@irisTa56

What this does

Adds gradient accumulation support to the training script.
This allows simulating larger batch sizes without increasing GPU memory usage, which is useful when training large models or when memory is limited.

  • Added gradient_accumulation_steps configuration parameter to TrainPipelineConfig (default: 1)
  • Updated update_policy() to use the accelerator.accumulate() context manager (see the sketch after this list)
  • Updated the effective batch size calculation in logging and MetricsTracker
  • Training loop now skips evaluation and checkpointing during intermediate accumulation steps, so a training step is only counted when an optimizer step actually runs
  • Removed the unused lock parameter from update_policy(), as it was not used anywhere in the codebase
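As a rough sketch of the accumulation-aware update (hypothetical; the actual update_policy() signature and return values in lerobot may differ), assuming the training script creates Accelerator(gradient_accumulation_steps=cfg.gradient_accumulation_steps) and prepares the policy and optimizer with it:

# Hypothetical sketch, not the exact lerobot implementation.
def update_policy(policy, batch, optimizer, accelerator):
    policy.train()
    with accelerator.accumulate(policy):
        loss, output_dict = policy.forward(batch)
        # accelerator.backward() scales the loss by 1 / gradient_accumulation_steps,
        # and the prepared optimizer skips stepping until enough batches have accumulated.
        accelerator.backward(loss)
        optimizer.step()
        optimizer.zero_grad()
    # sync_gradients is True only on the batch that triggers a real optimizer step;
    # the training loop can use it to decide when to log, evaluate, or checkpoint.
    return loss.item(), accelerator.sync_gradients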

How it was tested

  • Added tests in tests/training/test_update_policy.py.
    • test_update_policy_sync_gradients: Verifies gradient sync behavior
    • test_update_policy_gradient_accumulation_equivalence: Validates that accumulating over several micro-batches matches a single large-batch optimizer step (a simplified standalone sketch follows this list)
  • Added test_metrics_tracker_step_with_accelerator in tests/utils/test_logging_utils.py
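The idea behind the equivalence check, reduced to a standalone sketch with a toy linear model (illustrative only; the repository's actual test goes through update_policy() and the Accelerator):

import torch
import torch.nn as nn

torch.manual_seed(0)
model_a, model_b = nn.Linear(4, 1), nn.Linear(4, 1)
model_b.load_state_dict(model_a.state_dict())
opt_a = torch.optim.SGD(model_a.parameters(), lr=0.1)
opt_b = torch.optim.SGD(model_b.parameters(), lr=0.1)

x, y = torch.randn(8, 4), torch.randn(8, 1)

# One large-batch step on all 8 samples.
nn.functional.mse_loss(model_a(x), y).backward()
opt_a.step()

# Two accumulated micro-batches of 4, each loss scaled by 1 / num_micro_batches.
for xb, yb in zip(x.chunk(2), y.chunk(2)):
    (nn.functional.mse_loss(model_b(xb), yb) / 2).backward()
opt_b.step()

# Parameters should match up to floating-point tolerance.
for p_a, p_b in zip(model_a.parameters(), model_b.parameters()):
    torch.testing.assert_close(p_a, p_b)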

How to check out & try (for the reviewer)

# Effective batch size: batch_size × num_processes × gradient_accumulation_steps = 8 × 1 × 4 = 32
lerobot-train \
  --dataset.repo_id=lerobot/svla_so101_pickplace \
  --policy.type=act \
  --policy.repo_id=foo/my_policy \
  --policy.push_to_hub=false \
  --batch_size=8 \
  --gradient_accumulation_steps=4 \
  --log_freq=1 \
  --steps=50

irisTa56 marked this pull request as ready for review December 13, 2025 16:50
Copilot AI review requested due to automatic review settings December 13, 2025 16:50

Copilot AI left a comment


Pull request overview

This PR adds gradient accumulation support to the training pipeline, enabling simulation of larger batch sizes without increasing GPU memory usage. The implementation leverages Accelerator's built-in gradient accumulation features.

Key changes:

  • Added gradient_accumulation_steps configuration parameter (default: 1) to control how many batches to accumulate before performing an optimizer step
  • Updated training loop to properly handle gradient synchronization, skipping evaluation/checkpointing during accumulation steps
  • Modified effective batch size calculations throughout the codebase to account for gradient accumulation

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Summary per file

  • src/lerobot/configs/train.py: Adds the gradient_accumulation_steps configuration parameter with documentation
  • src/lerobot/scripts/lerobot_train.py: Updates the training loop and update_policy() to use Accelerator's accumulation context, removes the unused lock parameter, and adds gradient sync checks
  • src/lerobot/utils/logging_utils.py: Updates MetricsTracker.step() to calculate the effective batch size including gradient accumulation (see the sketch after this list)
  • tests/training/test_update_policy.py: Adds comprehensive tests for gradient sync behavior and mathematical equivalence
  • tests/utils/test_logging_utils.py: Adds a test for MetricsTracker with gradient accumulation
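A rough idea of how that effective batch size could factor into the tracker (illustrative names only; MetricsTracker's real constructor and fields may differ):

# Illustrative sketch, not the actual MetricsTracker API.
class MetricsTracker:
    def __init__(self, batch_size: int, num_processes: int = 1, gradient_accumulation_steps: int = 1):
        # One tracked step corresponds to one optimizer step, i.e. one effective batch.
        self.effective_batch_size = batch_size * num_processes * gradient_accumulation_steps
        self.steps = 0
        self.samples = 0

    def step(self) -> None:
        self.steps += 1
        self.samples += self.effective_batch_size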
