Drop inplace operation for loss computation with gradient accumulation (#35416)

Fix inplace loss computation
Quentin Gallouédec 2024-12-26 14:58:53 +01:00 committed by GitHub
parent 24c91f095f
commit 4eb17b26e7


@@ -3700,7 +3700,7 @@ class Trainer:
         else:
             # Finally we need to normalize the loss for reporting
             if num_items_in_batch is None:
-                loss /= self.args.gradient_accumulation_steps
+                loss = loss / self.args.gradient_accumulation_steps
             self.accelerator.backward(loss, **kwargs)
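
For background, here is a minimal sketch (illustrative only, using made-up tensors rather than the Trainer code) of why the out-of-place form can be preferable: the in-place division `loss /= n` mutates the tensor that other references may still point to, whereas `loss = loss / n` creates a new tensor and leaves existing references untouched. Autograd also rejects in-place updates on leaf tensors that require gradients.

# Illustrative sketch only (hypothetical tensors, not Trainer code).
import torch

x = torch.randn(4, requires_grad=True)
loss = x.sum()            # non-leaf tensor produced by the autograd graph
reported = loss           # e.g. a reference kept around for logging/reporting

loss /= 4                 # in-place: `reported` is silently scaled as well
assert reported is loss   # same object, same (now scaled) value

y = torch.randn(4, requires_grad=True)
loss2 = y.sum()
reported2 = loss2
loss2 = loss2 / 4         # out-of-place: new tensor, `reported2` is unchanged
assert reported2 is not loss2
loss2.backward()          # gradients flow through the new tensor as usual

# In-place ops are rejected outright on leaf tensors that require grad:
w = torch.tensor(2.0, requires_grad=True)
try:
    w /= 4
except RuntimeError as e:
    print(e)              # "a leaf Variable that requires grad is being used in an in-place operation"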