Loss broadcasting fix #347

laserkelvin · 2025-03-17T18:46:38Z

This PR attempts to correct broadcasting issues due to shape mismatches in loss calculations.

This was prompted by the realization that broadcasting behaves a little differently (compared to some time ago) when tensor shapes are mismatched. In particular, labels come out of the pipeline with an extra dimension (e.g. [N, 1]) compared to graph readouts (e.g. [N]). The two are broadcast against each other to [N, N], so the loss is averaged over every prediction/label pair rather than element-wise, which is very different from the intention:

>>> y
tensor([0.0551, 0.2665, 0.4638, 0.3288, 0.1201, 0.1515, 0.9187, 0.4527])
>>> x
tensor([[0.0961],
        [0.6194],
        [0.3628],
        [0.5289],
        [0.2046],
        [0.0989],
        [0.6621],
        [0.8717]])
>>> from torch.nn import MSELoss
>>> MSELoss()(y, x)
/python3.12/site-packages/torch/nn/modules/loss.py:610: UserWarning: Using a target size (torch.Size([8, 1])) that is different to the input size (torch.Size([8])). This will likely lead to incorrect results due to broadcasting. Please ensure they have the same size.
  return F.mse_loss(input, target, reduction=self.reduction)
tensor(0.1454)
>>> MSELoss()(y.view(-1, 1), x)
tensor(0.0535)

With these changes, the _compute_losses methods check whether the model output and label shapes are mismatched and, if they are, attempt to reshape the model outputs to match the labels' shape before computing the loss.
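
The check described above might look roughly like the following. This is a sketch under assumptions, not the code from this PR: match_label_shape is a hypothetical helper name, and refusing to reshape when the element counts differ is a design choice made here for illustration.

```python
import torch
from torch.nn import functional as F


def match_label_shape(pred: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: reshape pred to target's shape when they differ."""
    if pred.shape != target.shape:
        # Assumption: only reshape when no elements would be dropped or invented
        if pred.numel() != target.numel():
            raise ValueError(
                f"Cannot reshape prediction {tuple(pred.shape)} "
                f"to label shape {tuple(target.shape)}"
            )
        pred = pred.reshape(target.shape)
    return pred


pred = torch.rand(8)        # model output, shape [8]
target = torch.rand(8, 1)   # label, shape [8, 1]

aligned = match_label_shape(pred, target)
loss = F.mse_loss(aligned, target)  # shapes agree, so no broadcasting warning
```

Raising on a mismatched element count keeps a genuine shape bug from being silently reshaped away.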

Lee, Kin Long Kelvin added 2 commits March 17, 2025 10:48

refactor: attempting to reshape model outputs according to labels

875f630

refactor: updating shape checking for MaceEnergyForceTask as well

760cfe4

laserkelvin added the bug (Something isn't working) label Mar 17, 2025

laserkelvin requested a review from smiret-intel March 17, 2025 18:58

smiret-intel approved these changes Mar 17, 2025

laserkelvin merged commit 3d66b45 into IntelLabs:main Mar 18, 2025
2 of 3 checks passed
