CompositeLossMetrics now performs a weighted sum of losses. #1251
Merged
Contributor (Author):
@markblee Could you take a look? From 1399
markblee (Contributor) reviewed on Jun 10, 2025:
(Will approve after the internal review completes.)
Force-pushed from 00d1611 to 29f13f7.
Currently, `CompositeLossMetrics` sums the losses without considering their weights (i.e., the number of live targets). To make this a weighted sum, downstream code has been implementing `CompositeLossWeights` to inject the number of live targets into `loss_weights`. This amounts to patching surprising logic (the initial unweighted loss sum) with complex logic (`CompositeLossWeights`) to arrive at a straightforward one (a weighted sum). Therefore, we are changing the default loss aggregation logic to be straightforward from the start.

From now on, the standardized loss aggregation logic is:

```
loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(each_loss_weight * num_each_samples)
```

Historically, the complex logic was introduced because the weights of the losses returned by child metrics were unknown. Now that child metrics return losses as `WeightedScalar`, we can adopt a simpler, cleaner aggregation logic.

Note: an alternative formulation would be

```
loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(num_each_samples)
```

However, when `num_each_samples` is large and `each_loss_weight` is small, the denominator becomes disproportionately large relative to the numerator. So we discard this option.
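A minimal sketch of the new default aggregation, assuming child metrics report each loss as a `WeightedScalar(mean, weight)` where `weight` is the number of live targets. The simplified `WeightedScalar` and the `aggregate_loss` helper below are illustrative stand-ins, not the exact AXLearn API:

```python
from typing import NamedTuple


class WeightedScalar(NamedTuple):
    """Simplified stand-in for AXLearn's WeightedScalar: a mean and its weight."""

    mean: float
    weight: float


def aggregate_loss(
    child_losses: dict[str, WeightedScalar],
    loss_weights: dict[str, float],
) -> WeightedScalar:
    """Weighted sum of child losses.

    Implements:
        loss = sum(w_i * loss_i * n_i) / sum(w_i * n_i)
    where w_i is the configured loss weight and n_i the number of live targets.
    """
    numerator = sum(
        loss_weights[name] * ws.mean * ws.weight for name, ws in child_losses.items()
    )
    denominator = sum(
        loss_weights[name] * ws.weight for name, ws in child_losses.items()
    )
    return WeightedScalar(numerator / denominator, denominator)


# Example: two child losses with different numbers of live targets.
losses = {
    "lm": WeightedScalar(mean=2.0, weight=100.0),  # 100 live targets
    "aux": WeightedScalar(mean=0.5, weight=10.0),  # 10 live targets
}
weights = {"lm": 1.0, "aux": 0.1}
print(aggregate_loss(losses, weights))
# numerator   = 1.0 * 2.0 * 100 + 0.1 * 0.5 * 10 = 200.5
# denominator = 1.0 * 100 + 0.1 * 10 = 101.0  ->  mean ~= 1.985
```

Note how the denominator also carries the loss weights: this is what keeps a large-`n`, small-weight child loss (the discarded alternative formulation) from dominating the denominator.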
Force-pushed from 29f13f7 to 1bb0551.
Contributor (Author):
@markblee Could you take a look again? All reviewers approved internally at 23540.
markblee approved these changes on Jul 15, 2025.
loofahcus pushed a commit to loofahcus/axlearn that referenced this pull request on Oct 11, 2025:
…apple#1251)" (#1573). This reverts commit 343102a.