Allow for larger scale submissions in Inference when moving from Preview to Available

A number of Preview systems in MLPerf Inference v4.0 used fewer cards than would be typical in production due to a limited availability of cards at the time. Rather than benchmarking the systems with exactly the same, atypical number of cards as in Preview, it would be desirable to benchmark them in a more typical configuration, with a higher number of cards. Of course, for Available submissions the performance **per accelerator** would still need to be demonstrated to be equal or better than in Preview submissions.

We have a similar provision in the submission policies, but at the moment it only covers Training:

> On each of the benchmarks that are previewed and are Compatible, the Available submission must show equal or better performance (allowing for noise, for any changes to the benchmark definition) on *all* systems for Inference and across at least the smallest and the largest scale of the systems used for Preview submission on that benchmark for Training (e.g. Available Training submissions can be on scales smaller than the smallest and larger than the largest scale used for Preview submission).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow for larger scale submissions in Inference when moving from Preview to Available #176

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Allow for larger scale submissions in Inference when moving from Preview to Available #176

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions