Skip to content

Allow for larger scale submissions in Inference when moving from Preview to Available #176

Open
@psyhtest

Description

@psyhtest

A number of Preview systems in MLPerf Inference v4.0 used fewer cards than would be typical in production due to a limited availability of cards at the time. Rather than benchmarking the systems with exactly the same, atypical number of cards as in Preview, it would be desirable to benchmark them in a more typical configuration, with a higher number of cards. Of course, for Available submissions the performance per accelerator would still need to be demonstrated to be equal or better than in Preview submissions.

We have a similar provision in the submission policies, but at the moment it only covers Training:

On each of the benchmarks that are previewed and are Compatible, the Available submission must show equal or better performance (allowing for noise, for any changes to the benchmark definition) on all systems for Inference and across at least the smallest and the largest scale of the systems used for Preview submission on that benchmark for Training (e.g. Available Training submissions can be on scales smaller than the smallest and larger than the largest scale used for Preview submission).

Metadata

Metadata

Labels

Next MeetingItem to be discussed in the next Working Group

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions