Conversation

Copilot AI commented Jan 15, 2026

Responded to reviewer question about page attention unit test coverage for the new paged_attention_common API.

Analysis Provided

  • Confirmed op_tests/test_pa.py includes comprehensive tests for the new API via run_aiter_common()
  • Tests validate correctness against golden outputs for both unquantized and quantized KV cache paths
  • Tests cover per-tensor and per-token quantization with proper scale tensor handling for both HIP and ASM backends
  • Noted tests require ROCm/AMD GPU hardware (MI325/MI355) and run via CI pipeline
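The difference between per-tensor and per-token quantization mentioned above can be sketched in plain NumPy. This is an illustration only; the helper name `quantize_kv` and its shapes are hypothetical and do not reflect aiter's actual test utilities, which run on ROCm hardware.

```python
import numpy as np

def quantize_kv(kv: np.ndarray, mode: str = "per_tensor"):
    """Quantize a [num_tokens, head_dim] KV slice to int8.

    Hypothetical helper for illustration; returns (quantized, scales).
    per_tensor -> one scale for the whole tensor (scales shape (1,)),
    per_token  -> one scale per token row (scales shape (num_tokens, 1)).
    """
    if mode == "per_tensor":
        scale = np.maximum(np.abs(kv).max() / 127.0, 1e-8)
        scales = np.array([scale], dtype=np.float32)
    elif mode == "per_token":
        scale = np.maximum(np.abs(kv).max(axis=1, keepdims=True) / 127.0, 1e-8)
        scales = scale.astype(np.float32)
    else:
        raise ValueError(f"unknown mode: {mode}")
    q = np.clip(np.round(kv / scale), -128, 127).astype(np.int8)
    return q, scales

kv = np.random.default_rng(0).standard_normal((4, 8)).astype(np.float32)
q_t, s_t = quantize_kv(kv, "per_tensor")
q_k, s_k = quantize_kv(kv, "per_token")
```

A golden-output test would then compare the dequantized values (`q * scales`) against the float reference within the quantization error bound.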

No Code Changes

This PR contains no code modifications; it only clarifies that existing test coverage validates the paged attention implementation introduced in the original commits.


Sergey Solo and others added 12 commits January 12, 2026 16:17
Inference engines should now call paged_attention_common with the shuffled KV cache layout; aiter will internally decide between the ASM and HIP kernels. HIP is more performant at lower concurrencies (< 128). A unit test has also been updated to cover the new interface.

Note that the HIP kernel does not support shuffled scales, so requests are always redirected to the ASM kernel when the KV cache is in int8 or fp8 format.
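The selection policy described in this commit message can be sketched as a small dispatch heuristic. The names below (`KVCacheDtype`, `select_pa_kernel`, `HIP_CONCURRENCY_LIMIT`) are illustrative assumptions, not aiter's actual API; the real decision happens inside paged_attention_common.

```python
# Hypothetical sketch of the kernel-selection heuristic described above.
from enum import Enum

class KVCacheDtype(Enum):
    FP16 = "fp16"
    INT8 = "int8"
    FP8 = "fp8"

# Per the commit note, HIP outperforms ASM at lower concurrencies.
HIP_CONCURRENCY_LIMIT = 128

def select_pa_kernel(batch_size: int, kv_dtype: KVCacheDtype) -> str:
    # Quantized (int8/fp8) KV caches use shuffled scales, which the HIP
    # kernel does not support, so those always take the ASM path.
    if kv_dtype in (KVCacheDtype.INT8, KVCacheDtype.FP8):
        return "asm"
    # For unquantized caches, prefer HIP at low concurrency.
    return "hip" if batch_size < HIP_CONCURRENCY_LIMIT else "asm"
```

Callers never pick a kernel directly; they pass the shuffled KV cache to the common entry point and the library applies a policy like this one.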
Copilot AI changed the title [WIP] Implement API for switching between ASM and HIP kernels Address PR review comments: confirm unit test coverage Jan 15, 2026
Copilot AI requested a review from fsx950223 January 15, 2026 04:17
Base automatically changed from common_hip_asm_pa_inerface to main January 16, 2026 02:53
