Skip to content

test/unit/queue/queueTest fails on Radeon Pro W7800 (gfx1100) with ROCm 6.2.4 and later #2499

@fwyzard

Description

@fwyzard
$ ./test/unit/queue/queueTest 
Randomness seeded to: 1375303555
BEGIN not-awaited task in Queue 'alpaka::QueueGenericThreadsNonBlocking<alpaka::DevCpu>' (if there is no matching 'END' line, the task ran past program termination)
END not-awaited task in Queue 'alpaka::QueueGenericThreadsNonBlocking<alpaka::DevCpu>'
BEGIN not-awaited task in Queue 'alpaka::uniform_cuda_hip::detail::QueueUniformCudaHipRt<alpaka::ApiHipRt, false>' (if there is no matching 'END' line, the task ran past program termination)
END not-awaited task in Queue 'alpaka::uniform_cuda_hip::detail::QueueUniformCudaHipRt<alpaka::ApiHipRt, false>'

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
queueTest is a Catch2 v3.5.2 host application.
Run with -? for options

-------------------------------------------------------------------------------
enqueueBenchmark - alpaka::test::TestQueues - 0
-------------------------------------------------------------------------------
[...]
-------------------------------------------------------------------------------
enqueueBenchmark - alpaka::test::TestQueues - 1
-------------------------------------------------------------------------------
[...]
-------------------------------------------------------------------------------
enqueueBenchmark - alpaka::test::TestQueues - 2
-------------------------------------------------------------------------------
[...]
/data/user/fwyzard/hackathon/alpaka/test/unit/queue/src/QueueTest.cpp:256: FAILED:
due to a fatal error condition:
  SIGSEGV - Segmentation violation signal

===============================================================================
test cases: 35 | 34 passed | 1 failed
assertions: 65 | 64 passed | 1 failed

Segmentation fault (core dumped)

Configured and built with

cmake ../alpaka \
  -Dalpaka_ACC_CPU_B_SEQ_T_SEQ_ENABLE=ON \
  -Dalpaka_ACC_GPU_HIP_ENABLE=ON \
  -DCMAKE_HIP_PLATFORM=amd \
  -DCMAKE_HIP_ARCHITECTURES=gfx1100 \
  -DCMAKE_HIP_COMPILER=/opt/rocm-6.2.4/llvm/bin/clang++ \
  -DCMAKE_PREFIX_PATH=/opt/rocm-6.2.4/lib/cmake \
  -Dalpaka_BUILD_EXAMPLES=ON \
  -Dalpaka_BUILD_BENCHMARKS=ON \
  -DBUILD_TESTING=ON

make

Tested with ROCm 5.7.3, 6.2.4, 6.3.2, 6.4.0, using the AMD driver from 6.3.2 and 6.4.0.
The 5.7 build works, the 6.x builds fail as above.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions