-
Notifications
You must be signed in to change notification settings - Fork 105
Description
Problem Description
Heyaa, we are encounting an interesting issue when we try to compile the AMDMiGraphX with our toolchain (clang++ & libc++). The clang++ version we are trying:
clang version 20.1.6 (https://github.com/llvm/llvm-project.git 47addd4540b4c393e478ba92bea2589e330c57fb)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /usr/bin
Before compiling we needed to change the symlink of the /usr/bin/ld
to /usrc/bin/ld.bfd
Everything compiles for:
- provided toolchain within
rocm-6.4.1
- our clang++ by not specifying libc++
The errors we are getting are particularly for some of the operators, and we are not even sure what the error is the error message is quite minimal:
In file included from /workdir/AMDMIGraphX/src/onnx/include/migraphx/onnx/op_parser.hpp:29:
In file included from /workdir/AMDMIGraphX/src/onnx/include/migraphx/onnx/onnx_parser.hpp:29:
In file included from /workdir/AMDMIGraphX/src/include/migraphx/program.hpp:29:
In file included from /workdir/AMDMIGraphX/src/include/migraphx/operation.hpp:35:
In file included from /workdir/AMDMIGraphX/src/include/migraphx/dyn_output.hpp:27:
In file included from /workdir/AMDMIGraphX/src/include/migraphx/shape.hpp:39:
In file included from /workdir/AMDMIGraphX/src/include/migraphx/serialize.hpp:28:
/workdir/AMDMIGraphX/src/include/migraphx/value.hpp:354:9: float_type, attr.floats());
| ^
warningwarning: s'switch' missing 'default' label [-Wswitch-default]
and I see above for two more operator
Hardware related infos.
OS:
NAME="Ubuntu"
VERSION="22.04.2 LTS (Jammy Jellyfish)"
CPU:
model name : AMD Ryzen 7 8700G w/ Radeon 780M Graphics
GPU:
Name: AMD Ryzen 7 8700G w/ Radeon 780M Graphics
Marketing Name: AMD Ryzen 7 8700G w/ Radeon 780M Graphics
Name: gfx1103
Marketing Name:
Name: amdgcn-amd-amdhsa--gfx1103
Name: amdgcn-amd-amdhsa--gfx11-generic
Any help would be appreciated!
Operating System
22.04.2 LTS (Jammy Jellyfish)
CPU
AMD Ryzen 7 8700G w/ Radeon 780M Graphics
GPU
Other
Other
Radeon 780M Graphics
ROCm Version
ROCm 6.0.0
Steps to Reproduce
Use this particular clang++ version to compile:
clang version 20.1.6 (https://github.com/llvm/llvm-project.git 47addd4540b4c393e478ba92bea2589e330c57fb)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /usr/bin
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
ROCk module version 6.12.12 is loaded
=====================
HSA System Attributes
=====================
Runtime Version: 1.15
Runtime Ext Version: 1.7
System Timestamp Freq.: 1000.000000MHz
Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model: LARGE
System Endianness: LITTLE
Mwaitx: DISABLED
XNACK enabled: NO
DMAbuf Support: YES
VMM Support: YES
==========
HSA Agents
==========
*******
Agent 1
*******
Name: AMD Ryzen 7 8700G w/ Radeon 780M Graphics
Uuid: CPU-XX
Marketing Name: AMD Ryzen 7 8700G w/ Radeon 780M Graphics
Vendor Name: CPU
Feature: None specified
Profile: FULL_PROFILE
Float Round Mode: NEAR
Max Queue Number: 0(0x0)
Queue Min Size: 0(0x0)
Queue Max Size: 0(0x0)
Queue Type: MULTI
Node: 0
Device Type: CPU
Cache Info:
L1: 32768(0x8000) KB
Chip ID: 0(0x0)
ASIC Revision: 0(0x0)
Cacheline Size: 64(0x40)
Max Clock Freq. (MHz): 4200
BDFID: 0
Internal Node ID: 0
Compute Unit: 16
SIMDs per CU: 0
Shader Engines: 0
Shader Arrs. per Eng.: 0
WatchPts on Addr. Ranges:1
Memory Properties:
Features: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: FINE GRAINED
Size: 48960044(0x2eb122c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 2
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED
Size: 48960044(0x2eb122c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 3
Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED
Size: 48960044(0x2eb122c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 4
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 48960044(0x2eb122c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
ISA Info:
*******
Agent 2
*******
Name: gfx1103
Uuid: GPU-XX
Marketing Name:
Vendor Name: AMD
Feature: KERNEL_DISPATCH
Profile: BASE_PROFILE
Float Round Mode: NEAR
Max Queue Number: 128(0x80)
Queue Min Size: 64(0x40)
Queue Max Size: 131072(0x20000)
Queue Type: MULTI
Node: 1
Device Type: GPU
Cache Info:
L1: 32(0x20) KB
L2: 2048(0x800) KB
Chip ID: 5567(0x15bf)
ASIC Revision: 12(0xc)
Cacheline Size: 128(0x80)
Max Clock Freq. (MHz): 2900
BDFID: 28672
Internal Node ID: 1
Compute Unit: 12
SIMDs per CU: 2
Shader Engines: 1
Shader Arrs. per Eng.: 2
WatchPts on Addr. Ranges:4
Coherent Host Access: FALSE
Memory Properties: APU
Features: KERNEL_DISPATCH
Fast F16 Operation: TRUE
Wavefront Size: 32(0x20)
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Max Waves Per CU: 32(0x20)
Max Work-item Per CU: 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 4294967295(0xffffffff)
y 4294967295(0xffffffff)
z 4294967295(0xffffffff)
Max fbarriers/Workgrp: 32
Packet Processor uCode:: 40
SDMA engine uCode:: 21
IOMMU Support:: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 24480020(0x1758914) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:2048KB
Alloc Alignment: 4KB
Accessible by all: FALSE
Pool 2
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED
Size: 24480020(0x1758914) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:2048KB
Alloc Alignment: 4KB
Accessible by all: FALSE
Pool 3
Segment: GROUP
Size: 64(0x40) KB
Allocatable: FALSE
Alloc Granule: 0KB
Alloc Recommended Granule:0KB
Alloc Alignment: 0KB
Accessible by all: FALSE
ISA Info:
ISA 1
Name: amdgcn-amd-amdhsa--gfx1103
Machine Models: HSA_MACHINE_MODEL_LARGE
Profiles: HSA_PROFILE_BASE
Default Rounding Mode: NEAR
Default Rounding Mode: NEAR
Fast f16: TRUE
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 4294967295(0xffffffff)
y 4294967295(0xffffffff)
z 4294967295(0xffffffff)
FBarrier Max Size: 32
ISA 2
Name: amdgcn-amd-amdhsa--gfx11-generic
Machine Models: HSA_MACHINE_MODEL_LARGE
Profiles: HSA_PROFILE_BASE
Default Rounding Mode: NEAR
Default Rounding Mode: NEAR
Fast f16: TRUE
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 4294967295(0xffffffff)
y 4294967295(0xffffffff)
z 4294967295(0xffffffff)
FBarrier Max Size: 32
*** Done ***
Additional Information
If you can provide any steps to be able to debug further such error, would be great! thanks