UCX on AWS EFA instances validates the presence of the efa_nv_peermem module to detect whether GPUDirect RDMA is supported.
The efa_nv_peermem module is not always available (this depends on the OS/AMI), even when the instance does support GPUDirect RDMA.
- It looks like UCX does not actually use this module and relies on dmabuf instead.
- If that is the case, can we please remove this dependency and find another way to detect whether GPUDirect RDMA is supported? If needed, this could be made configurable via an environment variable for now.
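
To make the request concrete, here is a rough sketch of what an environment-variable override in front of the module check could look like. It is not UCX code: the variable name `EXAMPLE_EFA_GDR_SUPPORT`, the helper `gdr_supported`, and the "y"/"n" values are all made up for illustration, and the sysfs path passed in is an assumption about where the module would show up.

```c
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical override variable, NOT an existing UCX option. */
#define GDR_OVERRIDE_ENV "EXAMPLE_EFA_GDR_SUPPORT"

/*
 * Hypothetical helper: returns 1 if GPUDirect RDMA should be treated as
 * supported, 0 otherwise.
 *
 * module_path is the sysfs entry to probe when no override is set, for
 * example "/sys/module/efa_nv_peermem" (assumed location).
 */
static int gdr_supported(const char *module_path)
{
    const char *override = getenv(GDR_OVERRIDE_ENV);

    if (override != NULL) {
        if (strcmp(override, "y") == 0) {
            return 1; /* user forces GPUDirect RDMA on */
        }
        if (strcmp(override, "n") == 0) {
            return 0; /* user forces GPUDirect RDMA off */
        }
        /* Any other value falls through to auto-detection. */
    }

    /* Current-style detection: does the module entry exist? */
    return access(module_path, F_OK) == 0;
}
```

With something like this, setting the variable to "y" on an AMI that lacks efa_nv_peermem would skip the module check entirely, while leaving it unset keeps today's behavior. A dmabuf-based probe could later replace the `access()` call without changing the override.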