Description
We are trying to connect to a target using an IPv6 address over RoCEv2, and it does not work with the latest nvme-cli/libnvme versions. The same setup on the same server works with nvme-cli 1.16. We must stay on the latest version in order to use other features, so this is a blocking issue for us. Please correct me if the arguments I am using are wrong.
Discover works fine, but connect and connect-all do not even try to reach the target over the wire: we see no activity in the network trace.
This is all we see after the connect command is issued: "could not add new controller: failed to write to nvme-fabrics device"
dmesg shows: "nvme nvme0: rdma connection establishment failed (-104)"
The failing nvme-cli/libnvme versions are:
nvme version 2.11 (git 2.11+)
libnvme version 1.11.1 (git 1.11.1)
nvme version 2.10.2 (git 2.10.2)
libnvme version 1.10 (git 1.10)
Commands used to discover/connect with 2.10/2.11:
nvme discover -t rdma -a fe80::020c:caff:fe12:24c3%enp193s0f0np0 -s 4420
nvme connect-all -t rdma -a fe80::020c:caff:fe12:24c3%enp193s0f0np0 -s 4420 -i 1
Command used with the passing version, nvme-cli 1.16:
nvme connect-all -t rdma -a fe80::020c:caff:fe12:24c3%enp193s0f0np0 -s 4420 -i 1 -w fe80::966d:aeff:fe38:772e%enp193s0f0np0
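For reference, the -a and -w values above are link-local IPv6 addresses (fe80::/10) with a zone suffix after the "%" naming the local interface. A minimal sketch of how such a value splits into address and zone is below. The helper names split_scoped_addr and is_link_local are hypothetical, written only for illustration; they are not part of nvme-cli or libnvme and say nothing about how either tool parses the address.

```shell
# Hypothetical helpers for illustration only; not part of nvme-cli/libnvme.

# Split a scoped IPv6 literal such as "fe80::1%eth0" into "<address> <zone>".
# The zone is empty when the literal carries no "%" suffix.
split_scoped_addr() {
    scoped=$1
    case "$scoped" in
        *%*) addr=${scoped%%\%*}; zone=${scoped#*%} ;;
        *)   addr=$scoped; zone= ;;
    esac
    printf '%s %s\n' "$addr" "$zone"
}

# Succeed if the address starts with fe80 through febf, i.e. lies in fe80::/10.
# Link-local addresses are only meaningful together with an interface (zone).
is_link_local() {
    case "$1" in
        [fF][eE][89aAbB]?:*) return 0 ;;
        *) return 1 ;;
    esac
}

split_scoped_addr 'fe80::020c:caff:fe12:24c3%enp193s0f0np0'
```

Both the target address and the host address in the commands above use the same zone, enp193s0f0np0, since the zone always names an interface on the local host.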