|
11 | 11 | ### Features: |
12 | 12 | ### Bugfixes: |
13 | 13 |
|
14 | | -## 1.16.0-rc5 (April 02, 2024) |
15 | | -### Features: |
16 | | -#### UCS |
17 | | -* Added support for PCIe gen5 bandwidth detection |
18 | | -### Bugfixes: |
19 | | -#### UCP |
20 | | -* Fixed rndv_put transport selection for device to device scenario |
21 | | -#### RDMA CORE (IB, ROCE, etc.) |
22 | | -* Disabled MR multithreading registration |
23 | | - |
24 | | -## 1.16.0-rc4 (February 21, 2024) |
25 | | -### Bugfixes: |
26 | | -#### UCP |
27 | | -* Disabled rendezvous pipeline protocol selection when using non-contiguous buffer |
28 | | -#### RDMA CORE (IB, ROCE, etc.) |
29 | | -* Fixed mlx5 WQE posting error due to compiler memory copy optimizations |
30 | | -#### GPU (CUDA, ROCM) |
31 | | -* Fixed cuda_ipc transport being disabled if a CUDA device is not set during initialization |
32 | | -#### UCM |
33 | | -* Fixed compilation error when building on PPC64 |
34 | | -#### Packaging |
35 | | -* Fixed already existing target error when using cmake find_package(ucx) twice |
36 | | - |
37 | | -## 1.16.0-rc3 (February 20, 2024) |
38 | | -### Bugfixes: |
39 | | -#### UCP |
40 | | -* Fixed crash in rendezvous protocol rkey pack after failed memory registration |
41 | | - |
42 | | -## 1.16.0-rc2 (January 21, 2024) |
| 14 | +## 1.16.0 (April 15, 2024) |
43 | 15 | ### Features: |
44 | 16 | #### UCP |
45 | 17 | * Added tag offload rendezvous protocol in new infrastructure |
|
86 | 58 | * Added support for VLAN over channel bonding interface |
87 | 59 | * Added LRU cache and Usage Tracker datastructures |
88 | 60 | * Improved cross-NUMA device detection |
| 61 | +* Added support for PCIe gen5 bandwidth detection |
89 | 62 | #### Build |
90 | 63 | * Added LCOV coverage report as a build option |
91 | 64 | * Added binutils 2.40 library dependencies |
|
125 | 98 | * Fixed memory corruption by proper memh handling in tag offload rendezvous |
126 | 99 | * Changed default allocator to not use reserved huge pages |
127 | 100 | * Fixed rndv put protocol to avoid early completion |
| 101 | +* Fixed rndv_put transport selection for device to device scenario |
| 102 | +* Disabled rendezvous pipeline protocol selection when using non-contiguous buffer |
| 103 | +* Fixed crash in rendezvous protocol rkey pack after failed memory registration |
128 | 104 | #### RDMA CORE (IB, ROCE, etc.) |
129 | 105 | * Fixed compilation failure when DevX is explicitly disabled |
130 | 106 | * Fixed crash when using PCIe relaxed ordering |
|
133 | 109 | * Fixed assertion failure when configured with UCX_IB_ADDR_TYPE=ib_global |
134 | 110 | * Fixed overwritten MD attribute capabilities when querying a device |
135 | 111 | * Fixed ibv_reg_mr error by registering memory in rcache callback |
| 112 | +* Disabled MR multithreading registration |
| 113 | +* Fixed mlx5 WQE posting error due to compiler memory copy optimizations |
136 | 114 | #### TCP |
137 | 115 | * Fixed assymetric lanes selection issue due to inconsistent device listing |
138 | 116 | #### GPU (CUDA, ROCM) |
139 | 117 | * Fixed compilation flags to support ROCm 6.0 |
140 | 118 | * Fixed values of D2H_THRESH and latencey params |
141 | 119 | * Fixed Cuda memory support for iov datatype |
142 | 120 | * Increased max number of agents in ROCm |
| 121 | +* Fixed cuda_ipc transport being disabled if a CUDA device is not set during initialization |
143 | 122 | #### Shared Memoey |
144 | 123 | * Fixed posix and cma transport selection by enhancing reachability checks |
145 | 124 | * Fixed UGNI build failure |
|
153 | 132 | * Fixed floating point division by zero during protocols initialization |
154 | 133 | #### UCM |
155 | 134 | * Fixed occasional crash in bisto hooks by adding a lock before hooking |
| 135 | +* Fixed compilation error when building on PPC64 |
156 | 136 | #### Java |
157 | 137 | * Fixed go tests by setting CUDA device before allocating CUDA memory |
158 | 138 | * Fixed perftest error detection and hanging issue |
|
164 | 144 | * Fixed ROCm building and testing |
165 | 145 | * Removed libnvidia-compute version dependency |
166 | 146 | * Removed libibmad/libumad from default build configuration to avoid runtime dependency |
| 147 | +#### Packaging |
| 148 | +* Fixed already existing target error when using cmake find_package(ucx) twice |
167 | 149 |
|
168 | 150 | ## 1.15.0 (September 28, 2023) |
169 | 151 | ### Features: |
|
0 commit comments