@@ -11,17 +11,92 @@ v1.19.0, Fri Sep 1, 2023
1111
1212## Core
1313
14+ - General code cleanup and restructuring
15+ - Add ofi_hmem_any_ipc_enabled()
16+ - ofi_consume_iov allows 0-byte consume
17+ - ofi_consume_iov consistency
18+ - ofi_indexer: return error code when iterating
19+ - getinfo: Add post filters for domain and fabric names
20+ - Filter loopback device if iface is specified
21+ - bsock: Fix error checking for -EAGAIN
22+ - windows/osd: Remove unneeded check to silence coverity
23+ - windows/osd: Move variable declaration to silence coverity
24+ - Introduce gdrcopy awareness to hmem copy
25+ - mr/cache: Fix fi_mr_info initialization
26+ - hmem_cuda: remove gdrcopy from cuda hmem copy path
27+ - iouring: Fix wrong indent in ofi_sockapi_accept_uring()
28+ - Implement ofi_sockctx_uring_poll_add()
29+ - hmem: introduce gdrcopy from/to cuda iov functions
30+ - hmem: Deprecate `FI_HMEM_CUDA_ENABLE_XFER`
31+ - hmem_cuda: Restrict CUDA IPC based on peer accessibility
32+ - hmem_cuda: Log number of CUDA devices detected
33+ - hmem_cuda: Refactor global variables
34+ - tostr: Remove the extra dir "shared/" from "include/" and "src/".
35+ - hmem_ze: fix ZE is valid check
36+ - hmem_rocr: fix offset calculation
37+ - hmem_rocr: use ofi spinlock functions
38+ - hmem_rocr: minor fixes
39+ - hmem_neuron: convert warn to info for nrt_get_dmabuf_fd not found
40+ - hmem_neuron: check existence of neuron devices during initialization
41+ - tostr: Moved Windows functions in shared/ofi_str.c to windows/osd.h
42+ - tostr: Add helper functions ofi_tostr_size() and ofi_tostr_count().
43+
1444## EFA
1545
46+ - Onboard Peer API, use shm provider as a peer provider
47+ - Uses util SRX framework in shared receive procedures.
48+ - Register shm MR with hmem_data, allow shm to use gdrcopy for cuda data movement
49+ - Finish the refactor for rxr squash.
50+ - Use rdma-core WR API for send requests
51+ - Check optlen in getopt call
52+ - Fix the rdma-read support check in RMA and MSG operations
53+ - Optimize ep lock usage
54+ - Use an internal fi_mr_attr for memory registration
55+
1656## Hooks
1757
58+ - Init field in mr_attr to silence coverity
59+ - Add profiling hook provider
60+ - Rename cq hooking functions' names
61+ - Added trace for resource creation operations
62+
1863## OPX
1964
65+ - Initialize ofi_mr_info
66+ - Fix dput credit check
67+ - Only allocate replay buffer if psn is valid
68+ - Support SHM Intra-node communication between single server HFI devices
69+ - Fix incorrect packet size in packet header when sending CTS packet
70+ - Added check to address Coverity scan defect
71+ - Add multi-entry caching to TID rendezvous
72+ - Fall back to default domain name for TID fabric
73+ - Properly handle multiple IOVs in fi_opx_tsendmsg
74+ - Fix OPX Rzv RTS receive operation SHM error (DAOS-related)
75+ - Fix non-tagged sends incorrectly setting FI_TAGGED in send completions
76+ - Add more info to reliability IOV buffer validation check
77+ - Move dput packet build functions to new inline include
78+ - Use fi_mr_attr in fi_opx_mr
79+ - Disable Pre-NAKing by default, throttle until all outstanding replays ACK'd
80+ - Fix reliability bug when NAKing the last PSN
81+ - Update HeaderQ Register more frequently
82+ - No rbuf_wrap needed for expected receive (TID)
83+ - Fixes for Coverity scan issues
84+ - Enhanced tag matching
85+ - Tune expected recv for unaligned buffers
86+ - Observability: Add finer logging granularity
87+ - Reduce RTS immediate data and fix packet estimate for odd TID lengths
88+ - Add additional sources for FI_OPX_UUID
89+
2090## Peer
2191
92+ - Add cq_data to rx_entry, allow peer to modify on unexp
93+ - Introduce peer cntr API
94+ - Add foreach_unspec_addr API
95+ - Add size as an input of the get_tag op
96+
2297## PSM3
2398
24- ## RXM
99+ - Sync with IEFS 11.5.0.0.172
25100
26101## SHM
27102
@@ -48,14 +123,124 @@ v1.19.0, Fri Sep 1, 2023
48123
49124## TCP
50125
126+ - General code cleanups
127+ - Restrict which EPs can be opened per domain
128+ - Increase CM error debug output
129+ - Avoid calling close() on an invalid socket after accept error
130+ - Mark the EP as disconnected before flushing the queues
131+ - Add assertion failures for xnet_{monitor,halt}_sock
132+ - Disable ofi_dynpoll_wait() for non-blocking progress
133+ - Move PEP pollin operations to io_uring
134+ - Move EP poll operations to io_uring
135+ - Early exit if ofi_bsock_flush() has operation in progress
136+ - Implement pollin sockctx in bsock
137+ - Add missing call to xnet_submit_uring()
138+ - Add return error to xnet_update_pollflag()
139+ - Remove the cancel sockctx from the EP structure
140+ - Move io_uring cqe from the stack to progress struct
141+ - Reduce stack size for epoll event array
142+ - handle NULL av in xnet_freeall_conns()
143+
51144## UCX
52145
146+ - Publish FI_LOCAL_COMM and FI_REMOTE_COMM capabilities
147+ - Fix configure error with newer MOFED
148+ - Fix segfault in unsignalled completions
149+
53150## Util
54151
152+ - Add FI_PEER support to util counter
153+ - Refactor the usage of cntrs
154+ - Change util_ep to be a genlock
155+ - Add util shared receive implementation
156+ - Update log message for invalid AV type message
157+ - Fix fi_mr_info initialization
158+ - Add peer ID to MR cache
159+ - Store hmem_data in ofi_mr_map
160+ - Split the cq progress and reading entries in ofi_cq_readfrom
161+
162+ ## Verbs
163+
164+ - Add event lock to EQ to serialize closing ep
165+ - Remove saved_wc_list and use CQ directly
166+ - Consolidate peer_mem and dmabuf support check
167+ - Fix vrb_add_credits signature
168+ - Introduce new progress engine structure
169+ - Simplify (and correct) locking around progress operations
170+ - General code restructuring
171+
172+ ## Fabtests
173+
174+ - Fix reading addressing options
175+ - Allow changing only the OOB address
176+ - Allow using FI_ADDR_STR with -F
177+ - Fix bw buffer utilization
178+ - Separate RX and RMA counters
179+ - Fix tx counter with RMA
180+ - Add FI_CONTEXT mode to rdm_cntr_pingpong
181+ - Add HMEM support to fi_unexpected_msg test
182+ - Fix array OOB during fabtest list parsing
183+ - Enable shm tagged_peek test
184+ - Fix windows build warnings
185+ - Make tx_buf and rx_buf aligned to 64 bytes by default
186+ - Fix windows build warnings for sscanf
187+ - Use dummy ft_pin_core on macOS
188+ - Fix some header includes
189+ - sock_test: Do not use epoll if not available
190+ - recv_cancel: initialize error entry
191+ - Fix wrong size used to allocate tx_msg_buf
192+ - unexpected: change defaults to support tcp
193+ - unexpected: add unknown unexpected peer test
194+ - Enable a list of arbitrary message sizes
195+ - Enabled data validation for rma read & write
196+ - bw_rma operates on distinct buffer offsets
197+ - ft_post_rma issues reads from remote's tx_buf
198+ - General code cleanup and restructuring
199+ - rdm_tagged_peek: fix race condition synchronization
200+ - Add FI_LOCAL_COMM/FI_REMOTE_COMM presence check to fi_getinfo_test
201+ - Correct ft_exchange_keys in prefix-mode
202+ - Make rdm_tagged_peek test more general
203+ - Add unit test for fi_setopt
204+
205+ v1.18.2, Fri Sep 1, 2023
206+ ========================
207+
208+ ## Core
209+
210+ - Check for CUDA devices with nvmlDeviceGetCount_v2() first
211+ - Try libnvidia-ml.so.1 if .so symlink missing
212+ - Fix ssize_t format specifiers
213+
214+ ## EFA
215+
216+ - Remove rxr_rm_tx/rx_cq_check()
217+ - Report cntr completion for shm inject write
218+
219+ ## SHM
220+
221+ - Change recv entry freestack into bufpool
222+ - Load DSA dependency dynamically
223+
224+ ## TCP
225+
226+ - Fix missing iov truncation on saved message path
227+ - Add locking to trywait path for potential data race
228+ - Fix incorrect locking around MR operations
229+
230+ ## UCX
231+
232+ - Updated ucx.exclude and Makefile.am
233+
55234## Verbs
56235
236+ - Add additional checks to vrb_shutdown_qp_in_err
237+ - Prevent duplicate FI_SHUTDOWN events
238+ - Fix memory leak when creating EQ with unsupported wait object
239+
57240## Fabtests
58241
242+ - Extend the test_unexpected_msg
243+ - Rename dmabuf-rdma tests to prefix with xe
59244
60245v1.18.1, Fri Jun 30, 2023
61246=========================