Skip to content

4.0.x FeatureList

Geoff Paulsen edited this page Jun 12, 2018 · 34 revisions

Initial list is based on discussion from the Dallas March 2018 face-to-face meeting.

Release Managers

  • Geoff Paulsen: IBM
  • Howard Pritchard: LANL

Timeframe

  • June 12, 2018 - Readiness Review - Review unfinished features on feature list to raise visibility, get rough sense of is/is not and evaluate readiness.
  • July 15, 2018 - Branch from master No new features on release branch (possible whitelist exceptions) past branch point.
  • August 13, 2018 - Aim to create RC1 (shoot for 2 week RC intervals)
  • Sept 17, 2018 - Release.

New features proposed for v4.0.x:

  • (UTK) Better multithreading. - George
    • In OB1 PML, normal OMP parallel Sections. Improved for injection and extraction rates.
    • Implications for other PMLs. Very OB1 specific Maybe a little bit in progress.
  • (UTK) UCT BTL (i.e., lower layer of UCX) - George
    • This allows us to remove openib BTL
  • (LANL) New iWARP-only BTL (i.e., stripped-down version of openib)
  • Latest PMIx - Need Owner
    • PMIx 3.0.x slated for summer 2018 (well how much do we really need this?) Can ULFM just use external PMIx?
    • Include PMIx debugger support
      • This allows us to start deprecating MPIR (i.e., add to NEWS -- next major version can output warnings when MPIR is used).
    • Add static endpoint integration for BTL/MTLs where supported
  • Support for hwloc v2.0.x
    • Jeff and/or Geoff
  • (UTK) ULFM support via new MPIX functions. Most is in MPIX, but some in PML.
    • Depends on PMIx v3.x
  • (UTK) SPC - Software Performance Counters (looks good)
  • Possible MPI-3.2 functionality (depends on the MPI Forum)
    • (Fujitsu) Persistent collective operations (with MPIX_ prefix)
  • Change defaults for embedding libevent / hwloc (see this issue) - Jeff
  • OSHMEM - Mellanox
    • Update to v1.4 API
    • Do not build OSHMEM if a viable SPML cannot be built
  • Remove the MXM MTL - Brian
  • Update ROMIO - Looking for owner (giles, or Geoff)
  • (IBM) -prot and -entry functionality - Geoff
  • Improved performance for single and multi-threaded RMA when using osc/rdma. - Nathan
  • Add support for SLURM 17.11's new MPMD feature - Howard
  • OMPIO new vulcan fcoll component - Edgar
  • OMPIO Cuda buffer support - Edgar
  • OMPIO Luster component updates - Edgar
    • Might not be ready to make OMPIO the default on Lustre in time for v4.0
  • Want Nathan's fix for Vader and other BTL to allow us to have SOMETHING for OSC_RDMA for one-sided + MT runs.
Clone this wiki locally