nanoMPI

A minimal MPI implementation loosely based on Open MPI. nanoMPI has a few use cases:

First and foremost, nanoMPI is for educational purposes. Most of the code in production MPI libraries is dedicated to performance optimization, which makes them hard for a beginner to read. nanoMPI lets newcomers to distributed computing quickly find answers to questions like "how is a ring allreduce implemented?"
Developing distributed code on a local device is super convenient! It works offline and doesn't require job scheduling. One goal of this project is to let the authors develop distributed code on a local laptop.
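
As a taste of the "ring allreduce" question above, here is a minimal single-process sketch of the textbook algorithm (a reduce-scatter phase followed by an allgather phase). It is not nanoMPI's actual code: the function name `ring_allreduce_sum`, the flat `data` layout, and the requirement that `count` be divisible by `nranks` are all assumptions made for illustration, and the "sends" are simulated with plain array copies rather than real communication.

```c
#include <stddef.h>

/* Non-negative modulo, so indices like (r - 1) wrap around the ring. */
static int ring_mod(int a, int p) { return ((a % p) + p) % p; }

/*
 * Simulated ring allreduce (sum) across `nranks` ranks in one process.
 * data[r * count + i] is element i of rank r's buffer.
 * Assumes count is divisible by nranks; each rank's buffer is split
 * into nranks chunks of count / nranks elements.
 */
void ring_allreduce_sum(double *data, int nranks, int count)
{
    int chunk = count / nranks;

    /* Phase 1: reduce-scatter. In step s, rank r receives chunk
     * (r - 1 - s) from its left neighbour and adds it to its own copy.
     * After nranks - 1 steps, rank r holds the full sum of chunk r + 1. */
    for (int s = 0; s < nranks - 1; s++) {
        for (int r = 0; r < nranks; r++) {
            int src = ring_mod(r - 1, nranks);
            int c = ring_mod(src - s, nranks);
            for (int i = 0; i < chunk; i++)
                data[r * count + c * chunk + i] +=
                    data[src * count + c * chunk + i];
        }
    }

    /* Phase 2: allgather. In step s, rank r receives the already
     * complete chunk (r - s) from its left neighbour and overwrites
     * its own copy. */
    for (int s = 0; s < nranks - 1; s++) {
        for (int r = 0; r < nranks; r++) {
            int src = ring_mod(r - 1, nranks);
            int c = ring_mod(src + 1 - s, nranks);
            for (int i = 0; i < chunk; i++)
                data[r * count + c * chunk + i] =
                    data[src * count + c * chunk + i];
        }
    }
}
```

Within a step, the chunk a rank reads from its neighbour is never the chunk that neighbour is updating in the same step, which is why the simulation can update the buffers in place. Each rank moves 2 * (nranks - 1) chunks in total, which is where the bandwidth efficiency of the ring algorithm comes from.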

make is a tool that helps compile software. Central to make's operation is the Makefile, a text file located in the project's directory. The Makefile contains compilation instructions for make, including optimization settings, debugging options, and where to install components such as executables, documentation, and configuration files. make has a lot of convenience features such as only recompiling parts of the program that have changed since the last build, system-specific instructions, etc. My favorite resource on all things Makefile is at: https://makefiletutorial.com/

Use make to compile nanoMPI

Install the ssh server

sudo apt update
sudo apt install openssh-server

Enable it to start on boot

sudo systemctl enable ssh

Generate an ssh keypair using the Ed25519 algorithm. Accept the default location under ~/.ssh/ and leave the passphrase empty.

ssh-keygen -t ed25519

You should now have a private key (~/.ssh/id_ed25519) and public key (~/.ssh/id_ed25519.pub).

Add your key to the ssh-agent:

ssh-add ~/.ssh/id_ed25519

Test your installation by ssh-ing to your local machine. (Note: if you are using a remote server, such as a node on a cluster, first copy your public key into ~/.ssh/authorized_keys on that server. This stops it from asking for your password every time.)

ssh localhost

What is ssh? Why do we need it?

ssh (Secure Shell) is a cryptographic network protocol used for secure remote login over an unsecured network. We need SSH for several reasons:

Secure remote access: It allows users to securely log into and control remote systems.
File transfer: It enables secure file transfer between local and remote systems.
Port forwarding: It can securely tunnel other protocols through its encrypted connection.

ssh-agent is a program that holds private keys used for public key authentication in SSH. It's particularly useful because:

It saves you from typing your passphrase every time you use your SSH key.
It allows for single sign-on across multiple SSH sessions.

In the context of the provided commands:

We install the SSH server to allow incoming SSH connections to our machine.
We generate an ED25519 key pair for secure authentication.
We add the private key to ssh-agent to manage it securely and conveniently.
We test the setup by SSH-ing to localhost, which simulates connecting to a remote machine.

Repo Setup

Clone and build nanoMPI

git clone https://github.com/Quentin-Anthony/nanoMPI
cd nanoMPI
make

Add nanoMPI to your Linux environment:

export LD_LIBRARY_PATH=$PWD:$LD_LIBRARY_PATH

What are these Linux environment variables?

Linux environment variables are editable values that affect programs running on a system. They are part of the environment in which a process runs. There are a few important ones on Linux-based systems:

LD_LIBRARY_PATH: Tells the system where to look for shared libraries (e.g. libmpi.so) when executing programs at run-time.
PATH: Tells the system where to look for program binaries (e.g. mpirun)

And if you're compiling with gcc like us:

LIBRARY_PATH: Tells gcc where to look for libraries at link time. Note that LD_LIBRARY_PATH is used when your compiled program runs, to find shared libraries, whereas LIBRARY_PATH is used by gcc while building, to find the libraries that need to be linked into the program.
CPATH: Tells gcc where to look for include paths (i.e. header files) at compilation time.

You can access the values of environment variables using the $ character. Try running echo $LD_LIBRARY_PATH to see the library paths already exported!

The command export LD_LIBRARY_PATH=$PWD:$LD_LIBRARY_PATH does the following:

It adds the value of the current directory ($PWD, which is the nanoMPI directory) to the beginning of the existing LD_LIBRARY_PATH.
This allows the system to find and use shared libraries (libmpi.so) in the current directory when running programs later on (mpirun).
The export command makes this change available to all child processes of the current shell.
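
The same prepend-and-export step can be sketched from inside a C program using the POSIX `getenv` and `setenv` calls. This is only an illustration of the mechanism, not part of nanoMPI; the helper name `prepend_path` is hypothetical. Unlike the one-line shell command, it avoids leaving a trailing colon when the variable was previously unset or empty (the dynamic linker treats an empty entry as the current working directory).

```c
#define _POSIX_C_SOURCE 200809L
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Prepend `dir` to the colon-separated list stored in the environment
 * variable `name`. Returns 0 on success and -1 on failure.
 * Child processes started afterwards inherit the new value, just as
 * they would after `export` in a shell.
 */
int prepend_path(const char *name, const char *dir)
{
    const char *old = getenv(name);

    /* Unset or empty: the new value is just `dir`, with no colon. */
    if (old == NULL || old[0] == '\0')
        return setenv(name, dir, 1);

    size_t len = strlen(dir) + 1 + strlen(old) + 1; /* dir ':' old '\0' */
    char *joined = malloc(len);
    if (joined == NULL)
        return -1;

    snprintf(joined, len, "%s:%s", dir, old);
    int rc = setenv(name, joined, 1);
    free(joined); /* setenv stores its own copy of the string */
    return rc;
}
```

Calling `prepend_path("LD_LIBRARY_PATH", "/path/to/nanoMPI")` before launching a child process has the same effect on that child as the `export` line above, where the path shown is a placeholder.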

Running

Fill in the hostfile with the hosts you will run with, one per line:

echo -e "localhost\nlocalhost" > hostfile
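
To make the hostfile format concrete, here is a hedged sketch of how a launcher could read such a file, treating each non-empty line as one rank. This is an assumption about the format based on the example above, not a copy of nanoMPI's mpirun.c; `read_hostfile` and `MAX_HOST_LEN` are hypothetical names.

```c
#include <stdio.h>
#include <string.h>

#define MAX_HOST_LEN 256

/*
 * Read up to `max_hosts` host names from `fp`, one per line.
 * Blank lines are skipped. Returns the number of hosts read, which
 * in this sketch is also the number of ranks to launch.
 * Lines longer than MAX_HOST_LEN - 1 characters are not handled.
 */
int read_hostfile(FILE *fp, char hosts[][MAX_HOST_LEN], int max_hosts)
{
    char line[MAX_HOST_LEN];
    int n = 0;

    while (n < max_hosts && fgets(line, sizeof line, fp) != NULL) {
        line[strcspn(line, "\r\n")] = '\0'; /* strip the line ending */
        if (line[0] == '\0')
            continue;                       /* skip blank lines */
        strcpy(hosts[n], line);             /* fits: both are MAX_HOST_LEN */
        n++;
    }
    return n;
}
```

With the two-line hostfile created above, this would return 2, with both entries equal to "localhost".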

Run Basic Hello World With MPI:

./mpirun ./hostfile ./tests/test_hello

Which should output:

Hello world from rank 0 out of 2 processors
Hello world from rank 1 out of 2 processors

Run MPI_Allreduce Benchmark

./mpirun ./hostfile ./benchmarks/benchmark_allreduce

Which should output:

Message Size (bytes)      Latency (us)         Bus BW (MB/s)        Validation
8                         47.01                0.1702               PASS
16                        41.87                0.3822               PASS
32                        36.82                0.8692               PASS
64                        39.55                1.6180               PASS
128                       76.70                1.6689               PASS
256                       40.31                6.3505               PASS
512                       119.03               4.3014               PASS
1024                      93.79                10.9182              PASS
2048                      97.00                21.1136              PASS
4096                      49776.58             0.0823               PASS
8192                      149.62               54.7506              PASS
16384                     77.30                211.9644             PASS
32768                     264.47               123.9025             PASS
65536                     139.57               469.5430             PASS
131072                    269.17               486.9470             PASS
262144                    547.08               479.1712             PASS
524288                    1076.73              486.9248             PASS
1048576                   2896.33              362.0366             PASS
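
The Bus BW column can be reproduced from the other two columns. The sketch below assumes the benchmark follows the bus-bandwidth convention described in the NCCL page listed under Additional Resources, where the allreduce algorithm bandwidth (bytes divided by time) is scaled by 2(n-1)/n for n ranks. That assumption is consistent with the table: with 2 ranks the factor is 1, and 8 bytes / 47.01 us is about 0.1702 MB/s. The function name `allreduce_bus_bw` is hypothetical.

```c
#include <stddef.h>

/*
 * Bus bandwidth of an allreduce in MB/s (10^6 bytes per second).
 * bytes / latency_us is bytes per microsecond, which equals MB/s.
 * The 2(n-1)/n factor accounts for the data each rank must move
 * in an allreduce across nranks ranks.
 */
double allreduce_bus_bw(size_t bytes, double latency_us, int nranks)
{
    double alg_bw = (double)bytes / latency_us;
    double factor = 2.0 * (nranks - 1) / nranks;
    return alg_bw * factor;
}
```

For example, `allreduce_bus_bw(1048576, 2896.33, 2)` gives roughly 362.04, matching the last row of the table.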

Things To Be Aware Of

The MPI standard allows MPI_Recv to partially fill the posted recvbuf if an incoming message matches the source, tag, and comm. The implementation so far:

Requires each matching MPI_Send and MPI_Recv pair to use the same buffer size; otherwise the program may hang.
Ignores the tag.

These are todo items on the Roadmap.
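
For reference, the matching behaviour the standard describes can be sketched as follows. This is an illustration of the rule, not nanoMPI code and not a proposal for how the roadmap items should be implemented; `envelope_t`, `envelope_matches`, `deliver`, and the `SKETCH_ANY_*` constants are hypothetical stand-ins (a real implementation would use MPI_ANY_SOURCE and MPI_ANY_TAG and report the received count through the status object).

```c
#include <stddef.h>
#include <string.h>

#define SKETCH_ANY_SOURCE (-1) /* stand-in for MPI_ANY_SOURCE */
#define SKETCH_ANY_TAG    (-1) /* stand-in for MPI_ANY_TAG */

typedef struct {
    int source;  /* sending rank */
    int tag;     /* user-chosen message tag */
    int comm_id; /* identifies the communicator */
} envelope_t;

/* Returns 1 if an incoming message matches a posted receive. */
int envelope_matches(const envelope_t *posted, const envelope_t *incoming)
{
    if (posted->comm_id != incoming->comm_id)
        return 0;
    if (posted->source != SKETCH_ANY_SOURCE &&
        posted->source != incoming->source)
        return 0;
    if (posted->tag != SKETCH_ANY_TAG && posted->tag != incoming->tag)
        return 0;
    return 1;
}

/*
 * Copy a matched message into the receive buffer. A message shorter
 * than the buffer partially fills it; a longer one is a truncation
 * error. Returns the number of bytes copied, or -1 on truncation.
 */
long deliver(void *recvbuf, size_t recv_capacity,
             const void *msg, size_t msg_bytes)
{
    if (msg_bytes > recv_capacity)
        return -1;
    memcpy(recvbuf, msg, msg_bytes);
    return (long)msg_bytes;
}
```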

Additional Resources

This page from NCCL explains how to analyze the bandwidth of collectives
MPI Introduction - A quick introduction to common MPI concepts
Parallel Programming for Science and Engineering Book - A thorough coverage of MPI concepts. More of a "textbook for MPI"

Roadmap

Contributions are welcome! If you have something to add, open a PR!
