GitHub - iHeartGraph/iSpan: Parallel and distributed computation for the strongly connected component (SCC).

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

Here are the Paper and Slides at SC'18.

Introduction

1. What is strongly connected component (SCC)?

In a directed graph, an SCC is a maximal subset of the vertices that every vertex has a directed path to all the others.

SCC detection will find all the SCCs in the directed graph. Each shaded area in the picture is an SCC.

Most real-world graphs have one large SCC that contains the majority of the vertices, as well as many small SCCs whose sizes are reversely proportional to the frequency of their occurrences. For both types of SCCs, current approaches that rely on depth or breadth first search (DFS and BFS) face the challenges of both strict synchronization requirement and high computation cost.

2. What is iSpan?

Motivated, we advocate a new paradigm of identifying SCCs with simple spanning trees, since SCC detection requires only the knowledge of connectivity among the vertices. We have developed a prototype called iSpan, which consists of parallel, relaxed synchronization construction of spanning trees for detecting the large and small SCCs, combined with fast trims for small SCCs.

We further scale iSpan to distributed memory system by applying different distribution strategies to the data and task parallel jobs.

The evaluations show that iSpan is able to significantly outperform current state-of-the-art DFS and BFS-based methods by average 18x and 4x, respectively.

Tutorial

You can find the source code of the shared-memory version in "src/", the source code of the distribute-memory version in "src_mpi", some useful scripts in "script/", and some test result under "result/"

Prerequisites

The following software are required, but the versions do not have to be the same. The versions listed are used in our experiments.

GCC-4.8.5
OpenMP-3.1
Open MPI-2.1.1
Makefile

Install and Run

Get into the source code directory, "src" for shared-memory, "src_mpi" for distributed-memory, then compile the source code with Makefile,

cd src/
make

If the prerequisites are correct, the make process should be good. You will get the executable file "ispan". Run "ispan" to see the parameters and use the correct ones. For simplicity, you can change the "bash_one.sh" file and run it.

./bash_one.sh

For the distributed-memory version, we used the clusters of GWU Colonial One, and MGHPCC. There is a script for running jobs on GWU Colonial One, named "run_batch.sh". You should write your own script for running on a different cluster.

Graph format

We are using compressed sparse row (CSR) format stored in binaries. We provide a converter from regular text edge list to our CSR binary format. One can find the converter under "graph_converter/".

Authors

Yuede Ji, email: [email protected]

Hang Liu, email: [email protected]

H. Howie Huang, email: [email protected]

Reference

If you use iSpan in your project, please cite the following paper.

@inproceedings{ji2018s,
    title={iSpan: parallel identification of strongly connected components with spanning trees},
    author={Ji, Yuede and Liu, Hang and Huang, H Howie},
    booktitle={Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis},
    pages={58},
    year={2018},
    organization={IEEE Press}
}

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
graph_converter		graph_converter
include		include
result		result
script		script
src		src
src_mpi		src_mpi
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

Introduction

1. What is strongly connected component (SCC)?

2. What is iSpan?

Tutorial

Prerequisites

Install and Run

Graph format

Authors

Reference

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

iHeartGraph/iSpan

Folders and files

Latest commit

History

Repository files navigation

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

Introduction

1. What is strongly connected component (SCC)?

2. What is iSpan?

Tutorial

Prerequisites

Install and Run

Graph format

Authors

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages