An Introduction to Topological Data Analysis

This repository contains a tutorial on Topological Data Analysis (TDA) prepared for the 2021 Midwest Big Data Summer School, held virtually in Ames, Iowa. The tutorial covers persistent homology and mapper, two of the main tools used in TDA.

The slides for this tutorial can be found here.

Vietoris-Rips persistence

The zip archive InteractiveJPDwB.zip contains three versions of the InteractiveJPDwB application, which illustrates the Vietoris-Rips construction on a point cloud. Versions are provided for Windows, macOS, and Linux; use the one that matches your operating system. Instructions for using the program can be found in its bottom panel.
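For readers who prefer code to an interactive demo, the construction can be sketched in a few lines of Python. This is only an illustration and is unrelated to how InteractiveJPDwB is implemented. The function name `rips_complex` is hypothetical, and the sketch assumes the convention that a simplex enters the complex once all pairwise distances between its vertices are at most `epsilon` (some references use balls of radius `epsilon`, which rescales the parameter by a factor of two).

```python
from itertools import combinations
from math import dist


def rips_complex(points, epsilon):
    """Return (edges, triangles) of the Vietoris-Rips complex at scale epsilon.

    Assumed convention: a simplex is included when every pairwise distance
    between its vertices is at most epsilon.
    """
    n = len(points)
    edges = [
        (i, j)
        for i, j in combinations(range(n), 2)
        if dist(points[i], points[j]) <= epsilon
    ]
    edge_set = set(edges)
    # A triangle is present exactly when all three of its edges are present.
    triangles = [
        (i, j, k)
        for i, j, k in combinations(range(n), 3)
        if (i, j) in edge_set and (i, k) in edge_set and (j, k) in edge_set
    ]
    return edges, triangles


# Four corners of a unit square.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
edges_small, triangles_small = rips_complex(square, 1.0)
edges_large, triangles_large = rips_complex(square, 1.5)
```

At scale 1.0 only the four sides of the square are present, so the complex is a loop. At scale 1.5 the diagonals appear as well and every triangle is filled in, so the loop is gone.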

Persistent Homology

For persistent homology, we use two implementations: scikit-tda and giotto-tda. Both packages are available on PyPI, and everything you need for the topological data analysis part of the tutorial can be installed with

   pip install scikit-tda giotto-tda

The tutorial also depends on other libraries, such as numpy and matplotlib, which I assume you already have installed.

Both scikit-tda and giotto-tda compute Vietoris-Rips persistent homology with code based on Ripser, a very efficient C++ implementation of persistence. If you don't want to install anything on your computer, you can go to live.ripser.org and upload the data sets there for easy persistence computations.
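To make concrete what a persistence computation returns, here is a minimal pure-Python sketch of Vietoris-Rips persistence in dimension 0 only. It is not how Ripser works internally and is no substitute for the packages above; it only illustrates the idea that connected components are born at scale 0 and die when an edge merges them. The function name `h0_persistence` is hypothetical, and the sketch assumes the edge length is used as the scale parameter.

```python
from itertools import combinations
from math import dist


def h0_persistence(points):
    """Return the finite (birth, death) pairs of 0-dimensional Rips persistence.

    Every point is born at scale 0. Edges are processed by increasing length,
    and a component dies whenever an edge merges two distinct components.
    The single component that never dies is omitted.
    """
    parent = list(range(len(points)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = sorted(
        (dist(points[i], points[j]), i, j)
        for i, j in combinations(range(len(points)), 2)
    )
    pairs = []
    for length, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            pairs.append((0.0, length))
    return pairs


# Two well-separated pairs of points on a line.
cloud = [(0.0,), (1.0,), (10.0,), (11.0,)]
diagram = h0_persistence(cloud)
```

For this toy cloud the diagram has two short-lived classes that die at scale 1 (each pair merging) and one long-lived class that dies at scale 9 (the two pairs merging), which is the kind of gap one looks for when reading a persistence diagram.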

There are three persistent homology notebooks for users to work through:

Introduction to persistent homology. A simple notebook with mostly synthetic data sets.
Differentiation using persistence landscapes. A notebook for distinguishing $S^2$ from $S^3$ using one-dimensional homology, highlighting the geometric aspects of persistence. This relies on persistence landscapes, one of the first vectorization schemes introduced for persistence diagrams.
MNIST using persistent homology. The most advanced notebook, combining cubical persistence with various vectorization schemes to build a digit classifier for the famous MNIST data set.
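As a rough picture of what a persistence landscape is, the sketch below evaluates landscape functions directly from a list of (birth, death) pairs. It assumes the standard tent-function definition and is not taken from the notebooks, which rely on library implementations; the function name `landscape` and the toy diagram are hypothetical.

```python
def landscape(diagram, k, t):
    """Evaluate the k-th persistence landscape function (k = 1, 2, ...) at t.

    Each (birth, death) pair contributes the tent function
    max(0, min(t - birth, death - t)). The k-th landscape is the k-th
    largest tent value at t, or 0 when there are fewer than k pairs.
    """
    tents = sorted(
        (max(0.0, min(t - birth, death - t)) for birth, death in diagram),
        reverse=True,
    )
    return tents[k - 1] if k <= len(tents) else 0.0


# A toy diagram with one long-lived and one short-lived class.
toy_diagram = [(0.0, 4.0), (1.0, 3.0)]
values_at_2 = [landscape(toy_diagram, k, 2.0) for k in (1, 2, 3)]
```

Sampling these functions on a grid of `t` values for the first few `k` turns a diagram into a fixed-length vector, which is what makes landscapes usable as features for standard machine learning models.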

Mapper

We use KeplerMapper for our mapper implementation. KeplerMapper is written in Python and is compatible with other machine learning packages, such as scikit-learn.

There is one mapper notebook for users to work through: Introduction to mapper. An elementary notebook with basic data sets to get accustomed to choosing filter functions, cover parameters, etc.
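To show how the filter function and cover parameters interact, here is a small self-contained sketch of the mapper construction for a one-dimensional filter. It does not use KeplerMapper and does not reproduce its API. The function `mapper_graph` and its parameters `n_intervals`, `overlap`, and `epsilon` are hypothetical names chosen for this illustration, and clustering is done by taking connected components of an epsilon-neighbor graph, which is one simple choice among many.

```python
from math import cos, dist, pi, sin


def mapper_graph(points, filter_values, n_intervals, overlap, epsilon):
    """Return (nodes, edges) of a one-dimensional mapper graph.

    nodes: list of frozensets of point indices, one per cluster.
    edges: set of pairs (a, b), a < b, of node indices sharing a point.
    """
    lo, hi = min(filter_values), max(filter_values)
    width = (hi - lo) / n_intervals
    pad = width * overlap / 2  # each interval is widened by pad on both sides
    nodes = []
    for m in range(n_intervals):
        left = lo + m * width - pad
        right = lo + (m + 1) * width + pad
        members = [i for i, f in enumerate(filter_values) if left <= f <= right]
        # Cluster the preimage: connected components of the epsilon-neighbor graph.
        unvisited = set(members)
        while unvisited:
            stack = [unvisited.pop()]
            cluster = set(stack)
            while stack:
                i = stack.pop()
                near = {j for j in unvisited if dist(points[i], points[j]) <= epsilon}
                unvisited -= near
                cluster |= near
                stack.extend(near)
            nodes.append(frozenset(cluster))
    # Two clusters are joined by an edge when they share at least one point.
    edges = {
        (a, b)
        for a in range(len(nodes))
        for b in range(a + 1, len(nodes))
        if nodes[a] & nodes[b]
    }
    return nodes, edges


# Twelve evenly spaced points on the unit circle, filtered by x-coordinate.
circle = [(cos(2 * pi * k / 12), sin(2 * pi * k / 12)) for k in range(12)]
xs = [p[0] for p in circle]
nodes, edges = mapper_graph(circle, xs, n_intervals=3, overlap=0.6, epsilon=0.6)
```

With these settings the left and right intervals each give one cluster, while the middle interval splits into a top arc and a bottom arc, so the graph has four nodes joined in a cycle and recovers the loop of the circle. Shrinking `overlap` to zero or choosing `epsilon` too small breaks the cycle, which is the kind of sensitivity the notebook explores.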

