`ISpyMSI`: Mass spectrometry tissue segmentation project.

This project contains a python package, and associated scripts, for automating the detection of tissue in mass spectrometry images using a convolutional neural network. It accompanies the paper "Automatic tissue segmentation in mass spectrometry images" link placeholder.

Installation

To install the repository, first clone it with

git clone [email protected]:AZU-BioPharmaceuticals-RD/ISpyMSI.git
cd path/to/repo/
pip install .

Note: it would be a good idea to do this inside a virtual Python env.

Software requirements

The software requirements are defined in the pyproject toml.

Using this method

Clone the repo

git clone ...

Create the virtual environment

To create the virtual environment, we strongly suggest conda. To create it, use

conda env create -f requirements_dev.conda.yaml

If you modify it for your own purposes, remember to run

conda env update -f requirements_dev.conda.yaml

Note, the requirements for the python package are in the file pyproject toml.

Pre-commit hooks

After creating and activating the environment, please use the pre-commit hooks by running

pre-commit install

Recreating the paper analysis (with your own data)

1. Structuring your project

Create a directory, which will serve as the parent directory for your entire project. For the sake of this example, we will call it proj-parent.

The structure should look like this:

├── H&E
├── MSI
└── metadata.csv

The H&E folder should contain the histological whole-slide images.
The MSI folder should contain your .imzML and .ibd files.
The file metadata.csv should be a csv file with the fields: "he_img", "msi", "tissue_type", "organism", "msi_microns_pp", "dataset_id", "ion_mode", "split" and "Notes".
- "dataset_id" can be anything, but integers make sense.
- "split" should be either "train" or "test".
- "msi_microns_pp" should be a float.

2. Labelling the slides: QuPath

In this work we used QuPath v0.4.3.

Install QuPath, create a project, and add your H&Es. Your project dir might now look like:

├── H&E
├── MSI
├── metadata.csv
└── qupath-project

Annotate the tissue, setting the class name to "tissue".
- Note, if you have small fragments of tissue surrounding bigger fragments, group them as single annotation objects.
Save the annotations as .geojson files by using the option available when you click "file".
- Note, you must uncheck all of the boxes before saving.
Save the to a folder called "ROI". Now your project should look like

├── H&E
├── MSI
├── ROI
├── metadata.csv
└── qupath-project

3. Extract information from the MSI files

Run

./scripts/extract_mass_spec_info.py /path/to/proj-parent

where the path is to the folder containing your .imzML and .ibd files. This will extract ion images from the MSI, and after it runs, your project directory should now look like

├── H&E
├── MSI
├── ROI
├── ion-imgs
├── metadata.csv
└── qupath-project

4. Record landmarks

Run

./scripts/record_landmarks.py /path/to/proj-parent

where the directory is the parent directory where all of data for this project are saved. Recording the landmarks should be very self explanatory, and boring. The landmarks will be saved as a csv file in the current working directory.

Project masks

./scripts/project_masks.py /project/base/proj-parent/

Again, the path is to the parent directory for the project. Remember to use the --help argument to look at all of the command-line options.

After completing this step, your folder should look like

├── H&E
├── MSI
├── ROI
├── images-and-masks
├── ion-imgs
├── metadata.csv
└── qupath-project

5. Extract patches

To extract the patches, run

./scripts/extract_patches.py /path/to/proj-parent/images-and-masks/

Again, use --help for more info.

6. Training the model

To train the model, run

./scripts/train_segmentation_model.py /patch/parent/dir/

It is strongly recommended to run this with --help first.

7. Testing the model

To test the model, on unseen data, first run

./scripts/test_segmentation_model.py --help

and then proceed as you see fit.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
scripts		scripts
src/ispy_msi		src/ispy_msi
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AUTHORS.md		AUTHORS.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
landmarks.csv		landmarks.csv
mass-to-charge-ratios.csv		mass-to-charge-ratios.csv
peaks.csv		peaks.csv
pyproject.toml		pyproject.toml
requirements_dev.conda.yaml		requirements_dev.conda.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

`ISpyMSI`: Mass spectrometry tissue segmentation project.

Installation

Software requirements

Using this method

Clone the repo

Create the virtual environment

Pre-commit hooks

Recreating the paper analysis (with your own data)

1. Structuring your project

2. Labelling the slides: QuPath

3. Extract information from the MSI files

4. Record landmarks

Project masks

5. Extract patches

6. Training the model

7. Testing the model

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

AstraZeneca/ISpyMSI

Folders and files

Latest commit

History

Repository files navigation

ISpyMSI: Mass spectrometry tissue segmentation project.

Installation

Software requirements

Using this method

Clone the repo

Create the virtual environment

Pre-commit hooks

Recreating the paper analysis (with your own data)

1. Structuring your project

2. Labelling the slides: QuPath

3. Extract information from the MSI files

4. Record landmarks

Project masks

5. Extract patches

6. Training the model

7. Testing the model

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

`ISpyMSI`: Mass spectrometry tissue segmentation project.

Packages