
Training model on 7T MP2RAGE images #65

valosekj opened this issue Jul 13, 2024 · 2 comments

valosekj commented Jul 13, 2024

This issue tracks the training of the model on 7T MP2RAGE images (hc-leipzig-7t-mp2rage).

Steps (originally posted in #63 (comment)):

Thanks for testing the model @KaterinaKrejci231054! The predictions for the inverted UNIT1 image (multiplied by -1) look promising! I believe we can leverage them for model training. I would take the following steps:

Note that the recommended nnUNet trainer is now nnU-Net ResEnc L; see here. We can then train two models: one with the default trainer and a second with the nnU-Net ResEnc L trainer.

TODO: describe the training, tagging @KaterinaKrejci231054
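
For reference, a hedged sketch of how these two trainings could be launched via the nnU-Net v2 CLI (called from Python here). The dataset ID and fold are placeholders, and the ResEnc L planner/plans identifiers are taken from the nnU-Net ResEnc presets documentation, so please verify them against the installed nnU-Net version:

```python
# Sketch: launch the two planned trainings (default trainer vs nnU-Net ResEnc L preset).
# DATASET_ID and FOLD are hypothetical placeholders.
import subprocess

DATASET_ID = "101"   # hypothetical nnU-Net dataset ID
FOLD = "0"

# 1) default planning + training
subprocess.run(["nnUNetv2_plan_and_preprocess", "-d", DATASET_ID, "--verify_dataset_integrity"], check=True)
subprocess.run(["nnUNetv2_train", DATASET_ID, "3d_fullres", FOLD], check=True)

# 2) ResEnc L preset: re-plan with the ResEnc L planner, then train with its plans
#    (planner/plans names per the nnU-Net ResEnc presets docs; double-check locally)
subprocess.run(["nnUNetv2_plan_and_preprocess", "-d", DATASET_ID, "-pl", "nnUNetPlannerResEncL"], check=True)
subprocess.run(["nnUNetv2_train", DATASET_ID, "3d_fullres", FOLD, "-p", "nnUNetResEncUNetLPlans"], check=True)
```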

KaterinaKrejci231054 commented Jul 22, 2024

Obtaining GT labels and nnUNet model training

1. Obtaining GT labels

The hc-leipzig-7t-mp2rage dataset does not provide ground-truth labels (rootlets segmentations), so the first step is to obtain GT labels for training.

I ran the T2w r20240523 model on inverted UNIT1 data (5 subjects) to obtain rootlets segmentations. These segmentations needed manual corrections (see below) to be considered ground-truth labels:

[figure: example of manual corrections to the predicted rootlets segmentation]

NOTE: Each manually corrected label fits all 3 contrasts (INV1, INV2 and UNIT1), which is a big advantage for us.

[figure: the corrected label overlaid on all 3 contrasts]
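
As a side note on the inverted images used above (UNIT1 multiplied by -1 so that the existing T2w model can be applied), a minimal sketch with nibabel; the file names are hypothetical placeholders:

```python
# Sketch: create a "UNIT1-neg" image by multiplying the UNIT1 volume by -1,
# so the T2w rootlets model can be run on it. File names are placeholders.
import nibabel as nib

unit1 = nib.load("sub-01_UNIT1.nii.gz")
neg_data = unit1.get_fdata() * -1                       # invert intensities

# keep the original affine/header so the result stays aligned with INV1/INV2/UNIT1
neg_img = nib.Nifti1Image(neg_data, unit1.affine, unit1.header)
nib.save(neg_img, "sub-01_UNIT1-neg.nii.gz")
```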

2. Model training

I created 4 datasets for training 4 models (see the dataset-layout sketch below):

  1. UNIT1 data only - model UNIT1
  2. INV1 data only - model INV1
  3. INV2 data only - model INV2
  4. UNIT1 + INV1 + INV2 data together - model MIX

[figure: overview of the 4 training datasets]
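
As referenced above, a hedged sketch of how one of these four nnU-Net v2 datasets (e.g. the UNIT1-only one) could be laid out. The dataset ID/name, paths and label values are placeholder assumptions; the MIX dataset would simply include the UNIT1, INV1 and INV2 images of each subject as separate training cases:

```python
# Sketch: minimal nnU-Net v2 raw-dataset layout and dataset.json for a single-contrast dataset.
import json
from pathlib import Path

raw = Path("nnUNet_raw/Dataset101_rootletsUNIT1")          # hypothetical dataset ID/name
(raw / "imagesTr").mkdir(parents=True, exist_ok=True)      # <case>_0000.nii.gz goes here
(raw / "labelsTr").mkdir(parents=True, exist_ok=True)      # <case>.nii.gz (GT labels) goes here

# placeholder label map: background + one consecutive integer per spinal level C2-C8;
# the real values must match the GT label convention used in this repo
labels = {"background": 0}
labels.update({f"rootlets_C{lvl}": i + 1 for i, lvl in enumerate(range(2, 9))})

dataset_json = {
    "channel_names": {"0": "UNIT1"},    # single input channel per training case
    "labels": labels,
    "numTraining": 4,                   # placeholder; must equal the number of cases in imagesTr
    "file_ending": ".nii.gz",
}
(raw / "dataset.json").write_text(json.dumps(dataset_json, indent=4))
```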

UNIT1 model training

[figure]

INV1 model training

[figure]

INV2 model training

[figure]

MIX model training

[figure]

3. Results - one testing subject

UNIT1 model

[gif: sub-17_testing_UNIT1]

INV1 model

[gif: sub-17_testing_INV1]

INV2 model

[gif: sub-17_testing_INV2]

Comparison: UNIT1 model vs MIX model

[comparison figure]

Comparison: INV1 model vs MIX model

[comparison figure]

Comparison: INV2 model vs MIX model

[comparison figure]

4. Results summary - one testing subject

[summary figure]

valosekj pushed a commit that referenced this issue Aug 29, 2024
KaterinaKrejci231054 commented:

nnUNet model training (10 training vs 15 training subjects, default vs increased patch-size)

More manual corrections were made (see hc-leipzig-7t-mp2rage_train-test_split.csv), and 6 subjects were excluded due to poor data quality. We then trained single-contrast models (based on UNIT1-neg images) and multi-contrast models (based on UNIT1, INV1 and INV2 images).

The new results compare:

  • single-contrast model vs multi-contrast model
  • default vs increased patch-size
  • number of training subjects (n=10 vs n=15)

UNIT1-neg models (single-contrast)

Training with default vs increased patch-size

Training log (Dataset026) graph with model settings: 15 training subjects, 1000 epochs, default patch size [192, 96, 128], fold 0

[figure: training_log_2024_8_21_12_01_25]

Training log (Dataset028) graph with model settings: 15 training subjects, 1000 epochs, increased patch size [352, 96, 128], fold 0

[figures: training_log_2024_8_21_12_08_18, training_log_2024_8_23_10_44_27]
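
For the increased patch size, a hedged sketch of one way it could be set, i.e. by editing the plans file of the preprocessed dataset before training; the path, dataset name and plans layout are assumptions based on the default nnU-Net v2 conventions:

```python
# Sketch: set the enlarged patch size ([352, 96, 128] instead of the planned [192, 96, 128])
# by editing the preprocessed dataset's plans file. Path and dataset name are placeholders.
import json
from pathlib import Path

plans_path = Path("nnUNet_preprocessed/Dataset028_rootletsUNIT1neg/nnUNetPlans.json")  # hypothetical
plans = json.loads(plans_path.read_text())

plans["configurations"]["3d_fullres"]["patch_size"] = [352, 96, 128]  # enlarge the first patch dimension

plans_path.write_text(json.dumps(plans, indent=4))
# Note: a larger patch usually needs more GPU memory; the batch size may have to be reduced.
```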

Impact of default vs increased patch-size (4 testing subjects)

Increasing the patch size (Dataset028) had a positive effect, particularly at the C2 and C3 levels. With the default patch size (Dataset026), training didn't start at these levels, resulting in a Dice score of 0 for testing subjects. In other levels, the performance on testing data was similar between the two models.

Results visualization (violin plot)

[figure: UNIT1_neg_plot_Dataset026_fold0_Dataset028_fold0]
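
For clarity, a minimal sketch of the per-level Dice computation underlying these comparisons, assuming both the GT and the prediction encode each spinal level as a distinct integer value; file names and the value encoding are placeholders:

```python
# Sketch: per-level Dice between a GT rootlets label and a model prediction.
import nibabel as nib
import numpy as np

gt = nib.load("sub-17_label-rootlets.nii.gz").get_fdata()   # hypothetical file names
pred = nib.load("sub-17_pred.nii.gz").get_fdata()

for level in range(2, 9):                       # spinal levels C2-C8 (placeholder value encoding)
    g, p = gt == level, pred == level
    denom = g.sum() + p.sum()
    dice = 2 * np.logical_and(g, p).sum() / denom if denom else np.nan
    print(f"C{level}: Dice = {dice:.3f}")
```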

Impact of increased patch size and different number of training subjects

Spinal level C2
Increasing the patch size led to earlier training initiation at the C2 level in both cases compared to the default patch size (lighter vs. darker colors). Additionally, increasing the number of training subjects also resulted in earlier training at the C2 level (dark blue vs. dark red).

[figure: spinal_level_2_validation_pseudo_dice_Dataset020_Dataset021_Dataset026_Dataset028]

Spinal level C3
Increasing the patch size and the number of training subjects has a less pronounced impact at the C3 level than at the C2 level.

[figure: spinal_level_3_validation_pseudo_dice_Dataset020_Dataset021_Dataset026_Dataset028]
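
A hedged sketch of how per-level validation pseudo-Dice curves like the ones above could be extracted from an nnU-Net training log. It assumes each epoch prints a "Pseudo dice [...]" line with one value per foreground label, so the regex and the label index may need adjusting to the actual log format:

```python
# Sketch: parse per-epoch validation pseudo-Dice for one spinal level from an nnU-Net training log.
import re
import matplotlib.pyplot as plt

LEVEL_INDEX = 0                                 # assumption: first foreground label = level C2
dice_per_epoch = []
with open("training_log_2024_8_21_12_08_18.txt") as f:     # hypothetical log file name
    for line in f:
        m = re.search(r"Pseudo dice \[(.*?)\]", line)
        if m:
            raw = m.group(1).replace("np.float32(", "").replace(")", "")
            values = [float(v) for v in raw.split(",")]
            dice_per_epoch.append(values[LEVEL_INDEX])

plt.plot(dice_per_epoch)
plt.xlabel("epoch")
plt.ylabel("validation pseudo-Dice (C2)")
plt.savefig("c2_pseudo_dice.png")
```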

Multi-contrast models (MIX)

NOTE: We treated the INV1, INV2 and UNIT1 images together as the multi-contrast dataset (each image is a separate training case, hence 45 training images from 15 subjects).

Training with default vs increased patch-size

Training log (Dataset027) graph with model settings: 45 training images, 2000 epochs, default patch size [192, 96, 128], fold 0

[figures: training_log_2024_8_21_12_03_51, training_log_2024_8_23_10_55_18]

Training log (Dataset029) graph with model settings: 45 training images, 2000 epochs, increased patch size [352, 96, 128], fold 0

[figure: training_log_2024_8_23_11_29_05]

Impact of default vs increased patch-size (4 testing subjects)

Increasing the patch size had a positive effect, particularly at the C2 and C3 levels. With the default patch size, training didn't start at these levels, resulting in a Dice score of 0 for the testing subjects. At the other levels, the performance on the testing data was similar between the two models.

UNIT1 testing images - results visualization (violin plot)

[figure: UNIT1_plot_Dataset027_fold0_Dataset029_fold0]

INV1 testing images - results visualization (violin plot)

[figure: inv-1_part-mag_MP2RAGE_plot_Dataset027_fold0_Dataset029_fold0]

INV2 testing images - results visualization (violin plot)

[figure: inv-2_part-mag_MP2RAGE_plot_Dataset027_fold0_Dataset029_fold0]
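
A hedged sketch of how such violin plots can be produced with seaborn from a long-format table of per-subject, per-level Dice scores; the CSV file and column names are hypothetical:

```python
# Sketch: violin plot of per-level Dice scores, grouped by model (e.g. Dataset027 vs Dataset029).
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# expected columns: subject, model, level (C2-C8), dice  -- hypothetical table
df = pd.read_csv("testing_dice_scores.csv")

ax = sns.violinplot(data=df, x="level", y="dice", hue="model", cut=0)
ax.set_xlabel("Spinal level")
ax.set_ylabel("Dice")
plt.tight_layout()
plt.savefig("dice_violinplot.png")
```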

Impact of increased patch size and different number of training subjects

Spinal level C2
Increasing the patch size led to earlier training initiation at the C2 level (lighter red vs. darker red).

[figure: spinal_level_2_validation_pseudo_dice_Dataset029_Dataset027_Dataset024]

Spinal level C3
Increasing the patch size led to earlier training initiation at the C3 level (lighter red vs. darker red). Additionally, increasing the number of training subjects also resulted in earlier training at the C3 level (dark blue vs. dark red).

[figure: spinal_level_3_validation_pseudo_dice_Dataset029_Dataset027_Dataset024]

Comparison: single-contrast vs multi-contrast model (UNIT1-neg vs UNIT1 data)

Model settings:

  • Dataset028 … 4 UNIT1-neg testing images, increased patch size [352, 96, 128], fold 0
  • Dataset029 … 4 UNIT1 testing images, increased patch size [352, 96, 128], fold 0

The performance of single-contrast and multi-contrast models is similar, but the multi-contrast model has the advantage of being directly applicable to the original MP2RAGE data. Unlike the UNIT1-neg single-contrast model, there’s no need to create inverse images.

Single-contrast vs multi-contrast model (violin plot)

[figure: UNIT1_plot_Dataset028_fold0_Dataset029_fold0]
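
To illustrate the practical advantage mentioned above (no inverse images needed), a hedged sketch of running the multi-contrast model directly on the original MP2RAGE test images with the standard nnU-Net v2 prediction CLI; the folders, dataset ID and fold are placeholders:

```python
# Sketch: predict rootlets on original MP2RAGE test images with the multi-contrast model.
import subprocess

subprocess.run(
    [
        "nnUNetv2_predict",
        "-i", "test_images/",      # folder with *_0000.nii.gz UNIT1/INV1/INV2 test images
        "-o", "predictions/",
        "-d", "029",               # hypothetical multi-contrast (MIX) dataset ID
        "-c", "3d_fullres",
        "-f", "0",
    ],
    check=True,
)
```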

Summary

  • Increasing the patch size primarily benefited training at the C2 and C3 levels for both the single-contrast model (UNIT1-neg) and the multi-contrast model.

  • At levels C4 to C8, there was no significant difference between the default and the increased patch size.

  • While the performance of the single-contrast and multi-contrast models is similar, the multi-contrast model has the advantage of being directly applicable to the original MP2RAGE data, without the need to create inverse images as required by the UNIT1-neg single-contrast model.
