Skip to content

DDMRA results are dissimilar to original paper's findings #17

Open
@tsalo

Description

@tsalo

The DDMRA results I get are pretty dissimilar to the original paper's findings. While I expect some differences driven by differences in preprocessing and denoising methods, these go beyond my expectations.

From the replication manuscript, our predictions were:

Based on the findings of the original paper, intercepts and slopes were predicted to be statistically significant for the OC data and for the MEDN Noise data. Intercepts (but not slopes) were predicted to be significant for the MEDN data. Finally, neither intercepts nor slopes were predicted to be significant for the MEDN+GODEC and MEDN+GSR data.

We also hypothesized that the extended derivatives included in the analyses would follow similar patterns. Namely, we predicted that the intercepts, but not the slopes, would be statistically significant for the MEDN+GODEC Noise, MEDN+GSR Noise, MEDN+dGSR Noise, MEDN+aCompCor Noise, and MEDN+MIR Noise data. We also predicted that neither the intercepts nor the slopes would be significant for the MEDN+dGSR, MEDN+aCompCor, and MEDN+MIR data.

Analysis of Cambridge, CamCAN, and DuPre datasets

Here are the results using the Cambridge, CamCAN, and DuPre datasets, only dropping subjects with missing or zero-variance data in one or more ROIs (as well as subjects that had already been flagged from other steps). The table shows the p-values for the analyses.

QC:RSFC High-low Scrubbing
OC int. 0.9999 0.9990 0.0000
slope 0.6006 0.3664 0.0000
MEDN int. 1.0000 0.9991 0.0000
slope 0.8616 0.2138 0.0000
MEDN Noise int. 0.9992 0.9988 0.0000
slope 0.9605 0.9649 1.0000
MEDN+aCompCor int. 0.0000 0.0000 0.0000
slope 0.0000 0.0000 0.0000
MEDN+aCompCor Noise int. 1.0000 1.0000 1.0000
slope 1.0000 0.9987 1.0000
MEDN+GODEC int. 0.0000 0.0000 0.0000
slope 0.0014 0.0006 0.0000
MEDN+GODEC Noise int. 1.0000 1.0000 1.0000
slope 1.0000 1.0000 1.0000
MEDN+GSR int. 0.0786 0.0184 0.0000
slope 0.1146 0.0154 0.0000
MEDN+GSR Noise int. 1.0000 1.0000 1.0000
slope 1.0000 1.0000 1.0000
MEDN+MIR int. 0.3915 0.0014 0.0000
slope 0.0474 0.0000 0.0000
MEDN+MIR Noise int. 1.0000 1.0000 0.9741
slope 0.9999 1.0000 1.0000
MEDN+dGSR int. 0.4102 0.2016 0.0000
slope 0.2810 0.3201 0.0000
MEDN+dGSR Noise int. 1.0000 0.9780 0.9834
slope 0.9823 0.7149 0.8673

The code used to perform the analysis is this script, and the version of the DDMRA package I used is at https://github.com/tsalo/ddmra/tree/f9be4687d6465c09aae158509566083d456f2567.

Replication on just the Cambridge dataset

Just in case the problem was due to intersite differences (as mentioned in Power et al., 2017), I ran the analyses just using the Cambridge dataset, which should have produced very similar results to the original paper.

Note that the scrubbing analysis results still look terrible, but also the MEDN intercepts were nowhere near significant and the MEDN+GODEC QC:RSFC intercept was significant (and slope was nearly significant).

QC:RSFC High-low Scrubbing
OC int. 0.0026 0.0390 0.0000
slope 0.0036 0.0055 0.0007
MEDN int. 0.7697 0.6309 0.0000
slope 0.6040 0.3776 0.0000
MEDN Noise int. 0.0022 0.0166 0.0000
slope 0.0071 0.0018 1.0000
MEDN+aCompCor int. 0.0172 0.0697 0.0000
slope 0.1502 0.2623 0.0000
MEDN+aCompCor Noise int. 0.9806 0.9503 1.0000
slope 0.6464 0.3469 1.0000
MEDN+GODEC int. 0.0004 0.1383 0.0000
slope 0.0906 0.5315 0.0000
MEDN+GODEC Noise int. 0.9972 0.9665 0.9776
slope 0.8941 0.4836 0.0350
MEDN+GSR int. 0.2597 0.3426 0.0000
slope 0.2942 0.3766 0.0001
MEDN+GSR Noise int. 0.9842 0.9733 1.0000
slope 0.6605 0.3891 1.0000
MEDN+MIR int. 0.2531 0.3150 0.0000
slope 0.3228 0.3305 0.0000
MEDN+MIR Noise int. 0.0767 0.2449 0.0273
slope 0.0249 0.0278 1.0000
MEDN+dGSR int. 0.5025 0.2794 0.0000
slope 0.4939 0.4604 0.0000
MEDN+dGSR Noise int. 0.9688 0.8504 0.4810
slope 0.4127 0.3522 0.3476

Replication on subjects with mean FD < 0.2mm

Per Power et al., 2017, DDMRA analyses of multiple datasets can be corrupted by differences in baseline levels of motion, as well as outliers.

Specifically, what it says is:

A third way in which motion artifact can be overlooked is if a handful of outlying datasets are allowed to influence the scrubbing or QC:RSFC correlations calculated across many subjects; correlations are sensitive to outlying values and a few scans with marked abnormalities can obscure relationships present across most other datasets.

Also, in the original paper's supplement, they re-run the analyses with only subjects with mean FD < 0.3 and < 0.2 mm, separately, and say that the results were basically the same. I was concerned that the CamCAN dataset, which has higher levels of motion (probably because there are older participants), might be causing problems in the main analysis, so I ran the supplement's analysis of only subjects with < 0.2mm mean FD. However, the results are still quite dissimilar from the original paper.

I don't know if there is a problem with the analyses/dataset or if the CamCAN dataset is simply driving different, but still valid, results.

QC:RSFC High-low Scrubbing
OC int. 0.9980 0.9973 0.0000
slope 0.2761 0.5359 0.0000
MEDN int. 0.8211 0.7266 0.0000
slope 0.0272 0.1078 0.0000
MEDN Noise int. 0.9998 1.0000 0.0526
slope 0.9406 0.9796 1.0000
MEDN+aCompCor int. 0.0171 0.0153 0.0000
slope 0.0067 0.0181 0.0000
MEDN+aCompCor Noise int. 0.9993 0.9985 1.0000
slope 0.4379 0.8967 1.0000
MEDN+GODEC int. 0.0000 0.0001 0.0000
slope 0.0239 0.0565 0.0000
MEDN+GODEC Noise int. 1.0000 1.0000 1.0000
slope 0.9495 0.9789 1.0000
MEDN+GSR int. 0.0160 0.0641 0.0000
slope 0.0086 0.0432 0.0000
MEDN+GSR Noise int. 1.0000 1.0000 1.0000
slope 0.8618 0.9847 1.0000
MEDN+MIR int. 0.0002 0.0006 0.0000
slope 0.0000 0.0010 0.0000
MEDN+MIR Noise int. 1.0000 1.0000 1.0000
slope 0.9946 0.9985 1.0000
MEDN+dGSR int. 0.1122 0.0267 0.0000
slope 0.2781 0.2620 0.0000
MEDN+dGSR Noise int. 0.6137 0.6661 0.1692
slope 0.0455 0.5208 0.1339

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions