Description
The DDMRA results I get are pretty dissimilar to the original paper's findings. While I expect some differences driven by differences in preprocessing and denoising methods, these go beyond my expectations.
From the replication manuscript, our predictions were:
Based on the findings of the original paper, intercepts and slopes were predicted to be statistically significant for the OC data and for the MEDN Noise data. Intercepts (but not slopes) were predicted to be significant for the MEDN data. Finally, neither intercepts nor slopes were predicted to be significant for the MEDN+GODEC and MEDN+GSR data.
We also hypothesized that the extended derivatives included in the analyses would follow similar patterns. Namely, we predicted that the intercepts, but not the slopes, would be statistically significant for the MEDN+GODEC Noise, MEDN+GSR Noise, MEDN+dGSR Noise, MEDN+aCompCor Noise, and MEDN+MIR Noise data. We also predicted that neither the intercepts nor the slopes would be significant for the MEDN+dGSR, MEDN+aCompCor, and MEDN+MIR data.
Analysis of Cambridge, CamCAN, and DuPre datasets
Here are the results using the Cambridge, CamCAN, and DuPre datasets, only dropping subjects with missing or zero-variance data in one or more ROIs (as well as subjects that had already been flagged from other steps). The table shows the p-values for the analyses.
QC:RSFC | High-low | Scrubbing | ||
---|---|---|---|---|
OC | int. | 0.9999 | 0.9990 | 0.0000 |
slope | 0.6006 | 0.3664 | 0.0000 | |
MEDN | int. | 1.0000 | 0.9991 | 0.0000 |
slope | 0.8616 | 0.2138 | 0.0000 | |
MEDN Noise | int. | 0.9992 | 0.9988 | 0.0000 |
slope | 0.9605 | 0.9649 | 1.0000 | |
MEDN+aCompCor | int. | 0.0000 | 0.0000 | 0.0000 |
slope | 0.0000 | 0.0000 | 0.0000 | |
MEDN+aCompCor Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 1.0000 | 0.9987 | 1.0000 | |
MEDN+GODEC | int. | 0.0000 | 0.0000 | 0.0000 |
slope | 0.0014 | 0.0006 | 0.0000 | |
MEDN+GODEC Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 1.0000 | 1.0000 | 1.0000 | |
MEDN+GSR | int. | 0.0786 | 0.0184 | 0.0000 |
slope | 0.1146 | 0.0154 | 0.0000 | |
MEDN+GSR Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 1.0000 | 1.0000 | 1.0000 | |
MEDN+MIR | int. | 0.3915 | 0.0014 | 0.0000 |
slope | 0.0474 | 0.0000 | 0.0000 | |
MEDN+MIR Noise | int. | 1.0000 | 1.0000 | 0.9741 |
slope | 0.9999 | 1.0000 | 1.0000 | |
MEDN+dGSR | int. | 0.4102 | 0.2016 | 0.0000 |
slope | 0.2810 | 0.3201 | 0.0000 | |
MEDN+dGSR Noise | int. | 1.0000 | 0.9780 | 0.9834 |
slope | 0.9823 | 0.7149 | 0.8673 |
The code used to perform the analysis is this script, and the version of the DDMRA package I used is at https://github.com/tsalo/ddmra/tree/f9be4687d6465c09aae158509566083d456f2567.
Replication on just the Cambridge dataset
Just in case the problem was due to intersite differences (as mentioned in Power et al., 2017), I ran the analyses just using the Cambridge dataset, which should have produced very similar results to the original paper.
Note that the scrubbing analysis results still look terrible, but also the MEDN intercepts were nowhere near significant and the MEDN+GODEC QC:RSFC intercept was significant (and slope was nearly significant).
QC:RSFC | High-low | Scrubbing | ||
---|---|---|---|---|
OC | int. | 0.0026 | 0.0390 | 0.0000 |
slope | 0.0036 | 0.0055 | 0.0007 | |
MEDN | int. | 0.7697 | 0.6309 | 0.0000 |
slope | 0.6040 | 0.3776 | 0.0000 | |
MEDN Noise | int. | 0.0022 | 0.0166 | 0.0000 |
slope | 0.0071 | 0.0018 | 1.0000 | |
MEDN+aCompCor | int. | 0.0172 | 0.0697 | 0.0000 |
slope | 0.1502 | 0.2623 | 0.0000 | |
MEDN+aCompCor Noise | int. | 0.9806 | 0.9503 | 1.0000 |
slope | 0.6464 | 0.3469 | 1.0000 | |
MEDN+GODEC | int. | 0.0004 | 0.1383 | 0.0000 |
slope | 0.0906 | 0.5315 | 0.0000 | |
MEDN+GODEC Noise | int. | 0.9972 | 0.9665 | 0.9776 |
slope | 0.8941 | 0.4836 | 0.0350 | |
MEDN+GSR | int. | 0.2597 | 0.3426 | 0.0000 |
slope | 0.2942 | 0.3766 | 0.0001 | |
MEDN+GSR Noise | int. | 0.9842 | 0.9733 | 1.0000 |
slope | 0.6605 | 0.3891 | 1.0000 | |
MEDN+MIR | int. | 0.2531 | 0.3150 | 0.0000 |
slope | 0.3228 | 0.3305 | 0.0000 | |
MEDN+MIR Noise | int. | 0.0767 | 0.2449 | 0.0273 |
slope | 0.0249 | 0.0278 | 1.0000 | |
MEDN+dGSR | int. | 0.5025 | 0.2794 | 0.0000 |
slope | 0.4939 | 0.4604 | 0.0000 | |
MEDN+dGSR Noise | int. | 0.9688 | 0.8504 | 0.4810 |
slope | 0.4127 | 0.3522 | 0.3476 |
Replication on subjects with mean FD < 0.2mm
Per Power et al., 2017, DDMRA analyses of multiple datasets can be corrupted by differences in baseline levels of motion, as well as outliers.
Specifically, what it says is:
A third way in which motion artifact can be overlooked is if a handful of outlying datasets are allowed to influence the scrubbing or QC:RSFC correlations calculated across many subjects; correlations are sensitive to outlying values and a few scans with marked abnormalities can obscure relationships present across most other datasets.
Also, in the original paper's supplement, they re-run the analyses with only subjects with mean FD < 0.3 and < 0.2 mm, separately, and say that the results were basically the same. I was concerned that the CamCAN dataset, which has higher levels of motion (probably because there are older participants), might be causing problems in the main analysis, so I ran the supplement's analysis of only subjects with < 0.2mm mean FD. However, the results are still quite dissimilar from the original paper.
I don't know if there is a problem with the analyses/dataset or if the CamCAN dataset is simply driving different, but still valid, results.
QC:RSFC | High-low | Scrubbing | ||
---|---|---|---|---|
OC | int. | 0.9980 | 0.9973 | 0.0000 |
slope | 0.2761 | 0.5359 | 0.0000 | |
MEDN | int. | 0.8211 | 0.7266 | 0.0000 |
slope | 0.0272 | 0.1078 | 0.0000 | |
MEDN Noise | int. | 0.9998 | 1.0000 | 0.0526 |
slope | 0.9406 | 0.9796 | 1.0000 | |
MEDN+aCompCor | int. | 0.0171 | 0.0153 | 0.0000 |
slope | 0.0067 | 0.0181 | 0.0000 | |
MEDN+aCompCor Noise | int. | 0.9993 | 0.9985 | 1.0000 |
slope | 0.4379 | 0.8967 | 1.0000 | |
MEDN+GODEC | int. | 0.0000 | 0.0001 | 0.0000 |
slope | 0.0239 | 0.0565 | 0.0000 | |
MEDN+GODEC Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 0.9495 | 0.9789 | 1.0000 | |
MEDN+GSR | int. | 0.0160 | 0.0641 | 0.0000 |
slope | 0.0086 | 0.0432 | 0.0000 | |
MEDN+GSR Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 0.8618 | 0.9847 | 1.0000 | |
MEDN+MIR | int. | 0.0002 | 0.0006 | 0.0000 |
slope | 0.0000 | 0.0010 | 0.0000 | |
MEDN+MIR Noise | int. | 1.0000 | 1.0000 | 1.0000 |
slope | 0.9946 | 0.9985 | 1.0000 | |
MEDN+dGSR | int. | 0.1122 | 0.0267 | 0.0000 |
slope | 0.2781 | 0.2620 | 0.0000 | |
MEDN+dGSR Noise | int. | 0.6137 | 0.6661 | 0.1692 |
slope | 0.0455 | 0.5208 | 0.1339 |