-
Notifications
You must be signed in to change notification settings - Fork 459
Support for PureCN #1710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Support for PureCN #1710
Conversation
Allows HGVS annotation when wokring offline. Fixes nf-core#1607.
|
I didn't hook the VCF part yet, because PureCN depends on MuTect2, but it also requires specific parameters:
a. MuTect2 is being run; |
Also lint fails, |
This needs to be upstreamed pronto!
As discussed as the hackathon, we can reasonably assume PureCN is run in the recommended way.
As long as Mutect has been run with a PoN, PureCN can work with it.
From the implementation side, at least conceptually, everything should be in place. I can't test this here at the hackathon, so this will have to wait until I'm back to use some real-world tests. |
Allows HGVS annotation when wokring offline. Fixes nf-core#1607.
This reverts commit 7c3e1f4. It wasn't meant to be here. In the end I guess I'll rebase everything....
Warning Newer version of the nf-core template is available. Your pipeline is using an old version of the nf-core template: 3.0.2. For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation. |
If I can, I'll update the nf-core modules to use a newer PureCN. |
A potential issue that I've stumbled upon is that for annotation PureCN needs to process the intervals (BED), and requires that the normal data and the GATK data to be aligned to these intervals or it would error out. While this is pretty much doable for the data under sarek's control, this would need to be written somewhere as a caveat (since the PoN should be made somewhere else). |
More snags: using a PoN from the GATK will remove regions, which then cause issues when comparing the normals (unfiltered) with the tumors (filtered). See lima1/PureCN#386 |
Given that I can't seem to fix this issue, I have a better plan:
Of course, the best course of action is fixing the upstream issue. I think I fixed the underlying issue (lima1/PureCN#388), but it needs to be merged (if appropriate) and a new release made, and since PureCN is in Bioconductor that might require quite a while. |
I deeply regret trying to work on this. ;) |
Is there a blocker to bump the bioconda recipe? |
Yes, the fact that they are on Bioconductor 3.20, and the release is for Bioconductor 3.21. |
ahhh 😞, now I understand |
Blocking issue: bioconda/bioconda-recipes#55833 |
This PR implements support for PureCN (https://github.com/lima1/).
Design rationale talked at the hackathon:
CI is failing, but in areas I didn't touch (hopefully!)
TODO:
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).nextflow run . -profile debug,test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).