Skip to content

Udate GATK modules - Fix output line lack of specificity #1839

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 12 commits into
base: dev
Choose a base branch
from

Conversation

johnoooh
Copy link

@johnoooh johnoooh commented Mar 18, 2025

Addresses #1838 for the APPLYBQSR step and applies this PR from the modules repo

When using APPLYBQSR on an HPC with scratch enabled, the globbed *.bam copys both the input and output bam from scratch to the work directory. The output needed to be more specific. It is now ${prefix}.bam for the output so it now targets the output bam/cram instead of all files with the right file extension. For more details see the issue.

This should fix the issue for APPLYBQSR but be on the lookout for other modules where this occurs.

Also it looks like the modules.json is getting updated, was there a schema change? Also the gatk bioconda version is updated to 4.6.1.0

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/sarek branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core pipelines lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Check for unexpected warnings in debug mode (nextflow run . -profile debug,test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

maxulysse and others added 2 commits February 18, 2025 09:44
<!--
# nf-core/sarek pull request

Many thanks for contributing to nf-core/sarek!

Please fill in the appropriate checklist below (delete whatever is not
relevant).
These are the most common things requested on pull requests (PRs).

Remember that PRs should be made against the dev branch, unless you're
preparing a pipeline release.

Learn more about contributing:
[CONTRIBUTING.md](https://github.com/nf-core/sarek/tree/master/.github/CONTRIBUTING.md)
-->

## PR checklist

- [ ] This comment contains a description of changes (with reason).
- [ ] If you've fixed a bug or added code that should be tested, add
tests!
- [ ] If you've added a new tool - have you followed the pipeline
conventions in the [contribution
docs](https://github.com/nf-core/sarek/tree/master/.github/CONTRIBUTING.md)
- [ ] If necessary, also make a PR on the nf-core/sarek _branch_ on the
[nf-core/test-datasets](https://github.com/nf-core/test-datasets)
repository.
- [ ] Make sure your code lints (`nf-core pipelines lint`).
- [ ] Ensure the test suite passes (`nextflow run . -profile test,docker
--outdir <OUTDIR>`).
- [ ] Check for unexpected warnings in debug mode (`nextflow run .
-profile debug,test,docker --outdir <OUTDIR>`).
- [ ] Usage Documentation in `docs/usage.md` is updated.
- [ ] Output Documentation in `docs/output.md` is updated.
- [ ] `CHANGELOG.md` is updated.
- [ ] `README.md` is updated (including new tool citations and
authors/contributors).

---------

Co-authored-by: Friederike Hanssen <[email protected]>
Co-authored-by: Friederike Hanssen <[email protected]>
Co-authored-by: tdanhorn <[email protected]>
Co-authored-by: Thomas <[email protected]>
@nf-core-bot
Copy link
Member

Warning

Newer version of the nf-core template is available.

Your pipeline is using an old version of the nf-core template: 3.0.2.
Please update your pipeline to the latest version.

For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation.

@maxulysse
Copy link
Member

@johnoooh, sorry to bother you about it, can you try using pre-commit, so that the linting stays the same on the modules.json file?

@maxulysse
Copy link
Member

Since this change is also updating a version of a GATK module, I would recommend to keep this PR open, and update ALL GATK modules to the same version in just a single PR.

@maxulysse maxulysse changed the title APPLYBQSR output line specificity Udate GATK modules - Fix output line lack of specificity Mar 19, 2025
@johnoooh
Copy link
Author

Yeah I realized the module file was a pre-commit issue. I think pre-commit doesn't play nice with my HPC cluster for some reason but I'll figure that out.

I'll work on the GATK versions as well, no worries! Looks like they all got the version bump a few months ago in the modules repo.

Copy link
Contributor

@FriederikeHanssen FriederikeHanssen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In addition, the modules should really be updated first in nf-core/modules and then secondary in nf-core.

@johnoooh
Copy link
Author

In addition, the modules should really be updated first in nf-core/modules and then secondary in nf-core.

All of the GATK versions were bumped in the modules repo a few months ago, I'll just be using nf-core module upgrade to bring in the changes.

@FriederikeHanssen
Copy link
Contributor

All of the GATK versions were bumped in the modules repo a few months ago, I'll just be using nf-core module upgrade to bring in the changes.

Is the prefix as in all of them already?

@johnoooh
Copy link
Author

johnoooh commented Mar 24, 2025 via email

@FriederikeHanssen
Copy link
Contributor

sounds good and thanks for all the work ❤️

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants