-
Notifications
You must be signed in to change notification settings - Fork 3
Description
When running very large sample BAMs through call-sSNV, it likely that the pipeline would fail because of default resource configurations.
Although the base_resouce_update function in template.config is a great utility to update resources on a case-by-case basis, it is often unclear on how much of the resource is to be updated for a successful run. It would be nice provide examples of resource configurations that worked for large BAMs, perhaps in a doc/ dir of the repo.
Here are the resources I set in my pipeline run's config,
base_resource_update {
cpus = [
['call_sIndel_Manta', 0.1]
]
memory = [
[['run_validate_PipeVal', 'call_sSNV_Strelka2', 'call_sSNV_Mutect2', 'call_sIndel_Manta', 'concat_VCFs_BCFtools', 'plot_VennDiagram_R', 'run_LearnReadOrientationModel_GATK', 'convert_BAM2Pileup_SAMtools'], 10],
[['call_sSNV_MuSE', 'run_sump_MuSE'], 2]
]
}
Nextflow trace files
Case 1:
Normal - 2.5TB
Tumor - 1.1TB - /hot/data/unregistered/Zook-Mootor-BNCH-GIAB/analysis/GIAB/AshkenazimParents/somatic-variants/call-sSNV-7.0.0/HG002-T/log-call-sSNV-7.0.0-20231201T000959Z/nextflow-log/trace.txt
Case 2:
Normal - 369GB
- Tumor 1 - 312GB - /hot/data/unregistered/Zook-Mootor-BNCH-GIAB/analysis/GIAB/CancerGIAB/metapipeline/metapipeline-DNA-5.3.1/BNCH000122/main_workflow/output/call-sSNV-8.0.0/ZMBNGIAB000008-T001-C01-F/log-call-sSNV-8.0.0-20240421T231004Z/nextflow-log/trace.txt
- Tumor 2 - 328GB - /hot/data/unregistered/Zook-Mootor-BNCH-GIAB/analysis/GIAB/CancerGIAB/metapipeline/metapipeline-DNA-5.3.1/BNCH000122/main_workflow/output/call-sSNV-8.0.0/ZMBNGIAB000008-T001-C02-F/log-call-sSNV-8.0.0-20240421T231102Z/nextflow-log/trace.txt