Aligner: Create a merged file when using folder as input #1133

Open
billytcl opened this issue Nov 15, 2024 · 1 comment
Comments

@billytcl

Issue Report

Please describe the issue:

Is there a way to output a merged file when using a folder as input? Right now I'm using

dorado aligner --recursive (fasta reference) (read folder) > output.bam

which errors out:

[2024-11-15 12:09:23.137] [error] An output-dir must be specified if reading from an input folder.
[2024-11-15 12:09:23.137] [error] Could not initialise for input ../../globus_download/08302024_XKLT001_D18_RERUN_SV/

When I use -o with an input folder, it creates a ton of small BAMs, one for each file in the input folder (e.g. what MinKNOW used to output before recent updates). Presumably this is so each file can be sorted, but is there a way to just create a single aligned output BAM? I can sort it later. The problem is that merging BAM files with samtools gets really problematic when you have thousands of them (e.g. I have to break them into subgroups).

Run environment:

  • Dorado version: 0.8.3
  • Operating system: Linux
  • Source data type (e.g., pod5 or fast5 - please note we always recommend converting to pod5 for optimal basecalling performance): unaligned bam from MinKNOW
@HalfPhoton
Collaborator

@billytcl
At this time there's no way to generate a single output from multiple inputs directly in Dorado, as you've found.

Kind regards,
Rich
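
Until Dorado can do this itself, the subgroup merging described in the issue can be scripted. The following is only a rough sketch, not anything Dorado provides: it assumes samtools is on the PATH, that the per-file BAMs from `dorado aligner -o` have already been written to a folder, and that `samtools merge -b` (read the input BAM names from a list file) behaves as documented in your samtools version. The function names, the `batch_size` default, and the `batch_NNNN` file naming are all made up for illustration.

```python
import subprocess
from pathlib import Path


def chunked(items, size):
    """Split a list into consecutive batches of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    return [items[i:i + size] for i in range(0, len(items), size)]


def merge_commands(bam_paths, out_bam, work_dir, batch_size=500):
    """Plan a two-stage merge without running anything.

    Returns (file_lists, commands): file_lists maps each list-file path
    to the BAM paths that should be written into it, and commands holds
    the samtools invocations in the order they should run.
    """
    paths = sorted(str(p) for p in bam_paths)
    if not paths:
        raise ValueError("no BAM files to merge")
    work_dir = Path(work_dir)
    file_lists, commands, intermediates = {}, [], []
    # Stage 1: merge each batch into an intermediate BAM.
    for n, batch in enumerate(chunked(paths, batch_size)):
        list_file = work_dir / f"batch_{n:04d}.txt"   # hypothetical naming
        part = work_dir / f"batch_{n:04d}.bam"
        file_lists[list_file] = batch
        commands.append(["samtools", "merge", "-b", str(list_file), str(part)])
        intermediates.append(str(part))
    # Stage 2: merge the intermediates into the final output.
    final_list = work_dir / "final.txt"
    file_lists[final_list] = intermediates
    commands.append(["samtools", "merge", "-b", str(final_list), str(out_bam)])
    return file_lists, commands


def run_merge(aligned_dir, out_bam, work_dir, batch_size=500):
    """Write the list files and run the planned samtools commands."""
    bams = sorted(Path(aligned_dir).rglob("*.bam"))
    Path(work_dir).mkdir(parents=True, exist_ok=True)
    file_lists, commands = merge_commands(bams, out_bam, work_dir, batch_size)
    for path, entries in file_lists.items():
        path.write_text("\n".join(entries) + "\n")
    for cmd in commands:
        subprocess.run(cmd, check=True)  # stop on the first failure
```

Calling `run_merge("aligned/", "merged.bam", "merge_tmp/")` would then do the whole thing; keep the work folder outside the aligned folder so the intermediates are not picked up as inputs. Passing the names through a list file keeps each command line short, and batching keeps the number of files samtools has open at once below the usual per-process limit, which is presumably what makes merging thousands of files in one go fail.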
