-
Notifications
You must be signed in to change notification settings - Fork 4
Applicable to ONT nanopore long reads? #20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi, How many calls do you have on chr1? Can you please check the file size of |
Thank you for your response. For NA12878 ONT reads, the file size of the original bam file used as input is 148.5 GB, and the file size of the smoothed.selective.bam after the smoothing step is 99.2 GB. The number of SV calls in the final output svs_poa.vcf was 1,439, which was only for chr1. Although the smoothing step reduced the reads, the likely problematic step might be the final call step. I show the last part of the log of the call step. [I] Loaded all chromosomes. The batch number of NA12878 ONT was significantly smaller than that for NA12878 HiFi, as is shown below. . |
I'd say that if you have less batches, the reasons could be:
I don't think that the issue is in the call step since batches are the output of previous steps.. It will be easier for me to check what is going on by running SVDSS by myself. Since you are analyzing the NA12878, I suspect that the sample is publicly available. Is this correct? In case, can you please provide me the link to the ONT dataset? Thanks |
The fastq file of NA12878 ONT reads was obtained from the ENA site, https://www.ebi.ac.uk/ena/browser/view/SRR15058167, where SRR15058167_1.fastq.gz is linked. Please let me know what is the cause of this trouble if you could know. Thank you |
Hi, I ran SVDSS on that sample and the final VCF has calls for all chromosomes. I got 93 batches in the working directory. Can you please check how many These the commands I ran:
This is the log of
And this is the log of
Let me know |
The number of my batch sfs files was 92. My log of SVDSS search: How many calls did you get? Thank you |
I ran it on hg38 so this could be the reason of the different batches number. I got 63355 calls, but I'd not fully trust all these calls (I repeat: I've never tested SVDSS on ONT so I don't have any results on its accuracy). So the issue is the missing
and it should call SVs from each chromosome. Let me know if this fixes the issue. Note to myself: improve CLI and remove batches option. |
According to your suggestion, I conducted SVDSS call command with the option --batches 92. |
Thank you for providing the excellent tool. I tried to use HG002 HiFi and ONT long reads for running SVDSS. Although SVDSS output a complete set of SVs when using HiFi reads, SVDSS using ONT reads resulted in generating SVs for only chr1 without any error messages. Can SVDSS use error-containing long reads such as ONT and PacBio CLR reads?
The text was updated successfully, but these errors were encountered: