Skip to content

SVDSS call no output #30

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
jiadong324 opened this issue Nov 18, 2023 · 6 comments
Open

SVDSS call no output #30

jiadong324 opened this issue Nov 18, 2023 · 6 comments
Labels
bug Something isn't working

Comments

@jiadong324
Copy link

Dear author,

Firstly, I got 22 sfs files after running SVDSS search --index ./hg38.fmd --workdir ./HG00733/ --bam HG00733/smoothed.selective.bam. Each of the sfs`` file is named as solution_batch_*.sfs``` and contains some information for each reads.

Then, I run SVDSS call --min-sv-length 50 --workdir ./HG00733 --bam ./HG00733/smoothed.selective.bam --reference ./hg38.no_alt.fa. I got the following output:

SVDSS, Structural Variant Discovery from Sample-specific Strings.
Mode: call
[I] ./HG00733 
[I] PingPong SV Caller running on 4 threads.. 
[I] Loading reference genome from /net/eichler/vol28/eee_shared/assemblies/hg38/no_alt/hg38.no_alt.fa.. 
[I] Extracted chr1 with 248956422 bases. 
[I] Extracted chr10 with 133797422 bases. 
[I] Extracted chr11 with 135086622 bases. 
[I] Extracted chr11_KI270721v1_random with 100316 bases. 
[I] Extracted chr12 with 133275309 bases. 
[I] Extracted chr13 with 114364328 bases. 
[I] Extracted chr14 with 107043718 bases. 
[I] Extracted chr14_GL000009v2_random with 201709 bases. 
[I] Extracted chr14_GL000225v1_random with 211173 bases. 
[I] Extracted chr14_KI270722v1_random with 194050 bases. 
[I] Extracted chr14_GL000194v1_random with 191469 bases. 
[I] Extracted chr14_KI270723v1_random with 38115 bases. 
[I] Extracted chr14_KI270724v1_random with 39555 bases. 
[I] Extracted chr14_KI270725v1_random with 172810 bases. 

There are a number of similar information as shown above. The final output I have is listed below:

[I] Loaded all chromosomes. 
[I] Loading assmbled SFS.. 
[I] Loaded 0 SFS strings on 0 reads. 
[I] Extending superstrings on 4 threads.. 
terminate called recursively
terminate called after throwing an instance of 'terminate called recursively
Aborted

Looking forward to your reply!

Thanks!

@ldenti
Copy link
Collaborator

ldenti commented Nov 21, 2023

Hi,
which version are you using?

I think that you have to rerun the search step using the --assemble argument. We always run the caller on "assembled" SFSs and the caller looks for solution_batch_*.assembled.sfs files in the working directory and not simply solution_batch_*.sfs. Then I also suggest to add --batches 22 to the call command to set how many files to load (this is done automatically in the latest version but not on v1.0.5).

Let me know if this solve the issue. Unfortunately, I still haven't updated the conda recipe to the latest v2 but I updated the README on the repo.. Sorry if this created some confusion on how to run the tool.

Best,

@ldenti ldenti added the bug Something isn't working label Nov 21, 2023
@jiadong324
Copy link
Author

Hi,

I am using v1.0.5. Based on your suggestion, I think that I get the correct output, which is svs_poa.vcf.

Thanks!

@jiadong324
Copy link
Author

Looking forward to v2.0, it's important to set --batches automatically. BTW, please also update your snakemake file.

Thanks!

@ldenti
Copy link
Collaborator

ldenti commented Nov 27, 2023

The snakemake should be already updated (do you refer to this one?). Did you encounter any issue?

I just released a new alpha version for SVDSS2 and running some additional tests.

@jiadong324
Copy link
Author

Yes, for example

{SVDSS_BIN} smooth --reference {input.fa} --bam {input.bam} --threads {threads} > {output.bam}

It dose not write into standard output, but to your working directory. This is what I used:

SVDSS smooth --reference {ref} --bam {input.bam} --threads {threads} --workdir $( dirname {output.bam} )

Moreover, is that possible to remove the temp results like poa.sam, clusters.txt and etc. They take quite large space if I run for hundreds of samples.

@ldenti
Copy link
Collaborator

ldenti commented Nov 30, 2023

This snakefile (the one I linked in my previous message) is for SVDSS2. SVDSS2 prints to stdout. And it shouldn't create the temp files by default.

SVDSS1 writes files to the working directory and it creates the temp files. The snakemake for v1 is this one.

Can you please double check that you are using the correct version of SVDSS with the correct Snakefile? Unfortunately v1 and v2 are incompatible.

I'd suggest to use v2 (last alpha) if you don't want the temp files to be stored.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants