Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

psiPerIsoform resulting in empty .psi file #161

Open
TinyTasy opened this issue Apr 12, 2023 · 2 comments
Open

psiPerIsoform resulting in empty .psi file #161

TinyTasy opened this issue Apr 12, 2023 · 2 comments

Comments

@TinyTasy
Copy link

Dear SUPPA team,

Thank you so much for your amazing tool. It is really helpful for differential isoform analysis.

I am trying to use SUPPA on pacbio single-cell isoseq data. I aligned my data with pbbm2 and used pigeon (SQANTI-based) to obtain a .gff file. Using gffread, I converted my .gff file into a .gtf file. My gtf file looks like this:

Screenshot from 2023-04-12 11-54-45

Thus, having the pb gene and transcript ID as the 9th column in the gtf file.

My expression file is a tab-seperated (.tsv) file consists of 268 samples (pseudobulks) and looks like this:

Screenshot from 2023-04-12 11-56-32

If I now execute this command:

python3.4 /vol/projects/agrinko/TREM2_7_03_2022/SUPPA-2.3/suppa.py psiPerIsoform
-g /vol/projects/agrinko/TREM2_7_03_2022/data/Trem2_Longread/pacbio_TREM2.gtf
-e /vol/projects/agrinko/TREM2_7_03_2022/data/Trem2_Longread/pseudobulk_without_rownames.tsv
-o /vol/projects/agrinko/TREM2_7_03_2022/data/Trem2_Longread/psiPerIsoform_output

I get this warning for each transcript:

INFO:psiPerGene:Reading GTF data.
INFO:psiPerGene:Reading Expression data.
INFO:psiPerGene:Calculating inclusion and generating output.
INFO:lib.tools:Expression for transcript "PB.104659.2" not found. Ignoring it in calculation.
INFO:lib.tools:Expression for transcript "PB.104659.16" not found. Ignoring it in calculation.
INFO:lib.tools:Expression for transcript "PB.98879.2" not found. Ignoring it in calculation.
INFO:lib.tools:Expression for transcript "PB.98879.3" not found. Ignoring it in calculation.
.
.
.

And my .psi output file is empty, only the sample names are persisting.

I already tried multiple things, such as testing tab seperated .txt files and .tsv files. I also already used the transcripts as rownames.

Do you have any idea what could be the issue? Any help is greatly appreciated.

Sincerely,
Tasy

@EduEyras
Copy link
Member

EduEyras commented Apr 12, 2023 via email

@TinyTasy
Copy link
Author

Hello Eduardo!

Thank you for your quick reply.
It's almost embarassing, but yes, the error laid in the expression file, I indeed only had to remove the " ".

I am grateful for your help, thank you!

Sincerely,
Tasy

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants