Stringtie prepDE.py script does not take sequence length into consideration

### Description of the bug

The prepDE.py script from Stringtie is used by GEMmaker to estimate raw reads for downstream differential expression. It is used in both the Hisat2 and STAR pipelinies. Currently, GEMmaker does not set an `-l` argument for this script, which is the average length of input reads. The default `-l` for the `prepDE.py` script is 75. In the example below, I changed the `-l` argument to 150 (the proper number for this dataset) and the output is dramatically different (multiplied by 2).

This will have a  impact on downstream DE, as this can improperally push reads with low counts above the threshold of detection, and will cause differences to be exagerated. This is a critical Bug.

**If I run with default (no `-l` argument) (default is 75) (this is how GEMmaker currently runs Hisat2 and Star)**

TraesCS1A03G0000200,201
TraesCS1A03G0000400,260
TraesCS1A03G0000600,249
TraesCS1A03G0000800,12874
TraesCS1A03G0001000,21
TraesCS1A03G0001200,31
TraesCS1A03G0001300,341
TraesCS1A03G0001400,25
TraesCS1A03G0001500,3093

**If I run with 150 as the `-l` argument (notice that everything is doubled)**

TraesCS1A03G0000200,101
TraesCS1A03G0000400,130
TraesCS1A03G0000600,125
TraesCS1A03G0000800,6437
TraesCS1A03G0001000,11
TraesCS1A03G0001200,16
TraesCS1A03G0001300,171
TraesCS1A03G0001400,13
TraesCS1A03G0001500,1547


### Command used and terminal output

_No response_

### Relevant files

_No response_

### System information

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stringtie prepDE.py script does not take sequence length into consideration #287

Description of the bug

Command used and terminal output

Relevant files

System information

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Stringtie prepDE.py script does not take sequence length into consideration #287

Description

Description of the bug

Command used and terminal output

Relevant files

System information

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions