Skip to content

Removal of the superkingdom rank in NCBI taxonomy breaks MEDI #27

@nayeon1994

Description

@nayeon1994

Hello,

I'm currently using medi according the instructions on Github.
However, I've encountered some errors.

The error occurs during the process architeuthis_filter step in quant.nf.
If I exclude this step, the workflow runs without errors.

To debug the issue, I started by looking into the database.
I noticed that when I run the following command:

architeuthis mapping filter {kraken.k2} --data-dir {medi_db/taxonomy} # (with options like --max-entropy 1 --max-multiplicity 10 --min-consistency 0.1)

no reads are retained. I tried adjusting the parameters with various thresholds, but still, no reads are retained.

When I run :

architeuthis mapping score {kraken.k2} --data-dir {medi_db/taxonomy} -f "{p};{c};{o};{f};{g};{s}"

it works correctly.

But if I run:

architeuthis mapping score {kraken.k2} --data-dir {medi_db/taxonomy} -f "{k}"

or

architeuthis mapping score {kraken.k2} --data-dir {medi_db/taxonomy} -f "{k};{p};{c};{o};{f};{g};{s}" #default

no reads are ratined again.

This leads me to believe there might be an issue with the database, specifically with the kingdom-level information.
I checked the manifest.csv file in the medi_db directory and found that the kingdom names for all species are empty (NaN).

Could you help me debug this issue?

Thank you!

Metadata

Metadata

Assignees

Labels

ncbiIssues related to NCBI including downloading, broken links, or broken DBs.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions