Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automated codon usage table selection #88

Open
mahesh-panchal opened this issue Mar 5, 2024 · 2 comments
Open

Automated codon usage table selection #88

mahesh-panchal opened this issue Mar 5, 2024 · 2 comments

Comments

@mahesh-panchal
Copy link
Collaborator

mahesh-panchal commented Mar 5, 2024

Mitohifi and perhaps other tools need to specify the codon usage table to use.
Can this be automated?

@gbdias
Copy link
Contributor

gbdias commented Nov 8, 2024

The simplest solution I could find is to use Entrez Direct efetch to search NCBI taxonomy using the taxonomy ID (that we currently get from the TAXONKIT_NAME2LINEAGE process:

efetch -db taxonomy -id ${taxid} -format xml | awk -F'[<>]' '/<MGCId>/ {print $3}' > mitocode.txt

nf-core has some of the Entrez Direct tools but not efetch. Entrez Direct is in bioconda so a container can be pulled from Seqera.

@mahesh-panchal
Copy link
Collaborator Author

Want to have a shot at writing either an nf-core module or a local module for it?

nf-core module

Make a fork of nf-core modules.

git checkout -b entrezdirect_efetch

Then make a new module

nf-core modules create entrezdirect/efetch

but it means you'll also have to write an nf-test too.

Locally

Make a fork of the EBP pilot workflow

git checkout -b entrezdirect_efetch
nf-core modules create entrezdirect/efetch

but here you won't have to write an nf-test and don't need to follow nf-core guidelines.

Then make PR back to the main workflow

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants