GitHub - peterthorpe5/public_scripts: collection of bioinformatic scripts

Peter Thorpe's public bioinformatics scripts.

For most programs here, help is available at the command line:

Type python script_name.py -h to see how to use a script.
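
As an illustration of how a -h flag usually comes about in a Python script, here is a minimal sketch using the standard library's argparse, which adds -h/--help automatically. Whether every script in this repository uses argparse is an assumption, and the option names below (-i/--in, -o/--out) are hypothetical and not taken from any script here.

```python
import argparse


def build_parser():
    """Build a parser for a hypothetical script.

    argparse registers -h/--help on its own; the options below are
    illustrative only and do not come from this repository.
    """
    parser = argparse.ArgumentParser(
        description="Hypothetical example script (not from this repository)."
    )
    parser.add_argument("-i", "--in", dest="infile", required=True,
                        help="input FASTA file (hypothetical option)")
    parser.add_argument("-o", "--out", dest="outfile", default="out.txt",
                        help="output file (hypothetical option)")
    return parser


# Passing an explicit list stands in for the real command line (sys.argv).
example_args = build_parser().parse_args(["-i", "genes.fasta"])
```

Running such a script with -h prints the description and the help string of each option, then exits.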

This is an ever-growing repository of tools which I have used, or am using, for the various projects I am involved in.

Within here you will find:

Alternative_to_ITS1_finding

This was an early draft to try to identify single-copy common regions within genomes for which primers could be designed, as an alternative metabarcoding region to ITS1. This is not under any further development.

genomic_upstream_regions

This gets the upstream regions of a given gene set to help identify promoter regions. Used in: The genome of the yellow potato cyst nematode, Globodera rostochiensis, reveals insights into the basis of parasitism and virulence (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-0985-1).
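
To make the idea of an upstream region concrete, here is a minimal sketch and not the script in this folder. It assumes 1-based inclusive gene coordinates (as in GFF files) and a plain string for the scaffold sequence; the function names are hypothetical. On the minus strand, "upstream" lies to the right of the gene on the reference and is returned reverse-complemented.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def revcomp(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def upstream_region(scaffold, start, end, strand, length):
    """Return up to `length` bases upstream of a gene.

    start/end are 1-based inclusive (GFF style, an assumption here).
    The region is truncated if it would run off the scaffold.
    """
    if strand == "+":
        left = max(0, start - 1 - length)
        return scaffold[left:start - 1]
    if strand == "-":
        # Upstream of a minus-strand gene is after `end` on the reference.
        return revcomp(scaffold[end:end + length])
    raise ValueError("strand must be '+' or '-'")
```

A real tool would probably also stop the region where it runs into a neighbouring gene; that check is left out of this sketch.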

blast_output

Scripts to identify top BLAST hits and to filter BLAST hits by a given phylum (or whatever tax ID is given).
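
As a rough illustration of picking a top hit, the sketch below keeps the highest-bitscore line per query from BLAST tabular output. It assumes the default 12-column tabular layout (query ID in column 1, bitscore in column 12) and is not the code in this folder; the function name is hypothetical. Filtering by phylum needs taxonomy data and is not shown.

```python
def top_hits(lines):
    """Return {query_id: fields} for the best-scoring hit of each query.

    `lines` is any iterable of tab-separated BLAST lines in the default
    12-column tabular layout (assumed); comment lines are skipped.
    """
    best = {}
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        query, bitscore = fields[0], float(fields[11])
        # Keep the first hit seen for a query unless a later one scores higher.
        if query not in best or bitscore > best[query][0]:
            best[query] = (bitscore, fields)
    return {query: fields for query, (_, fields) in best.items()}
```

Because an open file is an iterable of lines, the function can be called directly on a file handle.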

identifying_pallindrome

A collection of scripts to identify and plot palindromes in a given genome sequence. Please read the Word file in the folder if you want to know more.
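
For readers unfamiliar with the term, the sketch below assumes "palindrome" means a DNA stretch that equals its own reverse complement (for example GAATTC). That reading, the brute-force window scan, and the function names are assumptions for illustration; the scripts in this folder may define and detect palindromes differently.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_palindromes(seq, min_len=4, max_len=12):
    """Return (0-based start, sequence) for every reverse-complement palindrome.

    Only even lengths are scanned: an odd-length window cannot equal its
    own reverse complement, because no base is its own complement.
    """
    seq = seq.upper()
    hits = []
    first_even = min_len + (min_len % 2)
    for length in range(first_even, max_len + 1, 2):
        for i in range(len(seq) - length + 1):
            window = seq[i:i + length]
            if window == revcomp(window):
                hits.append((i, window))
    return hits
```

This scan is quadratic in the worst case and is only meant to show the definition; it is not tuned for whole genomes.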

reformat_fasta_hints_names_for_Braker

Script to rename the fasta names and hints names for BRAKER gene prediction. For me, BRAKER failed if I did not do this myself.

convert_file_format

A collection of scripts to convert file formats from one type to another.

ITS_copy_number

A pipeline and collection of scripts to estimate the copy number of ITS1 regions (or any other given gene of interest) based on genomic read coverage.
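
The general idea of a coverage-based estimate can be sketched as a ratio: read depth over the region of interest divided by the depth typical of single-copy regions. The code below is a minimal sketch under that assumption, with hypothetical names; it is not the pipeline in this folder, which may normalise coverage differently.

```python
from statistics import mean, median


def estimate_copy_number(target_depths, single_copy_depths):
    """Estimate copy number as target depth relative to a single-copy baseline.

    target_depths: per-base read depths over the region of interest.
    single_copy_depths: per-base depths over regions assumed to be single copy.
    The median is used for the baseline so a few outlier positions matter less.
    """
    baseline = median(single_copy_depths)
    if baseline <= 0:
        raise ValueError("single-copy baseline depth must be positive")
    return mean(target_depths) / baseline
```

For example, a region with a mean depth of 100 against a single-copy baseline of 10 gives an estimate of about 10 copies.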

shell_ITS_clustering_pipline

A metabarcoding clustering pipeline written in shell. This is a draft for the upcoming metapy.py pipeline (https://github.com/widdowquinn/THAPBI-pycits/tree/master).

Diamond_BLAST_add_taxonomic_info

Tool to add taxonomic annotation to DIAMOND BLAST output after the search has run.

Lateral_gene_transfer_prediction_tool

Tool to predict horizontal or lateral gene transfer.

split_up_fasta_file_into_N_files

Tool to split up a large fasta file into N smaller fasta files.
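
One simple way to split a fasta file into N parts is to deal the records out round-robin, which keeps the number of records per file nearly equal. The sketch below assumes that strategy and uses hypothetical function names and output file names; the tool in this folder may split differently (for example, in contiguous blocks).

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of fasta lines."""
    header, chunks = None, []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def split_round_robin(records, n):
    """Distribute records over n bins, one record at a time."""
    if n < 1:
        raise ValueError("n must be at least 1")
    bins = [[] for _ in range(n)]
    for i, record in enumerate(records):
        bins[i % n].append(record)
    return bins


def write_bins(bins, prefix):
    """Write each bin to <prefix>_<k>.fasta (hypothetical naming scheme)."""
    paths = []
    for k, records in enumerate(bins, start=1):
        path = f"{prefix}_{k}.fasta"
        with open(path, "w") as out:
            for header, seq in records:
                out.write(f">{header}\n{seq}\n")
        paths.append(path)
    return paths
```

A call might look like write_bins(split_round_robin(read_fasta(open("big.fasta")), 4), "part"), where big.fasta is a placeholder file name.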

domain_searching

Pipeline to identify and align domains of interest.

NGS

Tools for working with Illumina data.

transposon_analysis

Tools and pipelines for transposon analysis in genomes.

Fix_five_prime

Tool to refine the 5' start codon after TransDecoder has predicted the CDS from an RNA-seq assembly.

primer_designer

Under development

gene_model_testing

Pipeline to test gene models and gain information as to how good they are. There is no one method to do this!

produce_random_seq

Program to produce N random sequences with the average length and average GC content of a given database.
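
A minimal sketch of the idea, not the program in this folder: measure the mean length and overall GC fraction of the input sequences, then draw each base independently so that G and C together have that probability. The function names and the independent-base model are assumptions made for illustration.

```python
import random


def gc_and_mean_length(seqs):
    """Return (overall GC fraction, rounded mean length) for a list of sequences."""
    if not seqs:
        raise ValueError("need at least one sequence")
    total = sum(len(s) for s in seqs)
    gc = sum(s.upper().count("G") + s.upper().count("C") for s in seqs)
    return gc / total, round(total / len(seqs))


def random_sequences(n, length, gc, seed=None):
    """Generate n random DNA strings of the given length and expected GC fraction.

    Each base is drawn independently: G and C share probability `gc`,
    A and T share the remainder. A seed makes the output repeatable.
    """
    rng = random.Random(seed)
    weights = [(1 - gc) / 2, gc / 2, gc / 2, (1 - gc) / 2]  # A, C, G, T
    return ["".join(rng.choices("ACGT", weights=weights, k=length))
            for _ in range(n)]
```

Because bases are sampled independently, the GC content of any single sequence only approximates the target, more closely for longer sequences.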

Lots more is not documented here. Look in the individual repo for more help.

Folders and files (457 commits):

Alternative_to_ITS1_finding
Diamond_BLAST_add_taxonomic_info
Fix_five_prime
GFF_stuff
ITS_copy_number
Introns
Lateral_gene_transfer_prediction_tool
NGS
RNAseq
Sanger_read_metagenetics
ScaffoldChecker
TransStart
accession_to_fasta
blast_output
cluster_analysis
convert_file_format
domain_searching
fasta
gene_model_testing
generate_ITS1_database
genomic_upstream_regions
homer
identifying_pallindrome
metapy
metapy_tools
misc
multithreadBait
primer_designer
produce_random_seq
reformat_fasta_hints_names_for_Braker
shell_ITS_clustering_pipline
snpeff
split_up_fasta_file_into_N_files
transposon_analysis
.gitignore
README.rst