Skip to content

Latest commit

 

History

History
6 lines (4 loc) · 428 Bytes

README.md

File metadata and controls

6 lines (4 loc) · 428 Bytes

CenturyFluEvol_Hamming

The full sequence dataset -- including avian sequences -- can be found in (zstd compressed) fasta format in genbank_full_aligned.fasta.zst.

The accompanying metadata is available in genbank_full.tsv.zst.

The Python code to compute and plot nucleotide Hamming distances (rooted Hamming maps as well as unrooted Hamming distributions) is available as the interactive Python notebook HammingPlots.ipynb.