A list of mutations observed by public health or academic institutions engaged in sequencing SARS-CoV-2 clinical specimens. Listed mutations are submitted by the community and come with no guarantees or endorsements. Mutations relative to reference NC_045512.2 are provided in both table and VCF formats (coming soon) as described below. Previous versions of the table are stored in previous_tsv with filenames timestamped to indicate their revision date. A detailed history of updates to the table can be found in the changelog.
See also: https://github.com/W-L/ProblematicSites_SARS-CoV2
- Facilitate sample review by public health laboratories.
- Aid review of assemblies flagged by QC processes due to presumed disruptive mutations.
- Improve submission of SARS-CoV-2 genome sequence assemblies to public data repositories.
- Compile problematic sites that may impact downstream bioinformatic analyses.
- Inform further development of targeted assays for SARS-CoV-2.
Before adding, submitters are expected to confirm the validity of mutations by inspecting read mapping results (for example, upload your FASTA and BAM files to IGV) to ensure high confidence according to the suggested criteria:
- Minimum 90% genome-wide coverage breadth
- Mimimum 30x coverage depth at mutation
- Mimimum 80% read agreement for mutation
- Optional: confirm by Sanger sequencing
- Download /import/mut_template.csv and save with a filename containing an unique identifier using format:
[YYYYMMDD].[YourName].[0-9]{3}.csv
that can be used to link entries between the table and changelog (eg.20210812.Weigand.123.csv
). - Enter mutation information according to the column descriptions below, entering 'NA' for any missing attributes except those marked as required. You may add multiple entries to this file.
- Create a new GitHub issue with title "Add: mutation table entry" (Do you need a GitHub account?).
- Drag-and-drop the new csv file containing your entries.
- Submit your issue. The new entry will be reviewed and you will be notified when the mutation table has been updated.
Header | Description |
---|---|
Chrom * | Name of the reference sequence NC_045512.2 |
Gene * | Annotated gene in the reference |
Position * | Base position in the reference |
Reference * | Reference sequence |
Mutation * | Mutation sequence change (use '-' for deletion) |
Description * | Human-readable description of mutation |
Example | GenBank and/or SRA accession of example genome with confirmed mutation |
Library | Primer set or library method information |
Platform | Sequencing platform information |
Analysis | Bioinformatic pipeline information (version) |
Lineage | Affected PANGO lineages (Pangolin version) |
Submitter * | Name and affiliation of mutation submitter |
Date * | Date added to table |
(* required)
Header | Description |
---|---|
CHROM | Name of the reference sequence |
POS | 1-based position of the variation on the reference |
ID | ???? |
REF | Reference base |
ALT | List of alternative alleles at the position (IUPAC ambiguity code) |
QUAL | ???? |
FILTER | ???? |
INFO | ???? |
Please submit suggested revisions to inclusion criteria, table updates, or other feedback by opening an new GitHub issue and including "mutation table" somewhere in the title.
This resource is maintained by CDC's Technical Outreach and Asstance for States Team (TOAST), 🍞.