Skip to content

Possible mismatch on genomic representation for NM_004260.4:c.2296C>G on GRCh37 #1884

Open
@gcampof

Description

@gcampof

Describe the issue

I’ve encountered a discrepancy in the genomic representation of the transcript variant NM_004260.4:c.2296C>G when using the VEP GRCh37 online tool, and I would appreciate your input on whether this is expected behavior or a potential issue.
 
When I input this variant into VEP (GRCh37 web tool), I get the following genomic representation:
8-145738768-G-C
(job can be found here)
This appears to be a simple base substitution. However, when I use VariantValidator, the output for GRCh37 is:
8-145738768-GG-C
which is interpreted as a delins.
 
On further investigation, we observed that the GRCh37 reference genome contains an extra G at the position 145738768 compared to the transcript NM_004260.4. 
    • Is the current VEP output (8-145738768-G-C) considered accurate in this context, or is it a simplified representation that does not take such transcript-genome alignment differences into account?
    • Does VEP account for alignment gaps or discrepancies between transcript and genomic references when generating HGVSg output?
    • Would a representation like 8-145738768-GG-C be more appropriate in this case?
Any clarification on how VEP handles such cases, and whether this is expected behavior, would be greatly appreciated.

Best regards,
Guillem 

Additional information

Screenshot of the aforementioned region in hg19 visualized using IGV

Image

System

  • VEP version: GrCh37 web tool

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions