Skip to content

Author lists do not deduplicate authors with differing numbers of initials #508

Open
@corneliusroemer

Description

@corneliusroemer

When there are multiple author lists due to e.g. direct submission and publication being present on a Genbank record and the 2 publications contain the same author with differing numbers of initials, NCBI virus and datasets seem to not deduplicate the authors.

Note that Marti,M.A. and Marti,M. is the same individual appearing twice in NCBI virus (and datasets):

Image

https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Nucleotide&ids=MG773272

The likely root cause is that the authors list is generated by making a set of authors from all references of the genbank record:

Image

https://www.ncbi.nlm.nih.gov/nuccore/MG773272.1

In this case, the direct submission lists Marti,M. while the publication has Marti,M.A..

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions