Wikidata:Property proposal/GenBank assembly accession

GenBank Assembly accession edit

Originally proposed at Wikidata:Property proposal/Generic

DescriptionGenome assembly accession identifier for the identification of strains with a corresponding assembly.
RepresentsAmycolatopsis mediterranei U32 (Q21102998)
Data typeExternal identifier
Domaingenome (Q7020) strain (Q855769)
Allowed values^GCA_[0-9]{9}\.[0-9]$
ExampleRhodobacter sphaeroides 2.4.1 Rhodobacter sphaeroides 2.4.1 (Q21102953)GCA_000012905.2
Planned useComplement existing strains with assembly information
Formatter URLhttp://www.ebi.ac.uk/ena/data/view/$1

Various strains are being populated into WikiData with identifiers. However sequenced organisms also have an assembly identifier which represents a unique strain / organism as whole. This identifier is of significant importance when using it to match information to different databases as sequence identifiers can differ between different resources. jjkoehorst (talk) 08:55, 29 September 2017 (UTC)[reply]

  WikiProject Molecular biology has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

Discussion