Property talk:P637
Documentation
identifier for a protein
[NYXW]P_(\d{6}|\d{9})(\.\d{1,2})?
”: value must be formatted using this pattern (PCRE syntax). (Help)List of violations of this constraint: Database reports/Constraint violations/P637#Format, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P637#Entity types
List of violations of this constraint: Database reports/Constraint violations/P637#Scope, SPARQL
This property is being used by: Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.) |
|
|
|
AMR format
editBesides the many ID's with the format [NYX]P_(\d{6}|\d{9})(\.\d{1,2})?
there are also some ID's in the form AMR\d+
. Are these also okay and should we change the format constraint? --Pasleim (talk) 08:49, 4 October 2016 (UTC)
Remove Distinct Value Constraint
editDue to NCBI's Prokaryotic RefSeq reannotation project, many prokaryotic ref seq protein ID's are no longer unique. This new type of identifier, the non redundant ref seq ID, begins with the WP prefix and indicates the protein is shared among many strains. To remove redundancy, NCBI now combines identical prokaryotic proteins into a single ID, and unfortunately will invalidate the distinct value constraint of this property. For instance, Chlamydia muridarum Str. Nigg uses the new non redundant ref seq ID format, and its genes are annotated with non redundant ref seq IDs that are assigned to all strains of Chlamydia muridarum. (See pmp). In particular, you can see in the comment section: "This record represents a single, non-redundant, protein sequence which may be annotated on many different RefSeq genomes from the same, or different, species." Although it will no longer be possible to query a distinct protein using this property, the same behavior can be retained by combining the query with the tax ID of the organism. I believe that we should stay consistent with NCBI and remove the distinct value constraint from this property so we can continue to properly annotate new genes with non redundant ref seq IDs in WikiData. Djow2019 (talk) 21:28, 7 August 2018 (UTC)