Wikidata:Property proposal/MassBank Accession ID

MassBank Accession ID edit

Originally proposed at Wikidata:Property proposal/Natural science

DescriptionAccession number for entries in the MassBank database (records of mass spectrometry).
RepresentsMassBank (Q24088019)
Data typeExternal identifier
Domainchemical compound (Q11173)
Allowed values[A-Z]{2}[A-Z0-9][0-9]{5}
Example 1caffeine (Q60235)EA030311
Example 2kanamycin A (Q27094615) → SMI00011
Example 3paracetamol (Q57055) → AU112601
Sourcehttps://github.com/MassBank/MassBank-web/blob/master/Documentation/MassBankRecordFormat.md#2.1.1
Planned useLink Wikidata the MassBank (Q24088019) database entries.
Number of IDs in source53390
Expected completenessalways incomplete (Q21873886)
Formatter URLhttps://massbank.eu/MassBank/RecordDisplay.jsp?id=$1
Robot and gadget jobsAccession IDs will be added from a public CCZero dataset (to be compiled) with QuickStatements

Motivation edit

MassBank (Q24088019) is a international collaboration of mass spectra database. The SPLASH (Q50412900) is a unique spectral identifiers, but does not provide the provenance. This accession identifiers allows to link to specific MassBank records. --Egon Willighagen (talk) 09:51, 2 April 2019 (UTC)[reply]

Discussion edit

  •   Comment new RegEx simplified (already included once in list [0-9A-Z]). --Eihel (talk) 11:45, 2 April 2019 (UTC)[reply]
  •   Support David (talk) 06:18, 3 April 2019 (UTC)[reply]
  •   Support  Conditional support Could you make another Property proposal for the MoNA identifier and leave on this proposal only the massbank.eu link? MoNA is another db, linked on its side with other db. The fact that many IDs are the same does not matter: the sites are not the same. If you create a MassBank ID, the link (WD data) must go to its site (if possible) and not to another site that retrieves its data. Fortunately, you have already done most of the work  . Best regards. --Eihel (talk) 07:08, 3 April 2019 (UTC)[reply]
    Thanks. We can propose that, but we would be reliant on the MoNA team to provide us with the data. We'll ask. --Egon Willighagen (talk) 09:13, 3 April 2019 (UTC)[reply]
    Hello Egon Willighagen, you are making a Property proposal. You decide to name this Property "MassBank Accession ID". Then you tell me that you need the MoNA team for the data and thus form this new Property. Which means that the data does not have the same shape here. That's exactly what I wrote you. The sites and datas are different, so you have to remove http://mona.fiehnlab.ucdavis.edu/spectra/display/$1 from your proposal to be a Property. To avoid getting hit on the fingers, it is up to you to make a correct proposal: I can intervene only slightly on the proposals of others. You are free to make this new property proposal for MoNA. For your information :
    • Following this link [1], number of ids = 184114 in this proposal here.
    • This link, here, gives you the data recorded on MoNA. The data is so different that you can write a RegEx as follows: .{1,40}. There are about 600,000 records on MoNA. This last line is dedicated only for a new property about MoNA. Cordially. --Eihel (talk) 13:58, 4 April 2019 (UTC)[reply]
  •   Support Seems to be an important addition. --Balabinrm (talk) 08:32, 7 April 2019 (UTC)[reply]
  •   Support Indeed a valuable addition --Andrawaag (talk) 19:07, 7 April 2019 (UTC)[reply]
  •   Support. YULdigitalpreservation (talk) 13:46, 11 April 2019 (UTC)[reply]

@Egon Willighagen, ديفيد عادل وهبة خليل 2, Balabinrm, Andrawaag, YULdigitalpreservation:   Done: MassBank accession ID (P6689) --Eihel (talk) 14:28, 17 April 2019 (UTC)[reply]