Wikidata:Property proposal/name-suggestion-index identifier

name-suggestion-index identifier edit

Originally proposed at Wikidata:Property proposal/Authority control

Descriptionidentifier for a brand in OpenStreetMap's name-suggestion-index
RepresentsName Suggestion Index (Q62108705)
Data typeExternal identifier
Domainretail chain (Q507619)
Allowed values[a-z_]{1,16}\/[a-z_]{1,48}\
Example 1McDonald’s (Q38076)amenity/fast_food|McDonald's
Example 2Shell (Q154950)amenity/fuel|Shell, shop/convenience|Shell, amenity/fuel|เชลล์
Example 3United States Postal Service (Q668687)amenity/post_office|United States Post Office
Source[1]
External linksUse in sister projects: [ar][de][en][es][fr][he][it][ja][ko][nl][pl][pt][ru][sv][vi][zh][commons][species][wd][en.wikt][fr.wikt].
Planned useUpon approval, this property will be mentioned in name-suggestion-index's contributing guide and the brand:wikidata key's official documentation, and NSI contributors will immediately begin adding the property to some existing items that have been deleted by mistake in the past.
Number of IDs in source5,747 identifiers, 5,213 that have corresponding Wikidata items as of 86be8c6fd568c9757ec1816362a58ffe13df0adf
Expected completenessalways incomplete (Q21873886)
Formatter URLhttps://nsi.guide/?id=$1
See also

Motivation edit

name-suggestion-index is the OpenStreetMap project's de facto authority for brand-related tagging information. Entries in NSI are presented to mappers as presets to choose from when mapping, alongside unbranded presets like "road", "lake", "restaurant", or "ATM". Most entries were created because NSI scripts flagged certain names as being common supermarket names (for instance) in the main OSM database.

Most entries are for brands that already have Wikidata items, so linking OpenStreetMap with Wikidata is just a matter of adding a brand:wikidata tag to the entry. It isn't feasible to link Wikidata to every instance of a chain store location in OSM, but it is possible to link to the brand's entry in NSI. It would only be feasible for this user script to query OSM for chain store locations to plot on a map when it knows what kind of business OSM considers the chain to be.

The idea of creating an identifier property for NSI has come up a couple times in the context of undeletion discussions (another instance). I don't think the presence of this property on an item would establish notability by itself, given that OSM isn't considered an authority on store locations. But it could give administrators a little more clarity when assessing whether a business-related item is merely undeveloped or whether it's spam.

 – Minh Nguyễn 💬 02:11, 9 November 2019 (UTC)[reply]

Discussion edit

  •   Comment you added links to your examples, and that is obviously helpful, but we should keep in mind that in the current form Wikidata will not be able to generate these links (since it only supports inserting the full statement value in a formatter URL). There are a few options:
    • Use a URL datatype and store the whole URLs as values (they can still be restricted to a particular format via a constraint)
    • Set up a proxy which accepts values in the format you suggest and translates them to your service (ArthurPSmith can help) - this should only be done if the format you are proposing is already attested somewhere else
    • Find another format for which there already exists a service which

accepts these values as part of its URLs

Let me know if any of this is unclear. − Pintoch (talk) 13:45, 9 November 2019 (UTC)[reply]
Thanks for the suggestions Pintoch! NSI has been using this identifier format on its pages but not in its URLs. I've proposed a change to NSI that would allow us to use a format of https://nsi.guide/?id=$1. – Minh Nguyễn 💬 17:27, 9 November 2019 (UTC)[reply]
I think it might make sense to leave the string itself unlinked, but put the URI (either of the k&v type or the id type) as a reference. After all, NSI is an authority of what identifiers it uses :) Arlo Barnes (talk) 22:14, 10 November 2019 (UTC)[reply]
  •   Support Seems valuable, to be able to access this classification from Wikidata. I would prefer the External ID datatype, especially if NSI URLS can be adapted to accept such values -- linked data is so helpful. And no good reason not to do this upfront, rather than through some semi-hidden reference mechanism. Would the applies to name of subject (P5168) qualifier generally be used to indicate the particular name being identified, or would this be redundant given the structure of the identifier? Jheald (talk) 17:57, 11 November 2019 (UTC)[reply]
  • @Jheald: Yes, if I understand the qualifier correctly, it would be helpful in the case of international brands like Shell (Q154950) above. – Minh Nguyễn 💬 02:36, 23 November 2019 (UTC)[reply]
  comment 2 Maybe use some qualifier (e.g. OpenStreetMap tag or key (P1282)) instead of "amenity/fast_food". --- Jura 10:08, 20 January 2020 (UTC)[reply]