Wikidata:Property proposal/Internet Encyclopedia of Ukraine ID

Internet Encyclopedia of Ukraine ID edit

Originally proposed at Wikidata:Property proposal/Authority control

Descriptionidentifier for an article in the Internet Encyclopedia of Ukraine
RepresentsInternet Encyclopedia of Ukraine (Q87193076)
Data typeExternal identifier
Domainitem
Example 1Kyiv (Q1899)pages\K\Y\Kyiv
Example 2Zaporizhzhia (Q157835)pages\Z\A\Zaporizhia
Example 3Mykola Pymonenko (Q2601651)pages\P\Y\PymonenkoMykola
Example 4concentration camp (Q152081)pages\C\O\Concentrationcamps
Example 5Paleolithic (Q40203)pages\P\A\PaleolithicPeriod
Example 6Pavlo Zahrebelnyi (Q983989)pages\Z\A\ZahrebelnyPavlo
External linksUse in sister projects: [ar][de][en][es][fr][he][it][ja][ko][nl][pl][pt][ru][sv][vi][zh][commons][species][wd][en.wikt][fr.wikt].
Planned useadd to items I’ve been editing, others based on links in en.wikipedia
Number of IDs in source8041
Expected completenessalways incomplete (Q21873886)
Formatter URLhttp://www.encyclopediaofukraine.com/display.asp?linkpath=$1
See alsoEncyclopedia of Modern Ukraine ID (P4613), Encyclopedia of the History of Ukraine ID (P8935)

Motivation edit

The Internet Encyclopedia of Ukraine is a useful reference on the subject, written by professionals, actively updated, and covering a broad range of subtopics.

It is based on the six-volume Encyclopedia of Ukraine (Q774515) (1977–85)—called the “most important reference work on Ukraine in English” by the authors of the Historical Dictionary of Ukraine (Toronto 2005, p 745)—which was in turn based on the 14-volume Ukrainian-Language Entsyklopediia ukraïnoznavstva (Munich 1949–95). It is linked 1,652 times in en.wikipedia, 598 times in de., and 231 times in uk. It appears over 200 times in Google Scholar results.

Note: an identifier in the form pages\K\Y\Kyiv seems to work fine in my browser, but in the address bar the URL query is converted to a URL-encoded version pages%5CK%5CY%5CKyiv. I believe this may be a function of the web browser, and I don’t know if it’s safe to use the raw text version. —Michael Z. 04:07, 2 January 2021 (UTC)[reply]

A bit of research confirms that the backslash character “\” is unsafe in a URL, according to RFC 1738 Uniform Resource Locators (URL). It is not one of the reserved or unreserved URL characters, and so must be percent-encoded. That said, currently http://www.encyclopediaofukraine.com/display.asp?linkpath=pages\K\Y\Kyiv works for me, but we should adhere to basic responsible software practices (be liberal in what you accept, and conservative in what you produce).
Is there a way to implement something like http://www.encyclopediaofukraine.com/display.asp?linkpath=URLENCODE($1).htm, to render a safe URL for a link? If not, then we must store %5C.
Researching URI details has also made me aware that the query portion of the URL is technically ?linkpath=pages\K\Y\Kyiv.htm, and the content of the query argument link path is pages\K\Y\Kyiv.htm, including the .htm (the path name ends with the file extension .asp, and the query starts with the question mark). So a safer, future-proof version of the ID might be the full string pages\K\Y\Kyiv.htm, URL encoded as pages%5CK%5CY%5CKyiv.htm.
On the other hand, random testing shows that in practice, the .htm is not currently required, and pages\K\Y\Kyiv (pages%5CK%5CY%5CKyiv) is the most minimal empirical ID. I am updating the proposal to reflect this. —Michael Z. 17:54, 2 January 2021 (UTC)[reply]

Discussion edit

@Mzajac, Epìdosis, ArthurPSmith, Adam Harangozó, Gerwoman:   Done Internet Encyclopedia of Ukraine ID (P9070) Pamputt (talk) 12:47, 20 January 2021 (UTC)[reply]