Wikidata:Property proposal/RBDcode
EU River Basin District code / euRBDCode
Description | Unique code for a EU River Basin District |
---|---|
Represents | river basin district (Q132017) |
Data type | External identifier |
Domain | Items of instance river basin district (Q132017) |
Allowed values | [A-Z]{2}[A-Z0-9_]+ |
Example | |
Source | [1] |
Planned use | This is part of the WFD to Wikidata project aimed at making use of the WFD reporting data in Wikidata. |
Formatter URL | http://dd.eionet.europa.eu/vocabularyconcept/wise/SpatialUnit/euRBDCode.$1 |
See also | EU Surface Water Body Code (P2856) |
- Motivation
Unique code denoting River Basin Districts (RBD) within the EU, which can be used to structure the information on lakes and other surface waters on Wikidata.
Disclaimer: Wikimedia Sweden is working together with the European Environment Agency (Q632988) to identify parts of the Water Framework Directive (Q1508115) reporting data which can be of use to Wikidata.
/ André Costa (WMSE) (talk) 08:08, 5 July 2016 (UTC)
- Discussion
- Support. Thryduulf (talk) 09:13, 5 July 2016 (UTC)
- Support. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:33, 5 July 2016 (UTC)
- Support. However, could you please write a more restrictive format pattern instead of .+? For example, are lower case allowed? Dots? Hyphens? Could a three symbols string be a valid ID? And a thirty symbols string? --abián 14:14, 5 July 2016 (UTC)
- There doesn't seem to be any limitations imposed other than the first two being the country iso-code and that the whole code needs to be unique. After the first two characters it's up to each country to decide on an id. And yes even three characters is ok (e.g. SE1). /André Costa (WMSE) (talk) 16:20, 5 July 2016 (UTC)
- The bad point is that "." means any character, even a space, and 字, and Ŋ, and ª, and →, and ™, and ♥, and ☹, and ☮, and ⅛, etc. I think that characters that will never appear as a part of an EU River Basin District code should be excluded to easily detect mistakes and vandalism even if these characters aren't explicitly excluded by the responsible institution. It would be great to get a limited, although wide enough, character set for these IDs. --abián 13:51, 6 July 2016 (UTC)
- I can make an educated guess that the characters are limited to [a-zA-Z0-9_-] but I cannot guarantee that that is the case. /André Costa (WMSE) (talk) 09:11, 7 July 2016 (UTC)
- So I just crunched all available identifiers and they are limited to [A-Z0-9_]. So we can probably add that and then re-evaluate if something else pops up later. /André Costa (WMSE) (talk) 09:20, 7 July 2016 (UTC)
- Great! Thanks for your help, André. --abián 14:51, 7 July 2016 (UTC)
- Support VIGNERON (talk) 13:13, 6 July 2016 (UTC)
@André Costa (WMSE), Thryduulf, Pigsonthewing, Abián, VIGNERON: Done Now EU River Basin District code (P2965) -- Lymantria (talk) 07:59, 14 July 2016 (UTC)
- Realized I accidentally copy-pasted the wrong formatter code above (the examples used the right one). I changed it here and will change in the property. /André Costa (WMSE) (talk) 10:41, 14 July 2016 (UTC)
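
As an illustration of the outcome of the discussion, the following is a minimal sketch of how the allowed-values pattern [A-Z]{2}[A-Z0-9_]+ and the formatter URL from the proposal could be applied to a candidate code. The function names are hypothetical, and the sketch assumes the pattern must match the whole value rather than a substring.

```python
import re

# Pattern taken from the "Allowed values" row of the proposal:
# two upper-case letters (country code), then one or more of A-Z, 0-9 or "_".
RBD_CODE = re.compile(r"[A-Z]{2}[A-Z0-9_]+")

# Formatter URL taken from the proposal; "$1" stands for the code.
FORMATTER_URL = (
    "http://dd.eionet.europa.eu/vocabularyconcept/wise/SpatialUnit/euRBDCode.$1"
)


def is_valid_rbd_code(code: str) -> bool:
    """Return True if the whole string matches the allowed-values pattern."""
    # Assumption: the constraint applies to the entire value, hence fullmatch.
    return RBD_CODE.fullmatch(code) is not None


def format_url(code: str) -> str:
    """Substitute a valid code for "$1" in the formatter URL."""
    if not is_valid_rbd_code(code):
        raise ValueError(f"not a valid euRBDCode: {code!r}")
    return FORMATTER_URL.replace("$1", code)
```

With this sketch, the three-character example from the discussion (SE1) is accepted, while lower-case input, spaces, or symbols such as ♥ are rejected, which is the kind of mistake and vandalism detection the narrower character set was meant to allow.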