Wikidata:Bot requests/Archive/2019/01

Import Slovene registry of cultural monuments

Request date: 1 January 2019, by: Yerpo

Task description

Starting the year with a large addition to the project.

Here's a CSV file containing data about all listed monuments in Slovenia, downloaded from the official registry: https://www.dropbox.com/s/nfqc8ttluqbm9u1/kultura_koordinate_WDready.csv?dl=0 It contains data on 8424 currently listed monuments to be used for organizing the WLM competition in Slovenia (among other uses). I already converted the names to Q values, now somebody please help me to import this.

Instructions
The main task is to create new WD objects for monuments or update existing objects by their Slovene Cultural Heritage Register ID (P1587) value. There's currently a bit over 750 in WD created and maintained by hand, those should be corrected where necessary. To be done as follows for each row (referring to column titles in the CSV file):

Additionally, for all objects:

I suggest the person volunteering to do this makes a test run of a few dozen objects for verifying. I'm available for additional clarifications. Thanks in advance in the name of the Slovene Wikipedia community! :) — Yerpo Eh? 19:51, 1 January 2019 (UTC)

Copyright

As an official database made by a state ministry, I believe the data is copyright free (per Copyright and related rights act, art. 9).

Discussion

@Yerpo: I did some test edits [1]. Are they okay? --Pasleim (talk) 22:03, 1 January 2019 (UTC)

@Pasleim: thank you, looking great, with some minor notes:
  • If a WD object already has instance of (P31), ignore OBJECT completely, do not add a second P31.
  • Can you please set UTF-8 encoding for processing names? The special characters of Slovene language (čžš) came out garbled.
  • In case you wonder, I made this complete revert because the existing ID was wrong. Not a bot error.
Other than that, I see no problems. Do you think the data and instructions are solid enough now or would you prefer running a bigger test? — Yerpo Eh? 07:52, 2 January 2019 (UTC)
@Yerpo: Can you check if the encoding is not already wrong in the Dropbox file? It shows me that it is a non-ISO extended ASCII text and I see for example "Domaèija Škerlak" and not "Domačija Škerlak". --Pasleim (talk) 08:38, 2 January 2019 (UTC)
@Pasleim: apparently, CSV files do not natively have an encoding, so it depends on the reader. For example, Notepad on my Windows installation displays the special characters properly, but Dropbox's viewer displays the same as you see. I made another version with forced encoding (here), try this one if it's better - I also see č ok in Notepad on this version, whereas Dropbox displays junk symbols. — Yerpo Eh? 09:01, 2 January 2019 (UTC)
EDIT: I managed to reproduce the problem with read.csv command in R which behaves the same as Dropbox, but displays the characters properly if encoding="UTF-8" option is set (for both versions of the file). If it's easier for you, I can just make a normal txt file with UTF-8 encoding. — Yerpo Eh? 09:24, 2 January 2019 (UTC)
Thanks, the second file works fine. Here are some more test edits: [2]. --Pasleim (talk) 19:18, 2 January 2019 (UTC)
As far as I can see, it looks perfect now. Please do the whole run. — Yerpo Eh? 20:51, 2 January 2019 (UTC)

Many thanks, Pasleim, everything seems to be in order. I intend to do this periodically to account for changes, say, once per year, can I contact you again for the import? — Yerpo Eh? 06:16, 3 January 2019 (UTC)

Feel free to contact me anytime. --Pasleim (talk) 15:07, 3 January 2019 (UTC)
Request process

  Done --Pasleim (talk) 15:07, 3 January 2019 (UTC)

This section was archived on a request by: Pasleim (talk) 15:07, 3 January 2019 (UTC)

Replacing values keeping ref/qual/ranks

Request date: 10 January 2019, by: Infovarius

I've done some job exactifying city (Q515)->city or town (Q7930989) for Russian (and some Soviet) settlements. As QuickStatements doesn't allow changing only values of statements I at first removed wrong value then added correct one. And of course all references, ranks and qualifiers are gone. The batch is https://tools.wmflabs.org/quickstatements/#/batch/6050.

Task description

Can anyone restore deleted references, ranks and qualifiers for new values? I.e. finding change like this do like that

@Magnus Manske: can you add a possibility of changing only value of statements keeping other information?

Discussion

I can not easily restore references but I can change values with my bot. So if you undo all your edits with [3] I could afterwards change the value without losing references, qualifiers and ranks. --Pasleim (talk) 15:29, 10 January 2019 (UTC)

Ok, I am undoing it. In an hour it will be done. Then, Pasleim, please redo change city (Q515)->city or town (Q7930989) at the same group of items. --Infovarius (talk) 09:20, 15 January 2019 (UTC)
Request process

  Done --Pasleim (talk) 10:57, 15 January 2019 (UTC)

This section was archived on a request by: --Pasleim (talk) 10:57, 15 January 2019 (UTC)

pq:P31 -> pq:P3831 for trailer (Q622550)

"Now" that we have object has role (P3831) could someone kindly switch the qualifier from P31 to P3831 for qualifier values trailer (Q622550).

It's mostly YouTube video ID (P1651) statements, but also a few others. currently 345 --- Jura 13:01, 2 January 2019 (UTC)

  Done Matěj Suchánek (talk) 15:55, 15 February 2019 (UTC)
This section was archived on a request by: Matěj Suchánek (talk) 15:55, 15 February 2019 (UTC)

Description of Nigerian LGAs in French

Request date: 24 January 2019, by: DonCamillo

Link to discussions justifying the request
Task description

Hi, For some reason, hundreds of Nigerian Local Government Zones (the subdivision of States in Nigeria) are described in French as "zone de gouvernement local nigérienne". Comes from a quite common confusion in French between "nigérienne" (from Niger) and "nigériane" (from Nigeria). Would it be possible to replace "zone de gouvernement local nigérienne" with "zone de gouvernement local nigériane" in all items which are instances of "local government area of Nigeria" (Q1639634)? Thanks a lot! DonCamillo (talk) 11:58, 24 January 2019 (UTC)

Licence of data to import (if relevant)
Discussion


Request process

Accepted by (VIGNERON (talk) 15:05, 12 February 2019 (UTC)) and under process. Oh put*n, jolie trouvaille DonCamillo, ces erreurs sont là depuis 2014 (et double erreur, la description est souvent "locale" au lieu de "local" #facepalm). Cdlt, VIGNERON (talk) 15:05, 12 February 2019 (UTC)

Task completed (I even added it where there was no description). PS: fr:Zone de gouvernement local au Nigeria says there is 774 of them but Wikidata only has 763 items with instance of (P31) = local government area of Nigeria (Q1639634). Cheers, VIGNERON (talk) 15:37, 12 February 2019 (UTC)

This section was archived on a request by: Matěj Suchánek (talk) 14:46, 15 February 2019 (UTC)

request for a bot to import population data

Request date: 10 January 2019, by: Histobot

Link to discussions justifying the request
Task description

Import municipal population data of Dutch municipalities from the Statistics Netherlands https://www.cbs.nl open data portal from the dataset: https://opendata.cbs.nl/ODataApi/odata/37259ned. Specifically add population data from 1960 to 2017 to every municipality using the property population. This will facilitate the use of reliable and consistent population data in other projects. Look for instance at the municpality of Zwolle and its population data. I will try to add this data using openrefine.

Licence of data to import (if relevant)

CC BY 4.0

Discussion

Histobot (talk) 15:56, 10 January 2019 (UTC)

Request process

Auto-adding complementary values

Request date: 11 January 2019, by: Jc86035

Link to discussions justifying the request
Task description

There should be a bot to add complementary values for Genius artist ID (P2373), Genius album ID (P6217) and Genius ID (P6218). For all Genius artist ID (P2373) values without Genius artist numeric ID (P6351), the bot should add the first match of regex \{"name":"artist_id","values":["(\d+)" in the linked page, and vice versa with the first match of regex "slug":"([0-9A-Z][0-9a-z-]*[0-9a-z]|[0-9A-Z])". Preferred and deprecated ranks should be inferred when adding new values, although if multiple statements to be added have the same value but different rank then only the statement with the higher rank should be used. The values should be periodically checked to see if they match, and errors should be reported somewhere (probably on-wiki). The same should also be implemented for the other two pairs of properties, Genius album ID (P6217)/Genius artist numeric ID (P6351) and Genius ID (P6218)/Genius song numeric ID (P6361).

Licence of data to import (if relevant)

N/A (presumed not copyrightable)

Discussion

All of the properties now exist. Jc86035 (talk) 10:57, 15 January 2019 (UTC)

Request process