Wikidata:Requests for permissions/Bot/JonHaraldSøbyWMNO-bot 2
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 19:32, 29 October 2020 (UTC)
JonHaraldSøbyWMNO-bot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Jon Harald Søby (WMNO) (talk • contribs • logs)
Task/s: Add and modify items about places in Norway.
Code: N/A
Function details: I have a data set of around 27,000 places in Norway to add to Wikidata. Around 1/3 of these already have items in Wikidata (the majority from the Cebuano Wikipedia), while 2/3 would be new items. I will use QuickStatements to do the actual addition, but the code to prepare the QS batches is very specific to this dataset, so I haven't seen the need to publish it on GitHub. All places have IDs in the Sentralt stadnamnregister (SSR place name number (P1850)), so I will add those IDs (and whatever other data from SSR that is missing) to reconciled items, as well as creating new items for places with no existing items. Jon Harald Søby (WMNO) (talk) 11:23, 28 September 2020 (UTC)
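The operator's batch-preparation code is not published, so the following is only a minimal sketch of how the two cases described above (reconciled items versus new items) could be turned into QuickStatements V1 commands. The function name `qs_commands`, the `nb` label language, and the sample values are assumptions for illustration, not details from the request.

```python
from typing import Optional


def qs_commands(ssr_id: str, name: str, qid: Optional[str] = None) -> list[str]:
    """Build tab-separated QuickStatements V1 lines for one place.

    ssr_id: SSR place name number, stored as an external ID in P1850.
    name:   place name, used as the label of a new item.
    qid:    existing item ID if the place was reconciled, else None.
    """
    if qid is not None:
        # Reconciled place: only attach the SSR ID to the existing item.
        return [f'{qid}\tP1850\t"{ssr_id}"']
    # Unreconciled place: create a new item, then refer to it as LAST.
    return [
        "CREATE",
        f'LAST\tLnb\t"{name}"',  # "nb" label language is an assumption
        f'LAST\tP1850\t"{ssr_id}"',
    ]


# Hypothetical sample values, not taken from the SSR dataset.
existing = qs_commands("000000", "Eksempelvatnet", qid="Q0")
new_item = qs_commands("000001", "Eksempelhaugen")
print("\n".join(existing + new_item))
```

Keeping the reconciled and new-item paths in one function makes it easy to review a batch line by line before pasting it into QuickStatements.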
Discussion
Question Where does the data set come from? How are items reconciled? Will there be duplicates? --Haansn08 (talk) 18:22, 28 September 2020 (UTC)
- @Haansn08: The data comes from the Norwegian Mapping Authority (Kartverket), from their placenames dataset. All that data for a single place can be accessed via SSR place name number (P1850). All items have been reconciled with OpenRefine. And there shouldn't be any duplicates, no, we've been careful to avoid that during the reconciliation process. Jon Harald Søby (WMNO) (talk) 06:34, 29 September 2020 (UTC)
Support --Haansn08 (talk) 16:05, 29 September 2020 (UTC)
Comment Is there anything missing? Should I do some test edits? Jon Harald Søby (WMNO) (talk) 13:44, 9 October 2020 (UTC)
- Yes, please do--Ymblanter (talk) 19:30, 10 October 2020 (UTC)
- @Ymblanter: 123 test edits done, please let me know if you see anything that should be fixed. Jon Harald Søby (WMNO) (talk) 08:50, 28 October 2020 (UTC)
- If you don't think the name is suitable as English label, would you at least add it as an English alias? Also, I think a minimal English description would help, e.g. "lake in <municipality>, <adm1>, Norway" --- Jura 09:01, 28 October 2020 (UTC)
- @Jura1: Thanks for this feedback! I have made some amendments to the script that generates the input for QuickStatements, so English labels and descriptions will be a part of the items the next time I run it. Jon Harald Søby (WMNO) (talk) 12:28, 28 October 2020 (UTC)
- Great, thanks. I will approve in one or two days provided no objections have been raised.--Ymblanter (talk) 14:27, 28 October 2020 (UTC)
- @Jon Harald Søby (WMNO): Great. Could we see some samples after the amendments? --- Jura 18:35, 28 October 2020 (UTC)
- @Jura1: Sure! Raate (Q100989120) and Goeblenjohke (Q100989121). Jon Harald Søby (WMNO) (talk) 09:13, 29 October 2020 (UTC)
- @Jon Harald Søby (WMNO): looks good. If it's too complex, maybe this could be added afterwards. For P31, I suppose there is some reason why geographic location (Q2221906) is used instead of mound (Q1584134). --- Jura 10:08, 29 October 2020 (UTC)
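The English label, alias and description suggested in the thread above could be generated roughly as follows. This is a hedged sketch, not the operator's amended script: the function `english_terms`, its parameters, and the sample values are hypothetical, and only the description pattern "lake in <municipality>, <adm1>, Norway" comes from the discussion.

```python
def english_terms(name: str, kind: str, municipality: str, adm1: str,
                  use_as_label: bool = True) -> list[str]:
    """Build QuickStatements V1 lines for English terms on a new item.

    If the name is not considered suitable as an English label, it is
    added as an English alias (Aen) instead of a label (Len).
    """
    description = f"{kind} in {municipality}, {adm1}, Norway"
    term = "Len" if use_as_label else "Aen"
    return [
        f'LAST\t{term}\t"{name}"',
        f'LAST\tDen\t"{description}"',
    ]


# Hypothetical sample values for illustration only.
for line in english_terms("Eksempelvatnet", "lake", "Eksempel", "Eksempelfylke"):
    print(line)
```

These lines would follow a `CREATE` command in the same batch, since `LAST` refers to the most recently created item.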