Wikidata:Bot requests/Archive/2022/01
![]() | This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Revival of Orcbot
Request date: 11 January 2022, by Eva: EvaSeidlmayer
- Link to discussions justifying the request
- https://github.com/EvaSeidlmayer/orcid-for-wikidata
- https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot/Orcbot
- https://www.wikidata.org/wiki/Wikidata:Bot_requests/Archive/2020/06
- Task description
Orcbot makes use of ORCID public data file where authors (carrying ORCID-ID) indicate their publications (carrying DOI, pubmed-ID, or other ID). In Wikidata we find a lot of publications where authors are only registered as author name string (P2093). Based on the mentioned identifiers, we can enter the authors as author (P50) to Wikidata.
In Orcbot, it is important to only connect authors and publications that are already listed in Wikidata and not add new data. Therefore, Orcbot helps to connect existing data and improves data quality - not quantity.
I just found out that the bot permissions of Orcbot/Orcid for Wikidata seems to be outdated. So I guess, I need to apply again for the Orcbot. I recently improved the code regarding some remarks of User:Rdmpage. The new version takes care to transfer series ordinal series ordinal (P1545) from author name string author name string (P2093) to author author (P50). It will also remove author name string author name string (P2093) statement afterwards and add a statement about the source of information stated in stated in (P248) as ORCID Public Data File 2021 ORCID Public Data File 2021 (Q110411020) currently. Sorry, due to job issues I was out of work for some month.
- Licence of data to import (if relevant)
The data provided by ORCID is licensed with CC0: https://orcid.org/about/what-is-orcid/principles
- Discussion
- @EvaSeidlmayer: Did you mean to post this on Wikidata:Requests for permissions/Bot? You are the creator of OrcBot right? You just want permission to run it? I'm confused how this is different from Wikidata:Requests for permissions/Bot/Orcbot. That bot request is very old. Maybe scrap it and start a new one? BrokenSegue (talk) 15:10, 12 January 2022 (UTC)
- Dear @BrokenSegue: yes, it is confusing. So to cut a long story short, I had the bot right for Orcbot. Then it was a year off. In between I improved the code and had some other issues at work, this is why I did not take it productively in 2021. Now I have tried to reactivate Orcbot. However, apparently Orcbot does not have the bot rights *anymore*. This is why I thought I should apply again. Maybe I'm doing another thing wrong too. Eva (talk) 16:04, 12 January 2022 (UTC)
- But I don't see a history of having bot permissions at [1]? But yeah this page is more for asking someone else to write a bot for you. Not for requesting bot rights for an account you own. BrokenSegue (talk) 16:08, 12 January 2022 (UTC)
- So, maybe this is rather the place where I should ask for help why the bot does not work?: https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot/Orcbot I am kind of confused about the status Orcbot had. If it was/is a *real* bot or what else. Eva (talk) 17:10, 12 January 2022 (UTC)
- But I don't see a history of having bot permissions at [1]? But yeah this page is more for asking someone else to write a bot for you. Not for requesting bot rights for an account you own. BrokenSegue (talk) 16:08, 12 January 2022 (UTC)
- Request process
- This section was archived on a request by: please use Wikidata:Requests for permissions/Bot/Orcbot --- Jura 09:28, 22 January 2022 (UTC)
Remove all claims from deleted property (2022-01-31)
Request date: 31 January 2022, by: MSGJ
- Link to discussions justifying the request
- Task description
Remove all statements with P4883 (P4883). Property has been deleted at PfD. List of items here: Special:WhatLinksHere/Property:P4883. Thank you — Martin (MSGJ · talk) 12:19, 31 January 2022 (UTC)
- I am also intending to delete P4886 (P4886) with the same rationale. Please can all these statements be removed too? — Martin (MSGJ · talk) 12:22, 31 January 2022 (UTC)
- Discussion
- Can you avoid deleting properties that are still in use? --- Jura 22:12, 1 February 2022 (UTC)
- Request process
- deleted. see [2]. BrokenSegue (talk) 01:14, 2 February 2022 (UTC)
- here's a pastebin to delete the P4886 via QS ones. BrokenSegue (talk) 01:26, 2 February 2022 (UTC)
- Thanks BrokenSegue. It would be great if you could write up some instructions about how you created that code. I thought I was familiar with QuickSatements but that is new for me. — Martin (MSGJ · talk) 22:25, 2 February 2022 (UTC)
- @MSGJ: I just wrote a short ruby script and some bash. There's nothing specific to share I'm afraid. It would be nice if someone bundled this into a reusable tool somehow. I write the same short script over and over every time I do this. BrokenSegue (talk) 02:13, 3 February 2022 (UTC)
- Thanks BrokenSegue. It would be great if you could write up some instructions about how you created that code. I thought I was familiar with QuickSatements but that is new for me. — Martin (MSGJ · talk) 22:25, 2 February 2022 (UTC)
- This section was archived on a request by: --- Jura 15:35, 2 February 2022 (UTC)
request to refine class of items about cadastral municipalities/areas according to country (2022-01-10)
Request date: 10 January 2022, by: UV
- Link to discussions justifying the request
- Task description
- For all items with
- For all items with
- For all items with
- For all items with
Thank you!
- Licence of data to import (if relevant)
- Discussion
- Support Vojtěch Dostál (talk) 11:56, 11 January 2022 (UTC)
- Request process
@UV: Done remaining cases fixed.--Vojtěch Dostál (talk) 11:13, 3 July 2022 (UTC)
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. Vojtěch Dostál (talk) 11:13, 3 July 2022 (UTC) |
Import Treccani IDs
Request date: 6 February 2019, by: Epìdosis
- Task description
At the moment we have four identifiers referring to http://www.treccani.it/: Treccani's Biographical Dictionary of Italian People ID (P1986), Treccani ID (P3365), Treccani's Enciclopedia Italiana ID (P4223), Treccani's Dizionario di Storia ID (P6404). Each article of these works has, in the right column "ALTRI RISULTATI PER", a link to the articles regarding the same topic in other works (e.g. Ugolino della Gherardesca (Q706003)Treccani ID (P3365)conte-ugolino, http://www.treccani.it/enciclopedia/conte-ugolino/ has links also to Enciclopedia Italiana (Treccani's Enciclopedia Italiana ID (P4223) and Dizionario di Storia (Treccani's Dizionario di Storia ID (P6404)). This cases are extremely frequent: many items have Treccani's Biographical Dictionary of Italian People ID (P1986) and not Treccani ID (P3365)/Treccani's Enciclopedia Italiana ID (P4223); others have Treccani ID (P3365) and not Treccani's Enciclopedia Italiana ID (P4223); nearly no item has Treccani's Dizionario di Storia ID (P6404), recently created.
My request is: check each value of these identifiers in order obtain values for the other three identifiers through the column "ALTRI RISULTATI PER".
- Discussion
- Treccani Vocabulary ID (P5844) can be present too; e.g., at https://www.treccani.it/enciclopedia/bandiera_%28Enciclopedia-Italiana%29/
- There seems to be no impediment to crawling the site in https://www.treccani.it/robots.txt
- The Enciclopedia Italiana has a number of appendici, and there can be ids for a number of articles, as already present at Q298#P4223 and Q298#P4223. Are those good models? @Epìdosis: are you still interested in this? William Avery (talk) 19:21, 25 November 2021 (UTC)
- Yes of course; thanks for resuming this. The appendici of Treccani's Enciclopedia Italiana ID (P4223) are of course very useful, and surely Q298#P4223 is a very good model for their insertion. I am available for any other question or comment, of course. Good evening, --Epìdosis 19:55, 25 November 2021 (UTC)
I have used a script to run a few test edits that added Enciclopedia Italiana and other ids to the following items:
- energy (Q11379) diff
- Ugolino della Gherardesca (Q706003) diff
- Arthur Wellesley, 1st Duke of Wellington (Q131691) diff
- William the Silent (Q78454) diff
- William Wallace (Q82457) diff
- Ypres (Q102728) diff
- Giza (Q81788) diff
- Windsor (Q464955) diff
- Ricinus communis (Q155867) diff
- Pistacia lentiscus (Q159812) diff
- Jerusalem artichoke (Q146190) diff
I am wondering whether it will always be acceptable to add the ids that are found to the item that has the original Treccani id. It seems to me that the correspondence is good for instances of human beings, human settlements, geographical features, biological taxa, to name a few. However, I think it would be unwise to attempt an automated bot run for countries:
- Kingdom of Sicily (Q188586) should have the id for SICILIA e PUGLIA, Regno di?
- Algeria (Q262) should not have an id for the 1929 Enciclopedia Italiana article. Correct item is perhaps French Algeria (Q218272).
I will continue to run against single items for testing purposes, and add features to the script to control it running against chosen batches of items. William Avery (talk) 22:22, 28 November 2021 (UTC)
- Request process
- WD:BRFA § William Avery Bot 3 – to run my script as a bot, for instances of humans only.
- @Epìdosis: This task has been approved and User:William Avery Bot has been scheduled to process small batches. Just an initial 100 items a day for now. William Avery (talk) 12:05, 5 January 2022 (UTC)