Wikidata:Bot requests

Bot requests

If you have a bot request, add a new section using the button and tell exactly what you want. To reduce the process time, first discuss the legitimacy of your request with the community in the Project chat or in the Wikiprojects's talk page. Please refer to previous discussions justifying the task in your request.

For botflag requests, see Wikidata:Requests for permissions.

Tools available to all users which can be used to accomplish the work without the need for a bot:

  1. PetScan for creating items from Wikimedia pages and/or adding same statements to items
  2. QuickStatements for creating items and/or adding different statements to items
  3. Harvest Templates for importing statements from Wikimedia projects
  4. OpenRefine to import any type of data from tabular sources
  5. WikibaseJS-cli to write shell scripts to create and edit items in batch
  6. Programming libraries to write scripts or bots that create and edit items in batch
On this page, old discussions are archived. An overview of all archives can be found at this page's archive index. The current archive is located at 2022/07.
Filing cabinet icon.svg
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 2 days.

Import Treccani IDsEdit

Request date: 6 February 2019, by: Epìdosis

Task description

At the moment we have four identifiers referring to http://www.treccani.it/: Biographical Dictionary of Italian People ID (P1986), Treccani ID (P3365), Treccani's Enciclopedia Italiana ID (P4223), Treccani's Dizionario di Storia ID (P6404). Each article of these works has, in the right column "ALTRI RISULTATI PER", a link to the articles regarding the same topic in other works (e.g. Ugolino della Gherardesca (Q706003) Treccani ID (P3365) conte-ugolino, http://www.treccani.it/enciclopedia/conte-ugolino/ has links also to Enciclopedia Italiana (Treccani's Enciclopedia Italiana ID (P4223) and Dizionario di Storia (Treccani's Dizionario di Storia ID (P6404)). This cases are extremely frequent: many items have Biographical Dictionary of Italian People ID (P1986) and not Treccani ID (P3365)/Treccani's Enciclopedia Italiana ID (P4223); others have Treccani ID (P3365) and not Treccani's Enciclopedia Italiana ID (P4223); nearly no item has Treccani's Dizionario di Storia ID (P6404), recently created.

My request is: check each value of these identifiers in order obtain values for the other three identifiers through the column "ALTRI RISULTATI PER".

Discussion
William Avery (talk) 23:18, 23 November 2021 (UTC)
  • The Enciclopedia Italiana has a number of appendici, and there can be ids for a number of articles, as already present at Q298#P4223 and Q298#P4223. Are those good models? @Epìdosis: are you still interested in this? William Avery (talk) 19:21, 25 November 2021 (UTC)
    Yes of course; thanks for resuming this. The appendici of Treccani's Enciclopedia Italiana ID (P4223) are of course very useful, and surely Q298#P4223 is a very good model for their insertion. I am available for any other question or comment, of course. Good evening, --Epìdosis 19:55, 25 November 2021 (UTC)


I have used a script to run a few test edits that added Enciclopedia Italiana and other ids to the following items:

I am wondering whether it will always be acceptable to add the ids that are found to the item that has the original Treccani id. It seems to me that the correspondence is good for instances of human beings, human settlements, geographical features, biological taxa, to name a few. However, I think it would be unwise to attempt an automated bot run for countries:

I will continue to run against single items for testing purposes, and add features to the script to control it running against chosen batches of items. William Avery (talk) 22:22, 28 November 2021 (UTC)

Request process
@Epìdosis: This task has been approved and User:William Avery Bot has been scheduled to process small batches. Just an initial 100 items a day for now. William Avery (talk) 12:05, 5 January 2022 (UTC)

Fix local dialing code (P473) wrongly insertedEdit

Request date: 7 November 2019, by: Andyrom75

Task description

Several entities has a wrong value for the local dialing code (P473) according to the format as a regular expression (P1793) specified in it: [\d\- ]+, as clarified "excluded, such as: ,/;()+"

Typical examples of wrong values, easily identified are the following two:

  1. local dialing code (P473) that includes at the beginning the country calling code (P474)
  2. local dialing code (P473) that include at the beginning the "optional" zero
  • Case 1 can be checked looking for "+", when present, should be compared with the relevant country calling code (P474) and if matched, it should be removed
  • Case 2 can be checked looking for "(" and ")" with zeros inside. If matched it should be removed
Discussion
Request process

Cleaning of streaming media services urlsEdit

Request date: 12 December 2020, by: Swicher

I'm not sure if this is the best place to propose it but when reviewing the urls of a query with this script:

import requests
from concurrent.futures import ThreadPoolExecutor

# Checks the link of an item, if it is down then saves it in the variable "novalid"
def check_url_item(item):
    # Some sites may return error if a browser useragent is not indicated
    useragent = 'Mozilla/5.0 (X11; Linux i686) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77'
    item_url = item["url"]["value"]
    print("Checking %s" % item_url, end="\r")
    req = requests.head(item_url, headers = {'User-Agent': useragent}, allow_redirects = True)
    if req.status_code == 404:
        print("The url %s in the element %s returned error" % (item_url, item["item"]["value"]))
        novalid.append(item)

base_query = """SELECT DISTINCT ?item ?url ?value
{
%s
  BIND(IF(ISBLANK(?dbvalue), "", ?dbvalue) AS ?value)
  BIND(REPLACE(?dbvalue, '(^.*)', ?url_format) AS ?url)
}"""
union_template = """  {{
    ?item p:{0} ?statement .
    OPTIONAL {{ ?statement ps:{0} ?dbvalue }}
    wd:{0} wdt:P1630 ?url_format.
  }}"""
properties = [
    "P2942", #Dailymotion channel
    "P6466", #Hulu movies
    "P6467", #Hulu series
]
# Items with links that return errors will be saved here
novalid = []

query = base_query % "\n  UNION\n".join([union_template.format(prop) for prop in properties])
req = requests.get('https://query.wikidata.org/sparql', params = {'format': 'json', 'query': query})
data = req.json()

# Schedule and run 25 checks concurrently while iterating over items
check_pool = ThreadPoolExecutor(max_workers=25)
result = check_pool.map(check_url_item, data["results"]["bindings"])

I have noticed that almost half are invalid. I do not know if in these cases it is better to delete or archive them but a bot should periodically perform this task since the catalogs of streaming services tend to be very changeable (probably many of these broken links are due to movies/series whose license was not renewed). Unfortunately I could only include Hulu and Dailymotion since the rest of the services have the following problems:

For those sites it is necessary to perform a more specialized check than a HEAD request (like using youtube-dl (Q28401317) for Youtube).

In the case of Hulu I have also noticed that some items can have two valid values in Hulu movie ID (P6466) and Hulu series ID (P6467) (see for example The Tower of Druaga (Q32256)) so you should take that into account when cleaning links.

Request process

Ontario public school contact infoEdit

Request date: 27 December 2020, by: Jtm-lis

Link to discussions justifying the request
Task description

https://www.wikidata.org/wiki/Wikidata:Dataset_Imports/_Ontario_public_school_contact_information

Licence of data to import (if relevant)
Discussion

request to add identifiers from FB (2021-02-11)Edit

Thanks to a recent import, we currently have more than >1.2 items where the only identifier is Freebase ID (P646). However, checking https://freebase.toolforge.org/ some of them have identifiers available there.

Samples:

See Wikidata:Project_chat#Freebase_(bis) for discussion.

Task description

Import ids where available. Map keys to properties if not available at Wikidata:WikiProject_Freebase/Mapping.

Discussion


Request process

request to fix parliamentary group = caucus, != party (2021-05-12)Edit

Request date: 12 May 2021, by: Jura1

Link to discussions justifying the request
Task description
Discussion

@Jura1: Would you be so kind and provide me with a query to these, so that I know what value should be changed to what? How do I determine with a bot if it should be Democratic Caucus of the United States Senate (Q3117832) or Republican Conference of the United States Senate (Q3117916)? Thanks :) Vojtěch Dostál (talk) 20:36, 3 July 2022 (UTC)

Request process

Proliferate external-IDs from qualifiers and references to main statement (2021-07-06)Edit

Request date: 6 July 2021, by: Vladimir Alexiev

Link to discussions justifying the request
Task description

Take a prop like ORCID: Property_talk:P496 says that 22.7% of uses are as reference, and 0.1% as qualifier.

I bet that some of those uses are not reflected as main statement.

SELECT ?itemLabel ?wdt ?wdLabel ?id { # ?ref ?wdr ?statement {
  ?wd wikibase:propertyType wikibase:ExternalId; wikibase:directClaim ?wdt; wikibase:reference ?wdr.
  ?ref ?wdr ?id.
  ?statement prov:wasDerivedFrom ?ref.
  # ?item ?prop ?statement
  # filter not exists {?item ?wdt ?id}
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
} limit 10
Try it!

Of course, sifting through all those external-IDs used as refs will be a huge task. WD times out even on a count query:

SELECT (count(*) as ?c) {
  ?wd wikibase:propertyType wikibase:ExternalId; wikibase:reference ?wdr.
  ?ref ?wdr ?id.
}
Try it!
Discussion

@Vladimir Alexiev: I don't think this would be a sensible thing to do. I often reference claims with external IDs which do not belong to the subject itself. For example, statement "John has two sons, Fred and George" can be found in an entry of John but used as a reference in his son's items in their father (P22) section.Vojtěch Dostál (talk) 20:50, 2 July 2022 (UTC)

I personally agree, I also have used IDs not belonging to the person X to reference stataments in item X (mainly genalogical relations, but also "student of" and sometimes others). This could be a good (but huge) task for manual review, but cannot be performed massively. --Epìdosis 21:55, 2 July 2022 (UTC)
My example itself is of the kind you said: the qualifier is on an Article but applies to its Author.
If someone undertakes to work on such both, I'll undertake to curate reliable patterns where this proliferation can be applied -- Vladimir Alexiev (talk) 09:24, 5 July 2022 (UTC)


Request process

Request to change lexeme forms' grammatical features (2021-07-08)Edit

Request date: 8 July 2021, by: Bennylin

Link to discussions justifying the request
Task description

How can I change grammatical features of form? (I operate bot, I just need to know the commands). I have the list of lexemes. I reckon this should be not too hard, I'm just not familiar with the command to do the changes.

Licence of data to import (if relevant)
Discussion


Request process

request to depreciated ethnic group only sourced with P143 (2021-10-23)Edit

Request date: 23 October 2021, by: Fralambert

Link to discussions justifying the request
Task description
Hi, since ethnic group (P172) is a highly contencious subject, the property already mandate a source and imported from Wikimedia project (P143) is not a reliable source, it would be fine if a bot put a depreciated rank when the statement in ethnic group (P172) use only imported from Wikimedia project (P143) as a source. Also the bot could add reason for deprecated rank (P2241) and source known to be unreliable (Q22979588)as a qualifier. We could also only remove statement with this source, but they are likely to come back, so depreciated them would be a best. --Fralambert (talk) 15:12, 23 October 2021 (UTC)
Licence of data to import (if relevant)
Discussion
  •   Support - it would also be nice to deprecate or remove all those same statements, when there is no source at all… --Hsarrazin (talk) 15:19, 23 October 2021 (UTC)
  •   Support, this could be an ongoing task, since this is not the first time this property needs to be cleaned up: Property talk:P172. — eru [Talk] [french wiki] 15:32, 23 October 2021 (UTC)
  •   Support; I would even support a complete removal of unsourced or Wikipedia-imported claims. —MisterSynergy (talk) 15:58, 23 October 2021 (UTC)
    complete removal would only lead to re-adding of the same statements…  I've already cleaned hundreds of so-called "French" ethnic group, only to see them back after months - a lot of contributors tend to use P172 instead of P27… Hsarrazin (talk) 16:43, 23 October 2021 (UTC)
    If we were to make it, say, a daily job, then we would not accumulate larger amounts of unsourced claims anymore and the users who add these unsourced claims would also learn quickly to adapt to the new situation. —MisterSynergy (talk) 18:30, 23 October 2021 (UTC)
    I think that unsourced statement should be deleted, since sourcing of this property is mandatory. As Wikipedia-imported claim, best is to kept them as depreciated as they are the most likely to come back. Fralambert (talk) 18:51, 23 October 2021 (UTC)
  • delete them. As Help:Ranking#Deprecated_rank says, deprecation isn't an option for claims that can't be sourced. --- Jura 14:01, 25 October 2021 (UTC)
    Agree. Deprecation would cause a mess. Remove if you think that's necessary. Vojtěch Dostál (talk) 06:01, 29 October 2021 (UTC)
    disagree. this isn't a case of "claims that can't be sourced" this is a case of "claims that aren't sufficiently sourced". many of these claims probably could be correctly sourced. BrokenSegue (talk) 02:27, 30 October 2021 (UTC)
    • "imported from" isn't considered sourcing/proper references and for these statements is a requirement that references be added. If you think you are able to do so, please proceed. We could revisit the question in a month and clean up whatever you didn't correctly reference. --- Jura 07:26, 30 October 2021 (UTC)
  •   Oppose I strongly disagree, deleting is not option, I also don't see that it would be a highly contentious issue. Germartin1 (talk) 10:17, 21 December 2021 (UTC)
  •   Support - deprecate all wikipedia-sourced statements, remove all unsourced statements.Matthias Winkelmann (talk) 21:44, 22 March 2022 (UTC)
Request process

request to merge MNAC dups. (2021-11-13)Edit

Request date: 13 November 2021, by: Jura1

Task description

Back in 2016, there seems to have been some duplication between two bots. Compare:

It showed up for several works at Wikidata:WikiProject_sum_of_all_paintings/Creator/Ramon_Casas_i_Carbó in Museu Nacional d'Art de Catalunya (Q861252) and Art Modern Art MNAC (Q23681318).

The idea to identify all of them (for other artists as well) and merge them.

Discussion


Request process

request to merge true duplicates (2021-11-27)Edit

Request date: 27 November 2021, by: Jura1

Link to discussions justifying the request
Task description
Discussion
Thanks. I will have look. --- Jura 12:23, 4 March 2022 (UTC)
Request process

request to add "published in" (P1433) to subpages of plwikisource (2021-12-02)Edit

Request date: 2 December 2021, by: Jura1

Link to discussions justifying the request
Task description


Discussion


Request process

request to undo merge EC meetings (2021-12-02)Edit

Request date: 2 December 2021, by: Jura1

Link to discussions justifying the request
Task description
Licence of data to import (if relevant)
Discussion


Request process

request to delete statements and sitelinks and merge items: dewiki duplicates (2021-12-16)Edit

Request date: 16 December 2021, by: Jura1

Task description
Discussion


Request process

request to make buildings searchable by address (2022-01-05)Edit

Request date: 5 January 2022, by: Jura1

Problem

When adding these statements Special:Search/haswbstatement:P669=Q688477 (currently 62), I noticed that most buildings can't be found by merely searching for the address: Special:Search/Getreidegasse Salzburg (currently 21).

This despite that most items include street address (P6375) with the building address: Q37970986#P6375. This as P6375 isn't indexed.

The easiest solution would have been to index the statement for full text search, but apparently this wont happen any time soon (see Wikidata:Report_a_technical_problem/WDQS_and_Search#index_"street_address"_(P6375)_strings).

The alternative would be to add the address as alias. Sample: Q37998050 with alias "Getreidegasse 11, Salzburg".

Task description
  • select items with buildings and P6375
  • check if address is in label or alias
  • if not, add address as alias (without postal code)
Discussion


Request process

Request to fix Spanish labels wrongly copied from English labels (2022-02-09)Edit

Request date: 9 February 2022, by: Epìdosis

Task description
  • In a relevant number of cases I have come across items having the following problem for "es" labels: in 2013 KLBot2 copied "en" label into "es" label for people, sometimes incorrectly (e.g. for noble people); my proposal is: 1) finding all items with instance of (P31) human (Q5) where "en" and "es" label are identical and containing a sitelink to es.wikipedia; check if the title of es.wikipedia article, without eventual parts in parenthesis, corresponds to "es" label; if they are different, use the title of es.wikipedia article as "es" label and remove it from "es" aliases if already present (e.g.). --Epìdosis 18:33, 9 February 2022 (UTC)
Discussion
Request process

Request to add stats: number of statements with a single external-id (2022-02-09)Edit

Request date: 9 February 2022, by: Jura1


Task description
  • Currently a bot generates statistics of property uses, as main statements among others. This is included in property documentation on talk pages.
  • For external-id properties, it could be interesting to have a slightly modified version: one that shows the number of items with a single external-id.
  • Sample 1:
    • VIAF ID (P214) has currently 2,791,472 uses as main statement
    • 15395 (0.6 %) have only a VIAF ID.
  • Sample 2:
    • IMDb ID (P345) has currently 801,061 uses
    • 61,681 (7.7 %) have only an IMDB ID.
  • Sample 3:
Discussion


Request process

Wikidata link bot request (2022-02-19)Edit

Request date: 19 February 2022, by: ToprakM

There is a category in trwiki that is named "Taxonbars that need a Wikidata item": tr:Kategori:Vikiveri nesnesine ihtiyaç duyan taksonçubukları. I think all articles listed already created some Wikipedias, such as svwiki. Can anyone link them with a bot?

Discussion

Many pages in that category have corresponding automatic taxonomy templates, which should also be linked to wikidata items. See tr:Kategori:Taksonomi şablonları. William Avery (talk) 12:13, 25 May 2022 (UTC)

@ToprakM: I did some research on your category. Of 5276 articles, I am able to detect about 4215 matches to existing Wikidata items. I have to skip about 130 of these - non-unique taxon names in Wikidata. Of the rest, 71 already seem to have an article in trwiki (for example, Accipiter and Atmaca - what do you want to do with these?). The rest is 4017 items where I am able to add a sitelink to trwiki. Do you want me to proceed with those?Vojtěch Dostál (talk) 19:57, 4 July 2022 (UTC)

Thank you @Vojtěch Dostál, 71 seems low. I can handle them manually. I'd do prefer you to proceed except for these 71. --ToprakM 18:38, 6 July 2022 (UTC)

@ToprakM Good, I started the job. The 71 articles which seem like duplicites are below:

New article Existing article-identical taxon name
Accipiter Atmaca
Trachurus %C4%B0stavrit
Lampyridae Ate%C5%9F_b%C3%B6ce%C4%9Fi
Buteo %C5%9Eahin
Menura Lir_ku%C5%9Fu
Chamaeleonidae Bukalemun
Physalis_peruviana Alt%C4%B1n_%C3%A7ilek
Pterois Aslan_bal%C4%B1%C4%9F%C4%B1
Pythonidae Piton
Vigna_unguiculata B%C3%B6r%C3%BClce
Testudinidae Kara_kaplumba%C4%9Fas%C4%B1
Balaenopteridae Oluklu_balinagiller
Pernis Ar%C4%B1_%C5%9Fahini
Aix Aix_(hayvan)
Arthrobacter_nicotinovorans Paenarthrobacter_nicotinovorans
Arthrobacter_viscosus Rhizobium_viscosum
Engraulidae Hamsigiller
Bolitaenidae Bolitaeninae
Solanum_betaceum Tamarillo
Pectinidae Tarak
Arthrobacter_nasiphocae Falsarthrobacter_nasiphocae
Arthrobacter_sanguinis Haematomicrobium_sanguinis
Thraupidae Tanager
Chortinaspis Choucoris
Birgus_latro Hindistan_cevizi_yengeci
Melanotaeniidae G%C3%B6kku%C5%9Fa%C4%9F%C4%B1_bal%C4%B1%C4%9F%C4%B1
Merlucciidae Berlam_bal%C4%B1klar%C4%B1
Lindsaeaceae Lindus
Batis_(bitki) Batis
Vigna_radiata Ma%C5%9F_fasulyesi
Vigna_angularis Azuki_fasulyesi
Gryllotalpidae Gryllotalpa
Cicadidae A%C4%9Fustos_b%C3%B6ce%C4%9Fi
Chrysopidae Ye%C5%9Fil_zarkanat
Nephropidae Istakoz
Argasidae K%C4%B1%C5%9F_kenesigiller
Spermophilus_suslicus Benekli_gelengi
Arthrobacter_histidinolovorans Paenarthrobacter_histidinolovorans
Arthrobacter_cumminsii Pseudoglutamicibacter_cumminsii
Arthrobacter_cryotolerans Paeniglutamicibacter_cryotolerans
Arthrobacter_chlorophenolicus Pseudarthrobacter_chlorophenolicus
Arthrobacter_nitroguajacolicus Paenarthrobacter_nitroguajacolicus
Arthrobacter_aurescens Paenarthrobacter_aurescens
Phyllostomidae Yaprak_burunlu_yarasa
Galbulidae Jakamar
Rajidae Vatoz_bal%C4%B1%C4%9F%C4%B1giller
Regalecidae K%C3%BCrek_bal%C4%B1%C4%9F%C4%B1
Synodontidae Kertenkele_bal%C4%B1%C4%9F%C4%B1
Tetraodontidae Kirpi_bal%C4%B1%C4%9F%C4%B1
Anarhichadidae Kurt_Y%C4%B1lan_Bal%C4%B1%C4%9F%C4%B1
Belonidae Sargan
Myxinidae Myxini
Crambe_(hayvan) Crambe
Desmanthus_(hayvan) Desmanthus
Chondrilla_(hayvan) Chondrilla
Mertensia_(hayvan) Mertensia
Calamus_(hayvan) Calamus
Liparis_(hayvan) Liparis
Synanceia Ta%C5%9F_bal%C4%B1%C4%9F%C4%B1
Clupea Ringa
Sphaerophysa_(hayvan) Sphaerophysa
Pica_(hayvan) Saksa%C4%9Fan
Melampitta Melampittidae
Oenanthe_(hayvan) Oenanthe
Echinops_(hayvan) Echinops
Otus C%C3%BCce_bayku%C5%9F
Psittacella Kaplan_papa%C4%9Fan
Neodiplotrema_(hayvan) Neodiplotrema
Kermia_(hayvan) Kermia
Ditropis_(hayvan) Ditropis
Simsia_(hayvan) Simsia

Vojtěch Dostál (talk) 19:01, 6 July 2022 (UTC)

Also tagging @Vikiolog: who seems to have created some of the articles and may be interested in cleaning up those :)Vojtěch Dostál (talk) 19:08, 6 July 2022 (UTC)

Vojtěch Dostál, I'm on it too, they are on my list. Some marked for merge, some needs change on taxons. As ToprakM said, manually manageable. Thanks for linking items :) Vikiolog (talk) 19:16, 6 July 2022 (UTC)
@Vikiolog Yeah, some may be rare instances where 2 taxons have the same name. Vojtěch Dostál (talk) 19:20, 6 July 2022 (UTC)
Request process

Accepted by (Vojtěch Dostál (talk) 19:57, 4 July 2022 (UTC))


Request to deprecate P2190 string formats as property should use numeric format (2022-03-15)Edit

Request date: 15 March 2022, by: Wolfgang8741

Link to discussions justifying the request
  • C-SPAN person ID (P2190) is transitioning to a numeric format for reliability of linking since the string format has been found to break when C-SPAN changes the string it doesn't always redirect. See property discussion. as well as on Project chat. Coordinated updates to templates using this property have been notified on Wikipedia for update and cleanup.

For the entries added prior to 26 Feb 2022 all matched numeric formats have been uploaded. Strings added after that date have not been checked.

Task description

1. Deprecate all existing statements using a string for the value in C-SPAN person ID (P2190) and add qualifier reason for deprecated rank (P2241) with withdrawn identifier value (Q21441764)

2. Remove any strings added for the property after 14 March 2022 when the property officially started validating for numeric IDs.

3. For string IDs added between 26 Feb and 14 March, resolve the string to the C-SPAN url. Parse the url response and extract the numeric ID.

Discussion


Request process

Request to extract music titels from headline (2022-03-21)Edit

Request date: 21 March 2022, by: Bigbossfarin

Link to discussions justifying the request

I would like to feed Wikidata with Offizielle Deutsche Charts album ID (P10262) of all the albums on the website offiziellecharts.de/album-details-$1 (examples).
Problem: The name of the interpret and album on the website is in header 1 (h1) and header 2 (h2) in the HTML source code (example) and I don't know how to crawl this data.

Task description

I need a list of the headers with ID number:

URL ID h1 h2
... ... ... ...
https://www.offiziellecharts.de/album-details-12 12 Michael Jackson Thriller
https://www.offiziellecharts.de/album-details-13 13 ZZ Top Eliminator
... ... ... ...

the same thing would be fine for artists

URL ID h1
... ... ...
https://www.offiziellecharts.de/suche/person-978 978 Michael Jackson
... ... ...

and songs

URL ID h1 h2
... ... ... ...
https://www.offiziellecharts.de/titel-details-1680 1680 Michael Jackson Bad
... ... ... ...
Licence of data to import (if relevant)
Discussion

Hello @Bigbossfarin, I'm not sure offiziellecharts.de really appreciate to have their whole website crawled. And I don't know if the license is ok with adding data to Wikidata. Myst (talk) 19:29, 24 March 2022 (UTC)

Request process

Shakeosphere person IDEdit

Shakeosphere person ID (P2886) error. Example:

Sorry: id_new = id_old + 24638. --Khodakov Pavel (talk) 14:37, 24 March 2022 (UTC)

I am taking a look at this. Some care is required, e.g. Benjamin Franklin (Q34969) is at https://shakeosphere.lib.uiowa.edu/persons/person.jsp?pid=2437 William Avery (talk) 07:55, 4 June 2022 (UTC)

I ran a script to scan through all 2783 items with a Shakeosphere ID, added 24638 to the id, retrieved the corresponding page from Shakeosphere, and tried to match the name. Somewhat cryptic output is at User:William Avery Bot/Shakeosphere report.

In 117 cases, that gave an invalid ID that didn't return a valid page on Shakeosphere. These need further investigation.

In 2 cases, a valid page was returned, but the name on it didn't match at all:

The other 2644 items (95.7%) can be corrected by adding 24638 to the existing id.

The stability of the ID is clearly dubious, but I'll go ahead with an update, unless somebody proposes P2886 for deletion. William Avery (talk) 21:25, 4 June 2022 (UTC)

Request process

BRFA filed at WD:BRFA § William Avery Bot 6 William Avery (talk) 10:22, 11 June 2022 (UTC)

Set preferred rank on census population of Quebec's municipalitiesEdit

Request date: 6 April 2022, by: YanikB

Link to discussions justifying the request
Task description

1-remove Preferred rank of population (P1082) for 2016 census.

SELECT ?item 
WHERE {
       {?item wdt:P31/wdt:P279* wd:Q3327873} UNION {?item wdt:P31 wd:Q81066200}
       ?item p:P1082 [ ps:P1082 ?population; pq:P459 wd:Q39825; pq:P585  ?date  ] .
       FILTER (?date  = "2016-01-01T00:00:00Z"^^xsd:dateTime )
}
LIMIT 1500

Try it!

2-Change population (P1082) to Preferred rank for 2021 census.

SELECT ?item 
WHERE {
       {?item wdt:P31/wdt:P279* wd:Q3327873} UNION {?item wdt:P31 wd:Q81066200}
       ?item p:P1082 [ ps:P1082 ?population; pq:P459 wd:Q39825; pq:P585  ?date  ] .
       FILTER (?date  >= "2021-01-01T00:00:00Z"^^xsd:dateTime )
}
LIMIT 1500

Try it!

thx

Licence of data to import (if relevant)
Discussion


Request process

Request to add Russian descriptions (2022-04-10)Edit

Request date: 10 April 2022, by: 217.117.125.83

Task description

Please, add Russian description «

вид птиц

» to all the items that have English description «species of bird». 217.117.125.83 17:27, 10 April 2022 (UTC)

Discussion


Request process

Tracking parameters in reference URLsEdit

Do we have a bot that makes edits like this, to trim fbclid and other tracking parameters from URLs in citations, and if not, could someone's bot do that, please? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:20, 24 April 2022 (UTC)

@Matěj Suchánek Do I remember correctly that you bot has been doing this? Vojtěch Dostál (talk) 17:56, 3 July 2022 (UTC)
@Vojtěch Dostál: Yes and no. Not on Wikidata: [9]. --Matěj Suchánek (talk) 07:17, 4 July 2022 (UTC)

Bot to regularly substitute certain templates (2022-04-29)Edit

Request date: 29 April 2022, by: GZWDer

Task description

Currently there are 1981 pages transcluding Template:Unsigned. We should have a bot to automatically substitute it regularly.

Discussion

Thought would be better NOT to substitute those templates ({{Unsigned}}, {{Unsigned2}}), and do some modification to documentation page. Wikidata is a multilingual sites, substitute such templates would lose autotranslate feature. Stang 12:11, 7 June 2022 (UTC)

Request process

Accademia delle Scienze di Torino multiple references (01-05-2022)Edit

Request date: 1 May 2022, by: Epìdosis

Link to discussions justifying the request
Task description

Given the following query:

SELECT DISTINCT ?item
WHERE {
  ?item wdt:P8153 ?ast .
  ?item p:P570 ?statement.
  ?reference1 pr:P248 wd:Q107212659.
  ?reference2 pr:P248 wd:Q107212659.
  ?statement prov:wasDerivedFrom ?reference1.
  ?statement prov:wasDerivedFrom ?reference2.
  FILTER (?reference1 != ?reference2)
}
Try it!

In many items there are multiple references to date of death (P570) referring to www.accademiadellescienze.it (Q107212659)=Accademia delle Scienze di Torino ID (P8153). Cases:

  1. three references: maintain the first (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+subject named as (P1810)), delete the second (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)), delete the third (stated in (P248)+retrieved (P813)) transferring the retrieved (P813) to the first
    1. three references bis: if the first is stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+subject named as (P1810)+retrieved (P813), the second and the third get simply deleted
    2. three references ter: if there is a reference with reference URL (P854) containing a string "accademiadellescienze", it should be deleted; maintain the second (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)), delete the third (stated in (P248)+retrieved (P813)) transferring the retrieved (P813) to the first
  2. two references: maintain the second (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)), delete the third (stated in (P248)+retrieved (P813)) transferring the retrieved (P813) to the first

Repeat the above query substituting date of birth (P569) to date of death (P570). Cases:

  1. two references: maintain the first (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+subject named as (P1810)), delete the second (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+retrieved (P813)) transferring the retrieved (P813) to the first
    1. two references bis: if the first is stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+subject named as (P1810)+retrieved (P813), the second gets simply deleted
    2. two references ter: if there is a reference with reference URL (P854) containing a string "accademiadellescienze", it should be deleted; maintain the second (stated in (P248)+Accademia delle Scienze di Torino ID (P8153)+retrieved (P813))
Discussion

@Ladsgroup: as his bot is probably ready for doing this; the first request was archived despite not being solved, as well as the second. --Epìdosis 21:45, 1 May 2022 (UTC)

@Epìdosis I'm so sorry but complicated duplicate clean ups require dedicated time for coding which I really don't have among this and a million other volunteer responsibilities :( Amir (talk) 03:24, 8 May 2022 (UTC)

@Epìdosis: I took a look at this. Are the following updates acceptable?

I have not got as far as cases where reference URL (P854) and Accademia delle Scienze di Torino ID (P8153) are present. William Avery (talk) 18:38, 7 June 2022 (UTC)
@William Avery: Absolutely perfect; and it is plausible that the cases where reference URL (P854) and Accademia delle Scienze di Torino ID (P8153) are present are very few to none. Thanks very much! --Epìdosis 08:54, 8 June 2022 (UTC)
Great, I will take it forward. I clearly have more time on my hands than some people, and only English summer weather. :-( William Avery (talk) 09:31, 8 June 2022 (UTC)
Request process

Accepted by (William Avery (talk) 12:10, 24 June 2022 (UTC)) and under process
BRFA filed at WD:BRFA § William Avery Bot 7 William Avery (talk) 12:10, 24 June 2022 (UTC)

Request to run a script to remove a template call on items talk pages .. (2022-05-19)Edit

Request date: 19 May 2022, by: TomT0m

Link to discussions justifying the request
Task description
  • it’s of course required to remove the ~4000 template call to {{Item documentation}} on item talk pages as they would be redundant. Delete them altogether should be fine.
Licence of data to import (if relevant)
Discussion
  • it’s somewhat trivial to implement, just amount to a call of something like mw:Manual:Pywikibot/template.py on the Main Talk namespace. Would do it myself if my bot still had a botflag but it’s easier to ask to someone who just has to call the command line.
  Comment Is all we have to do is remove {{Item documentation}} from the top of all item talk pages, like this? --Kanashimi (talk) 21:41, 25 May 2022 (UTC)
@Kanashimi Yep, as far as the bot task is concerned. It could be done as soon as the talkpageheader is edited to include it. author  TomT0m / talk page 08:12, 26 May 2022 (UTC)
... which has now been done — Martin (MSGJ · talk) 16:12, 26 May 2022 (UTC)
Request process
  •   Doing… --Cewbot (talk) 00:21, 27 May 2022 (UTC)
  • Kanashimi:   Done Please check the results and let me know if there is something wrong, thank you. - log --Cewbot (talk) 18:32, 27 May 2022 (UTC)
    There are still >200 transclusions [10] — Martin (MSGJ · talk) 20:49, 27 May 2022 (UTC)
    @MSGJ@TomT0m It seems there are many {{Item documentation}} not in the lead section. Do we need to remove all {{Item documentation}} in the talk pages? Kanashimi (talk) 23:36, 27 May 2022 (UTC)
    @Kanashimi I guess yes, at least the one with no parameters. Some calls might for some reason call the documentation on another item ?
    There might be some glitches in the aftermath such as if someone created a section to put the doc it will be an empty section but I think we can live with this. author  TomT0m / talk page 06:52, 28 May 2022 (UTC)
    @TomT0m@MSGJ   Done Kanashimi (talk) 08:59, 28 May 2022 (UTC)
  • Follow-up question: is there a reason not to delete now empty talk pages? There are currently roughly 7k of these after (most of) the template transclusions have been removed. —MisterSynergy (talk) 23:52, 27 May 2022 (UTC)
    My robot doesn't have permission to delete pages, but yes, this job can be solved with a robot. Kanashimi (talk) 00:54, 28 May 2022 (UTC)

Request to add main subject.. (2022-06-10)Edit

Request date: 11 June 2022, by: ChoKukSuho

Link to discussions justifying the request
Task description
Add superpixel (Q112331841) or stereo matching (Q3307153) as main subject of scholary articles. ChoKukSuho (talk) 05:53, 11 June 2022 (UTC)
SELECT 
  DISTINCT 
  ?item ?title
  (REPLACE(STR(?item), ".*Q", "Q") AS ?qid) 
  ("P921" AS ?property)
  ("Q112331841" AS ?ReplaceWithYourTargetID)
  ("S887" AS ?heuristic)
  ("Q69652283" AS ?deduced)

WHERE {
  hint:Query hint:optimizer "None".
  
{  SERVICE wikibase:mwapi {
    bd:serviceParam wikibase:api "Search";
                    wikibase:endpoint "www.wikidata.org";
                    mwapi:srsearch "\"superpixel\" haswbstatement:P31=Q13442814 -haswbstatement:P921=Q112331841".
      ?page_title wikibase:apiOutput mwapi:title.
  }
 }
  BIND(IRI(CONCAT(STR(wd:), ?page_title)) AS ?item)
  
  ?item wdt:P1476 ?title.
  FILTER CONTAINS(LCASE(?title), "superpixel").

}
Try it!



SELECT 
  DISTINCT 
  ?item ?title
  (REPLACE(STR(?item), ".*Q", "Q") AS ?qid) 
  ("P921" AS ?property)
  ("Q3307153" AS ?ReplaceWithYourTargetID)
  ("S887" AS ?heuristic)
  ("Q69652283" AS ?deduced)

WHERE {
  hint:Query hint:optimizer "None".
  
{  SERVICE wikibase:mwapi {
    bd:serviceParam wikibase:api "Search";
                    wikibase:endpoint "www.wikidata.org";
                    mwapi:srsearch "\"stereo matching\" haswbstatement:P31=Q13442814 -haswbstatement:P921=Q3307153".
      ?page_title wikibase:apiOutput mwapi:title.
  }
 }
  BIND(IRI(CONCAT(STR(wd:), ?page_title)) AS ?item)
  
  ?item wdt:P1476 ?title.
  FILTER CONTAINS(LCASE(?title), "stereo matching").

}
Try it!
Please do the first task for "superpixels", "super-pixel" and "super pixel" too. ChoKukSuho (talk) 05:53, 11 June 2022 (UTC)
Licence of data to import (if relevant)
Discussion

@ChoKukSuho: Do you know you can do this yourself using Petscan? Just paste the query into <Other sources>/<SPARQL> and Use Wiki > Wikidata. Then you'll be able to add simple statements like this...Vojtěch Dostál (talk) 12:46, 20 June 2022 (UTC) @Vojtěch Dostál: I will learn it. ChoKukSuho (talk) 14:30, 3 July 2022 (UTC)

Request process

Request to change P18 to P10253 in references (2022-06-12)Edit

Request date: 12 June 2022, by: Epìdosis

Task description

Since reference image (P10253) was created, the use of image (P18) in references becomes obsolete; please substitute all its occurrences with reference image (P10253). Thanks! --Epìdosis 07:14, 12 June 2022 (UTC)

Discussion


Request process

Request to mass-merge items (2022-07-03)Edit

Request date: 3 July 2022, by: Vojtěch Dostál

Link to discussions justifying the request
Task description

Please merge the following items: User:Vojtěch Dostál/to-merge. My usual tools (QuickStatements, Wikibase-CLI) fail there.

Discussion
I think we should be careful here since there might be a considerable amount of cases that should not be merged. Not sure how to resolve this, but maybe manual verification is inevitable. —MisterSynergy (talk) 09:15, 3 July 2022 (UTC)
@MisterSynergy OK, can you give an example? What's a situation where we don't want to merge? Vojtěch Dostál (talk) 10:52, 3 July 2022 (UTC)
The very first example that I've checked from your list was Q3802587 (given name item) and Q10303957 (dab page item), and there are others with a similar problem. —MisterSynergy (talk) 10:58, 3 July 2022 (UTC)
@MisterSynergy In that example, wouldn't a bot normally make this redirect anyway? I mean, the merge was probably unfortunate, but we not in a position to verify all merges that ever happened... Vojtěch Dostál (talk) 12:49, 3 July 2022 (UTC)
Request process

Request to correct description "ulice v Chotovinech" (2022-07-04)Edit

Request date: 4 July 2022, by: ŠJů

Task description

Please find items with cs description "ulice v Chotovinech" and replace it with "ulice v Chotovinách". 51 item pages affected.

Discussion

A grammar error, wrong form of locative case. See here and here for the correct flection which has no doublets. It is right that the general rules supposes the ending "-ech" for pluralia tantum ending with ‑ny, ‑by, ‑dy, ‑my, ‑ry, ‑vy, but this one name obviously falls not under this type (probably because it is a feminine plurale tantum and not a masculine plurale tantum, i.e. rather like "v horách" than like "v hradech"). Google findings of "v Chotovinách" / "v Chotovinech" are 13200 : 50 for the correct form.

(Btw., I recommend a systematic check of all other municipalities, because more mass grammatical errors of that type have already been found here). --ŠJů (talk) 00:52, 4 July 2022 (UTC) @Frettiebot, Frettie, Matěj Suchánek, Ben Skála:

Request process