User:GZWDer/issues

Use {{Curate issue}} to report issues about the accounts GZWDer (flood), RegularBot or LargeDatasetBot. Remember to ping me if there is any urgency (as I do not use the watchlist).

Already known issues that should be cleaned up regularly:

  • Duplicates in imports from Wikimedia project sitelinks
  • Duplicates in imports from other databases

Other

Label from The Peerage import (closed)

label Label from The Peerage import (closed)
samples
description Some labels include prefixes generally not used in labels.
to do Label may be fixed via QuickStatements
query Queries like https://w.wiki/ZaU may find them
impact ~8000
status fixed (Lady, Lord and Sir not changed, as many enwiki articles use these in their titles)
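A minimal sketch of how such a label cleanup could be prepared for QuickStatements, assuming the labels have already been exported as a mapping from item ID to label. The prefix list, the function names and the item IDs in the usage note are hypothetical; only the exclusion of Lady, Lord and Sir comes from the status note above. The output uses the QuickStatements V1 label command (item, `Len`, quoted text, separated by tabs).

```python
# Hypothetical list of prefixes to strip. "Lady", "Lord" and "Sir" are
# deliberately left out, because those labels were not changed.
PREFIXES = ("Rev.", "Hon.", "Capt.", "Dr.", "Major")


def strip_prefixes(label, prefixes=PREFIXES):
    """Remove leading prefixes (possibly several in a row) from a label."""
    words = label.split()
    # Never strip the last remaining word, so a label cannot become empty.
    while len(words) > 1 and words[0] in prefixes:
        words = words[1:]
    return " ".join(words)


def quickstatements_lines(labels, lang="en"):
    """Yield one QuickStatements V1 label command per label that changes."""
    for qid, label in labels.items():
        cleaned = strip_prefixes(label)
        if cleaned != label:
            yield f'{qid}\tL{lang}\t"{cleaned}"'
```

For example, `list(quickstatements_lines({"Q1": "Rev. John Smith", "Q2": "Sir John Smith"}))` yields a single command for the placeholder ID `Q1` and leaves `Q2` untouched.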


New items without labels (completed?)

label New items without labels (completed?)
samples
description PetScan does not add any labels if the page is imported from a non-Wikipedia project. Pywikibot is not affected.
to do Label may be fixed via QuickStatements
query There are other bots fixing them, but sometimes it is worth fixing them earlier
impact Unknown remaining (they are being fixed automatically)
status unknown


Multiple MGP IDs (closed)

label Multiple MGP IDs (closed)
samples
description Some items have multiple MGP IDs; either they are duplicates, or one is assigned to the wrong person due to an erroneous MR ID.
to do Remove or split them
query See constraint violation report
impact ~400
status completed
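As an illustration of the check behind this issue, the sketch below groups (item, MGP ID) pairs and reports items that carry more than one distinct ID. It assumes the pairs have already been exported (for example from a query result); the function name and the input shape are hypothetical, and the constraint violation report remains the authoritative list.

```python
from collections import defaultdict


def items_with_multiple_ids(pairs):
    """Return {item: sorted IDs} for items that have more than one distinct ID.

    `pairs` is an iterable of (item_id, mgp_id) tuples; repeated pairs are
    counted once.
    """
    ids_by_item = defaultdict(set)
    for item, mgp_id in pairs:
        ids_by_item[item].add(mgp_id)
    return {
        item: sorted(ids)
        for item, ids in ids_by_item.items()
        if len(ids) > 1
    }
```

Each reported item still needs a manual decision: remove the wrong ID, or split the item.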

Imperfect labels from Geni import (to be discussed)

label Imperfect labels from Geni import (to be discussed)
samples
description Labels that include a title, or that use married names instead of maiden names. See also #Label_from_The_Peerage_import
to do For items using married name, add maiden name as alias
query ...
impact Uncertain; some do not need fixing
status open


Empty lexemes (fix not planned initially)

label Empty lexemes (fix not planned initially)
samples
description Lexemes that do not have any forms or senses
to do Add forms (if source exists)
query Easily queried, though will include lexemes created by others
impact ~40000
status open
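A minimal sketch of the emptiness test, assuming the lexeme is available as its entity JSON with `forms` and `senses` lists, and reading "empty" as having neither forms nor senses. The function name is hypothetical, and filtering by creator (to exclude lexemes created by others) is not covered here.

```python
def is_empty_lexeme(entity):
    """Return True if a lexeme's entity JSON has neither forms nor senses.

    A missing key is treated the same as an empty list.
    """
    return not entity.get("forms") and not entity.get("senses")
```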

Lexemes for non-lemma form (fix ongoing)

label Lexemes for non-lemma form (fix ongoing)
samples
description WordNet (and the Wolfram Knowledge Base, which is based on WordNet) contains many terms that are not in lemma form.
to do Merge them into lexemes of the lemma form. User:Nikki is doing this
query ...
impact Unknown
status ongoing fix by others

Lexemes created without forms and without reference on at least one form.

label Lexemes created without forms and without reference on at least one form.
samples
description
to do Add the form mentioned in Wolfram and add a reference to it pointing to a URL/property with URL formatter that others can use to verify.
query see page below
impact ~8000
status unfixed

See: Wikidata_talk:Lexicographical_data#English_noun_ending_with_-s

Uncommunicated imports from Wikipedias (to be discussed)

label Uncommunicated imports from Wikipedias (to be discussed)
samples
description Some Wikipedias do not want items to be connected without a human check (e.g. nlwiki); some do not like importing many duplicates; and some imports may raise concerns in the Wikidata community.
to do Skip such Wikipedias; if possible, draft an RFC for Wikipedia imports in general
query N/A
impact N/A
status open

"Educated at" linked to disambiguation pages (closed)

label "Educated at" linked to disambiguation pages (closed)
samples
description "Educated at" statements imported from MGP are based on the English Wikipedia sitelink.
to do Mass clean up using QuickStatements
query ...
impact ~400
status fixed

Chemical compounds

label Chemical compounds
samples
description duplicate/missing data
to do
query ongoing fix by others
impact Unknown
status ongoing fix by others


Edit speed (closed)

label Edit speed (closed)
samples
description Bots edited more than 90 times per minute (before the rate limit was set up)
to do N/A
query N/A
impact N/A
status fixed
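For illustration only, a client-side throttle that spaces edits evenly could look like the sketch below. The class name is hypothetical, the default of 90 per minute is simply the figure from the description used as an example ceiling, and nothing here describes how the actual rate limit was implemented.

```python
import time


class EditThrottle:
    """Enforce a minimum interval between consecutive edits."""

    def __init__(self, max_per_minute=90, clock=time.monotonic, sleep=time.sleep):
        # clock and sleep are injectable so the class can be tested
        # without real waiting.
        self.min_interval = 60.0 / max_per_minute
        self._clock = clock
        self._sleep = sleep
        self._last = None

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Calling `throttle.wait()` immediately before each edit keeps the long-run rate at or below the configured ceiling.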

Items created by PetScan with empty labels

label Items created by PetScan with empty labels
samples Q25506711, Q25506550
description In 2016, several items were created with one sitelink and no statements. I'm not knowledgeable about what the source page says, so I can't add the statements myself.
to do Add some statements to empty items
query
impact Low
status


LargeDatasetBot related

Invalid ORCID (closed)

label Invalid ORCID (closed)
samples
description Sometimes the ORCID provided by European PubMed Central is invalid
to do Need regular check
query Wikidata:Database_reports/Constraint_violations/P496#Format
impact medium (for the author items only); 1-2 every day
status open (regular)
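A sketch of an ORCID validity check that could run before import, combining a format test with the ISO 7064 MOD 11-2 check character that ORCID iDs use. Whether the bot or the constraint report checks the checksum is not stated above, so this is an assumption about a useful extra step; the function names are hypothetical.

```python
import re

# Four groups of four characters; only the last character may be "X".
ORCID_PATTERN = re.compile(r"[0-9]{4}-[0-9]{4}-[0-9]{4}-[0-9]{3}[0-9X]")


def orcid_check_character(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid):
    """Return True if the string has the ORCID format and a correct check character."""
    if not ORCID_PATTERN.fullmatch(orcid):
        return False
    digits = orcid.replace("-", "")
    return orcid_check_character(digits[:15]) == digits[15]
```

A value that fails this test can be skipped or logged for review rather than written to the author item.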


Authors removed

label Authors removed
samples
description When an article is updated, the existing resolved authors are removed. This is because the order of authors in European PubMed Central may differ from that in other sources, so the lists cannot simply be combined. Currently the bot only edits if more resolved authors are present. For recent runs, most edits are new item creations, which do not have this problem
  • Alternatively, the author statements should not be touched at all, but this may mean
to do
query
impact
status open (regular)
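The current rule ("only edit if more resolved authors are present") can be sketched as a simple comparison. This is an illustrative reading of the description, not the bot's actual code; the function name and the representation of resolved authors as lists of item IDs are assumptions.

```python
def should_update_authors(existing_resolved, incoming_resolved):
    """Return True only if the incoming data resolves more distinct authors.

    Both arguments are iterables of author item IDs; duplicates are ignored.
    """
    return len(set(incoming_resolved)) > len(set(existing_resolved))
```

Under this rule an update that resolves the same number of authors, or fewer, leaves the existing author statements alone.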


Removal of instance of=review article (closed)

label Removal of instance of=review article (closed)
samples
description Removal of instance of (P31) review article (Q7318358), or systematic review (Q1504425), meta-analysis (Q815382), case report (Q2782326), editorial (Q871232). WikidataIntegrator removes all existing statements with the same property as the property to add, so we need to remove instance of (P31) from the properties to add if the item exists. This is fixed, but there are remaining items to fix. (A list is generated.)
to do
query
impact ~21000
status fixed
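The fix described above amounts to filtering the statements before they are handed to the writer. The sketch below shows that filtering step on a plain mapping from property ID to values; it does not use the WikidataIntegrator API, and the function name, the `protected` parameter and the data shape are hypothetical.

```python
def statements_to_write(new_statements, item_exists, protected=("P31",)):
    """Drop protected properties from the statements to add for existing items.

    For a new item everything is written. For an existing item, properties in
    `protected` are left out so that existing values are not overwritten.
    """
    if not item_exists:
        return dict(new_statements)
    return {
        prop: values
        for prop, values in new_statements.items()
        if prop not in protected
    }
```

For example, an update to an existing item carrying P31 and P1433 would pass only P1433 on to the writer, so an existing instance of (P31) review article (Q7318358) stays in place.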


Items without P1433

label Items without P1433
samples
description Probably the journal item is not created yet, or was created after the article item; or the article is not published in a journal.
to do Rerunning the bot on those items may fix the issue.
query
impact
status open (regular)


Some articles not meeting notability policy

label Some articles not meeting notability policy
samples
description Someone has suggested that some (especially non-scientific) articles do not meet the notability policy. See Wikidata_talk:WikiProject_Source_MetaData#Encyclopedia_articles_and_notability.
to do We need to resolve the RfC first.
query See RfC
impact ~80000
status on hold

Could you add a link to the RfC in question? So9q (talk) 09:29, 9 February 2021 (UTC)

@So9q: See Wikidata:Bot_requests#Admin_bot_for_deletion_of_100k_non-notable_items (the last comment), but a formal RFC is not opened yet.--GZWDer (talk) 12:51, 9 February 2021 (UTC)