Wikidata:Project chat/Archive/2024/07

Jon DeVries

Q104346704 (duplicate: Q111549344) RIMOLA (talk) 09:54, 6 July 2024 (UTC)

  Merged  Done
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. RVA2869 (talk) 11:54, 6 July 2024 (UTC)

Named after

We have the property "named after", which can have the value John Smith. Is there a property to use at John Smith to show "things named after this person"? RAN (talk) 16:31, 29 June 2024 (UTC)

If such a property existed, you would see it listed at named after (P138). We generally avoid inverse properties, and I can't think of a good reason to have the property "things named after this person". ChristianKl 16:47, 29 June 2024 (UTC)
Not sure if this would be useful for you, but there is a gadget that you can enable in preferences called "relateditems", which "Adds a button to the bottom of item pages to display inverse statements." So it would show all things named after the person, as well as any other properties that link to the item. Piecesofuk (talk) 17:01, 29 June 2024 (UTC)

Merging Q117208646 (exercise & fitness product) into Q352222 (exercise equipment)?

The former seems to be generated from Google's product taxonomy, but overall seems to refer to the same concept. The subgraphs of both terms are overlapping but not identical, so perhaps a clean-up would be welcome. Any thoughts? Alcinos (talk) 22:34, 29 June 2024 (UTC)

Are there any fitness products that are not exercise equipment? Trade (talk) 13:05, 1 July 2024 (UTC)
Fitbits and other activity tracker (Q16001686) perhaps. Not my area, but all of https://www.wikidata.org/wiki/Special:WhatLinksHere/Q117208646 look like they could fit in the other category Vicarage (talk) 13:36, 1 July 2024 (UTC)
Activity trackers are a good example, one could argue that they are indeed fitness products but not really exercise equipment (although the link to either concept is currently missing in the page you linked). Other elements that could be in the same case: fitness app (Q25104632), smart scale (Q116454756), perhaps also massage gun (Q110997596).
In the light of this, here is a refined proposal:
- rename exercise & fitness product (Q117208646) to "fitness product", and make exercise equipment (Q352222) a subclass of it
- move all current sub-classes of exercise & fitness product (Q117208646) to be sub-classes of exercise equipment (Q352222) (as Vicarage noted, currently all of them seem to be appropriate sub-classes)
- add links for the remaining "fitness products" that are not "exercise equipment", such as activity tracker and the others listed above
How does that sound? Alcinos (talk) 15:51, 1 July 2024 (UTC)
Other proposed "fitness products" that are not "exercise equipment":
- heart rate monitor (Q925303), although that one is currently listed as exercise equipment; not sure if I agree
- yoga pants (Q8054336) (perhaps it would be nice to have a "workout clothes" class? I can't seem to find one currently) Alcinos (talk) 15:58, 1 July 2024 (UTC)
sportswear (Q645292) includes exercise in description Vicarage (talk) 16:37, 1 July 2024 (UTC)
It may be too broad to be a subclass of "fitness product". E.g. sports jersey (Q2623418) is sportswear but likely wouldn't be a good (indirect) subclass of "fitness product". Alcinos (talk) 17:09, 1 July 2024 (UTC)

Bogus disease English aliases prefixed with "obsolete" - cleanup needed

A large number (thousands?) of pages for diseases and classes of diseases currently have bogus aliases in English "obsolete X", where X is usually the main English label. For example, hemophilia (Q134003) has the alias "obsolete hemophilia". Likewise, rinderpest (Q157008) has alias "obsolete rinderpest" (though in a sense it actually is obsolete!). Some have variations, e.g. chronic pancreatitis (Q1996053) has alias "obsolete relapsing pancreatitis".

These seem to have been added by a bot trying to import an external taxonomy in 2020. Example of a bad revision: https://www.wikidata.org/w/index.php?title=Q194435&oldid=1313119769

How should these be cleaned up? Can a bulk query be used to find them all?

73.223.72.200 05:00, 1 July 2024 (UTC)
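
On the bulk-query question, one possible starting point is a SPARQL query against the Wikidata Query Service that looks for English aliases beginning with "obsolete ". The sketch below only builds the query text. It assumes the affected items carry a Disease Ontology ID (P699), which the thread does not confirm, so the property is left as a parameter; the function names are hypothetical.

```python
# Minimal sketch, not a tested cleanup tool.
# Assumption: affected items have a Disease Ontology ID (P699). The thread does
# not say which external taxonomy the bot imported, so pass another property if needed.

def build_obsolete_alias_query(id_property: str = "P699", limit: int = 1000) -> str:
    """Return a WDQS query listing English aliases that start with 'obsolete '."""
    return f"""
SELECT ?item ?alias WHERE {{
  ?item wdt:{id_property} [] ;
        skos:altLabel ?alias .
  FILTER(LANG(?alias) = "en" && STRSTARTS(?alias, "obsolete "))
}}
LIMIT {limit}
""".strip()


def is_bogus_alias(alias: str) -> bool:
    """Same prefix test as the query's STRSTARTS filter (case-sensitive)."""
    return alias.startswith("obsolete ")


if __name__ == "__main__":
    print(build_obsolete_alias_query())
```

Restricting the search to items with one identifier property keeps the query small; whether each hit is really bogus (rinderpest being the edge case noted above) would still need review before removal.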

The wiki is now in read-only mode

"Failed to save due to an error." and "The wiki is now in read-only mode." pop up. Why? Eurohunter (talk) 05:26, 2 July 2024 (UTC)

Apparently there were some brief spikes of replication lag around the time you posted that message; when this happens, the wiki may automatically put itself into read-only mode temporarily until the database has caught up again. Lucas Werkmeister (WMDE) (talk) 09:25, 2 July 2024 (UTC)

Implementing Orphanet Data into Wikipedia

Orphanet is an important reference within Wikipedia, with over 1000 refs. Recently, they changed their data structure, so the former Template:Orphanet no longer works. I have a file with the relevant changes that I would like to be implemented. Zieger M (talk) 07:38, 2 July 2024 (UTC)

@Zieger M Hi, can you share the file publicly, so that I (or others) can have a look and decide if we're able to implement the change? Vojtěch Dostál (talk) 07:42, 2 July 2024 (UTC)
Yes, how can I share it? Zieger M (talk) 07:44, 2 July 2024 (UTC)
@Zieger M If it is a table file, maybe you can upload somewhere and share a link? Ideally, with properly labelled columns so that we understand what changes to what :-). Vojtěch Dostál (talk) 07:46, 2 July 2024 (UTC)
"upload somewhere"? Never done, don't know where to. Sorry Zieger M (talk) 07:51, 2 July 2024 (UTC)
https://www.mediafire.com/file/uimhjnvs9g4uf49/Linkliste+Orphanet_Original.xlsx/file Zieger M (talk) 12:01, 2 July 2024 (UTC)
@Zieger M Hi, I checked the file and I think I now better understand what you mean. In fact, the change does not have anything to do with Wikidata - you just want to properly format its links to Orphanet. I think that you only need to replace the URL string "https://www.orpha.net/consor/cgi-bin/Disease_Search.php?lng=DE&data_id=" at de:Template:Orphanet with "https://www.orpha.net/en/disease/detail/". Isn't that right? You can do it locally in Dewiki. Vojtěch Dostál (talk) 13:16, 2 July 2024 (UTC)

Wikidata Question

Hi Wikipedia, I have two concerns regarding data for Blic, a daily newspaper from Serbia. I have tried entering the publication interval, and for some reason it does not let me publish it. Also, I have tried editing their social media information and it did not let me. For both of them, it does not let me publish changes. Can you tell me why? Боки 18:21, 2 July 2024 (UTC)

What does it say? Ymblanter (talk) 18:42, 2 July 2024 (UTC)
@Ymblanter it doesn't say anything.
Basically, when I try to change it, the publish button is greyed out, so I can't click on it. Боки 18:50, 2 July 2024 (UTC)
If you enter say "1 week" in the field for publication interval then the check-mark can't be clicked. Unit goes into a separate field. Infrastruktur (talk) 19:03, 2 July 2024 (UTC)

I noticed that this item is linked not only as an antiseptic but also for many other medical topics. Its description only mentioned "antiseptic", and I've added the prevention and treatment of iodine deficiency, based on its page linked from WikiProjectMed. The mistake may arise from the fact that it's disambiguated in the English- (and several other) language Wikipedia(s) as "iodine (medical use)". I think all other medical uses (e.g. radioactive iodine therapy (Q13233408)) should link either to iodine as an element (iodine (Q1103)) or to a new item created for this purpose, but the antiseptic (and possibly the deficiency-preventing) use shouldn't be conflated with the radioactive or other medical means of using it. Adam78 (talk) 21:08, 2 July 2024 (UTC)

@Adam78 Is iodine as antiseptic in any way chemically different from the iodine element? If not, all such links should point to iodine (Q1103) and Q28196266 should instead be facet of (P1269) of iodine (Q1103) or something of that sort. A similar example is calcium in biology (Q60097). Vojtěch Dostál (talk) 11:25, 4 July 2024 (UTC)

API / Python / SPARQL access questions

Hi everyone,

please see Wikidata:Project chat#Conventions for Knowledge Graph aligning for context.


TL;DR, we're looking to check if a wikidata instance exists for ~500 entries we have in our database. We also don't want to overburden the Wikidata API, hence:

What can we do to most efficiently query the wikidata database?


What we currently do is:

query = f"""
SELECT ?item ?itemLabel (GROUP_CONCAT(DISTINCT ?altLabel; separator = ", ") AS ?altLabels) 
(SAMPLE(?description) AS ?description) WHERE {{
{selection[select]}
OPTIONAL {{?item skos:altLabel ?altLabel FILTER(LANG(?altLabel) = "en")}}
OPTIONAL {{?item schema:description ?description FILTER(LANG(?description) = "en")}}
SERVICE wikibase:label {{bd:serviceParam wikibase:language "en".}}
}}
GROUP BY ?item ?itemLabel
LIMIT {limit}
"""

where we limit the results to 20 at most and select based on:

selection = {
    'label' : f'?item rdfs:label "{label}"@en.',
    'altLabel' : f'?item skos:altLabel "{label}"@en.'
}

Then, per label, we check if:

  1. entries with that label are available (e.g. "STEP file" to Q3509055),
  2. if these entries do not sum up to our limit (20), then we also check if entries with that label as altLabel exist (e.g. ".stp" to Q3509055),
  3. if these entries do not sum up to our limit (20) then we try 1. and 2. again with (if != label):
    1. label.lower(), so "STEP" -> "step",
    2. label.capitalize(), so "STEP" -> "Step",
    3. label.upper(), so "STEP" -> "STEP" -> not done, since == label


Then we store all queries and results so we run no query twice, and can just check our local "copy" for the result.
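
A minimal sketch of such a local store, assuming a JSON file on disk and a caller-supplied `run_query` function (both hypothetical; the thread does not say how the results are actually stored):

```python
import json
from pathlib import Path


class QueryCache:
    """Remember query results on disk so that no query is sent twice."""

    def __init__(self, path):
        self.path = Path(path)
        # Load earlier results if the cache file already exists.
        if self.path.exists():
            self.data = json.loads(self.path.read_text(encoding="utf-8"))
        else:
            self.data = {}

    def get_or_run(self, query, run_query):
        """Return the cached result, or call run_query(query) once and store it."""
        if query not in self.data:
            self.data[query] = run_query(query)
            self.path.write_text(json.dumps(self.data), encoding="utf-8")
        return self.data[query]
```

Results must be JSON-serializable (for example, a list of item IDs) for this to work.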


Given all this, our Question:

  1. Is there a better way?

Better as in "easier on Wikidata / time" as well as "better results", since currently we have about a 40% match rate. Likely, many of our instances do, in fact, have no match, but others (like Q2117885 "Systems Modeling Language" or "SysML") are currently just not caught. We have seen advice to run some preprocessing on the labels, such as lowercasing all Wikidata labels in a filter, but that seemed unfathomably taxing on all parties involved.

There is also the general advice to use a data dump. We have checked Wikidata:Database download and https://dumps.wikimedia.org/wikidatawiki/entities/, and have not found a dump that contains all labels AND is relatively small. The lexemes do not seem to contain all labels, presumably only Q111352 instances. None of the aforementioned entries, e.g. .p21 and .stp, are mentioned therein.


I really appreciate your help, and am open to suggestions, improvements, hints or anything, really :)


Best, TimBorgNetzWerk (talk) 11:30, 4 July 2024 (UTC)

Have you considered using a tool like OpenRefine to help reconcile your data with Wikidata's? M2Ys4U (talk) 16:26, 4 July 2024 (UTC)
Haven't heard about it yet (I think), will be looking into it, thanks! TimBorgNetzWerk (talk) 09:50, 5 July 2024 (UTC)
OpenRefine is nice if you intend to import data into Wikidata. Last time I checked, the reconciliation it uses yielded less than ideal results. Is this a publicly available graph? If your graph had its own identifier registered on Wikidata, you could use Mix'n'match to do a preliminary matching of the dataset and then verify each match manually. Asking for a new identifier can be done at WD:PP.
In any case, free-text search may be what WDQS is worst at. Unsurprisingly, the built-in search does a much better job; see [1] for Wikidata-specific functionality. You won't tax the API as long as you make calls sequentially and support maxlag. There are libraries available that make this easier. Infrastruktur (talk) 16:36, 5 July 2024 (UTC)
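
As a rough illustration of "sequential calls with maxlag", the sketch below uses the `wbsearchentities` module of the Wikidata API. That module is one option for searching by label or alias; the thread itself does not name an endpoint, and the user agent string, pause lengths, and function names are placeholders.

```python
import json
import time
import urllib.parse
import urllib.request

API_URL = "https://www.wikidata.org/w/api.php"
USER_AGENT = "example-matcher/0.1 (contact: you@example.org)"  # placeholder, use your own


def build_search_url(text, language="en", limit=20, maxlag=5):
    """Build a wbsearchentities request URL that includes the maxlag parameter."""
    params = {
        "action": "wbsearchentities",
        "search": text,
        "language": language,
        "type": "item",
        "limit": limit,
        "maxlag": maxlag,
        "format": "json",
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def is_maxlag_error(payload):
    """True if the API declined the request because the servers are lagged."""
    return payload.get("error", {}).get("code") == "maxlag"


def search_sequentially(texts, pause=1.0, retry_after=5.0):
    """Look up each text one at a time, waiting and retrying on maxlag errors."""
    results = {}
    for text in texts:
        while True:
            request = urllib.request.Request(
                build_search_url(text), headers={"User-Agent": USER_AGENT}
            )
            with urllib.request.urlopen(request) as response:
                payload = json.load(response)
            if not is_maxlag_error(payload):
                break
            time.sleep(retry_after)  # back off before retrying the same text
        # Other API errors carry no "search" key and yield an empty list here.
        results[text] = [hit["id"] for hit in payload.get("search", [])]
        time.sleep(pause)  # keep requests strictly sequential and spaced out
    return results
```

Calling `search_sequentially(["STEP file", "SysML"])` would return a mapping from each text to a list of candidate item IDs, which still need manual or rule-based verification.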

"agency" property?

I'm getting "{{cite journal}}: |author= has generic name (help)" from:

  • CNN Newsource (24 February 2021). "Urban League of Greater Kansas City unveils social justice bus". KMIZ. Wikidata Q126365824. 

in Wikipedia:Gwendolyn Grant (activist)#References.

In a section on "work with template:Cite Q?" on the talk page associated with Wikipedia:Template:Sfn, Wikipedia:User:ActivelyDisinterested said, "CNN News Source is not a valid author name ... . The correct field in this case would be |agency= but [that is not] supported by Wikidata / Cite Q." I've experimented with assigning "CNN Newsource" to different properties, so far without finding one that makes this complaint disappear.

Can someone help me find a property to which to assign "CNN Newsource" (Q5013147) so this complaint in Wikipedia disappears? Thanks, DavidMCEddy (talk) 00:15, 6 July 2024 (UTC)

Why does value-requires-statement constraint (Q21510864) pop up if I add subclass of (P279) with, for example, history of Berlin (Q679741)? It pops up at history of trams in Berlin (Q1514212), while it does not pop up in history of trams in Barcelona (Q11925955). Eurohunter (talk) 06:18, 2 July 2024 (UTC)

@Eurohunter You have to make sure there is a complete hierarchy of classes. In the example you have given, Q1514212 has class Q679741, but Q679741 needs to have some class too... I suggest Q122131 be added there as P279. Vojtěch Dostál (talk) 13:26, 2 July 2024 (UTC)
@Vojtěch Dostál: Thanks. Eurohunter (talk) 12:14, 6 July 2024 (UTC)

We need to put an end to this

For months, items like

and likely others have been the target of constant edit warring, having their English and Russian descriptions changed back and forth by various IP addresses and accounts with few edits. Could anyone have a look, say what is going on, and suggest how administrators should deal with it? --Matěj Suchánek (talk) 08:57, 5 July 2024 (UTC)

Chechen-Ingush wars. All items should be protected at a random version. Maybe we should block the warriors as well. Ymblanter (talk) 19:41, 5 July 2024 (UTC)
Though maybe things like this would help before protection, but then I need to go manually through the list. I can do it, but very slowly. Ymblanter (talk) 19:44, 5 July 2024 (UTC)
I see no good reason to protect them so that only admins can edit them. Semi-protection should be good enough. ChristianKl 21:09, 6 July 2024 (UTC)

set confirm user

I contribute to wp/fa with 1300 edits. On Wikidata I can't change semi-protected pages, so I would like to ask you to give me this level. Thank you. میسانو (talk) 00:41, 13 July 2024 (UTC)

  Done Infrastruktur (talk) 14:14, 13 July 2024 (UTC)
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. Infrastruktur (talk) 14:14, 13 July 2024 (UTC)
No problem. At first I didn't know where I could request that. میسانو (talk) 14:20, 13 July 2024 (UTC)