User:Mateusz Konieczny/failing testcases

Please edit sections that are fixed or outdated! Please do not archive fixed items: I often need to take extra action so that future reports will not include them again.

In general, any help with fixing the problems reported here is welcome!

If you want, post on User talk:Mateusz Konieczny/failing testcases and request to be notified of page updates (or just watchlist this page as usual).

Löwin (Q121323215) is art (field of work, not the resulting work), according to Wikidata ontology

en: animal sculpture (artistic genre of reproducing animals in sculpture) [1]

en: animal art (artistic theme of reproducing animals in art) [2]
en: figurative art (art that depicts real object sources) [3]
en: visual arts (practice of art which creates works that are primarily visual in nature) [4]
en: art (field of work focused on creating expressive work intended to be appreciated for its beauty or emotional power (use Q838948 for the resulting work)) [5] this was unexpected here as it indicates art (field of work, not the resulting work) !!!!!!

Q124429211 is a construction (as economic activity), according to Wikidata ontology

en: rammed earth building (building whose walls are constructed from tamped earth) [6]

en: mud building (building made mainly of mud or clay) [7]
en: earthen architecture (type of building construction) [8]
en: construction (economic activity that consists of the building or assembling of a building or infrastructure) [9] this was unexpected here as it indicates a construction (as economic activity) !!!!!!!!!!!!

Police Broadcasting Service (Q6701894) is a science, according to Wikidata ontology

en: public radio (radio format) [10]

en: public broadcasting (electronic media outlets whose primary mission is public service) [11]
en: public service (service provided by a government to people living within its jurisdiction) [12]
en: service (economic product that directly satisfies wants without producing a lasting asset) [13] this was unexpected here as it indicates a service !!!!!!!!!!!!!!!!!!!!!!!!!!
en: broadcasting (distribution of audio and video content to a dispersed audience via any audio or visual mass communications medium) [14]
en: public disclosing (in communication, to broadcast a message without direct feedback from the audience) [15]
en: dissemination (the spread of something (e.g. a pathogen, an idea, a technique)) [16]
en: process (series of events which occur over an extended period of time) [17] this was unexpected here as it indicates an event !!!!!!!!!!!!!!!!!!!!!!!!!!
en: radio broadcasting (distribution of audio content to a dispersed audience via any audio mass communications medium) [18]
en: broadcast engineering (field of electrical engineering, and now to some extent computer engineering and information technology, which deals with radio and television broadcasting) [19]
en: electrical engineering (field of engineering that deals with electricity) [20]
en: engineering (applied science) [21]
en: applied science (discipline that applies existing scientific knowledge to develop more practical applications) [22]
en: science (systematic system that builds and organizes knowledge, and the set of knowledge produced by this system) [23] this was unexpected here as it indicates a science !!!!

Missouri Botanical Garden (Q1852803) is a service, according to Wikidata ontology

en: online database (database accessible from a network, including from the Internet) [24]

en: website (set of related web pages served from a single web domain) [25] this was unexpected here as it indicates a website !!!!

Rainbow Village (Q22814370) is a genre, according to Wikidata ontology

en: street art (art that is public and temporary in public spaces) [26]

en: urban art (art genre related to cities) [27]
en: genre (category of creative works based on stylistic, thematic or technical criteria) [28] this was unexpected here as it indicates a genre !!!!!!!!!!!!!!!!!

Colonia Murri (Q125361729) is an intentional human activity, according to Wikidata ontology

Unexpected type Q123154102 undocumented format

en: summer camp (supervised program for children or teenagers conducted during the summer months) [29]
en: recreation (human activity of leisure (discretionary time)) [30]
en: intentional human activity (human activity driven by purposeful motives) [31] this was unexpected here as it indicates an intentional human activity !!!!!!!

Mural Nelson Mandela (Q109331485) is a genre, according to Wikidata ontology

en: graffiti (writing or drawing etched, scratched, scribbled, or sprayed (often, but not always illicitly) on a wall or other surface in a public place) [32]

en: street art (art that is public and temporary in public spaces) [33]
en: urban art (art genre related to cities) [34]
en: genre (category of creative works based on stylistic, thematic or technical criteria) [35] this was unexpected here as it indicates a genre !!!!!!!!!!!!!!!!

Tracking query: items likely to be works of art, not fields of art

https://query.wikidata.org/#SELECT%20%3Fitem%20%3FitemLabel%20%0AWHERE%20%0A%7B%0A%20%20VALUES%20%3Fclassofart%20%7Bwd%3AQ36649%20wd%3AQ184485%20wd%3AQ8242%20wd%3AQ2864737%20wd%3AQ10988986%20wd%3AQ213156%20wd%3AQ682010%20wd%3AQ473743%20wd%3AQ11633%20wd%3AQ125191%20wd%3AQ2394336%20wd%3AQ11629%20wd%3AQ11634%20wd%3AQ2921001%20%7D%0A%20%20%3Fitem%20wdt%3AP31%20%3Fclassofart.%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%20%23%20Helps%20get%20the%20label%20in%20your%20language%2C%20if%20not%2C%20then%20en%20language%0A%7D

74511 results in 9848 ms - is it a useful report, or is it listing a bunch of false positives? Mateusz Konieczny (talk) 19:20, 15 May 2024 (UTC)
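The link above is hard to read because the SPARQL is percent-encoded. A minimal sketch of how the same query and link could be rebuilt from the list of class QIDs is shown here. The function names are illustrative assumptions, and the explanatory comment inside the original query is omitted.

```python
from urllib.parse import quote, unquote

# Class QIDs copied from the VALUES clause of the tracking query above.
ART_CLASS_QIDS = [
    "Q36649", "Q184485", "Q8242", "Q2864737", "Q10988986", "Q213156",
    "Q682010", "Q473743", "Q11633", "Q125191", "Q2394336", "Q11629",
    "Q11634", "Q2921001",
]


def build_tracking_query(class_qids):
    """Return SPARQL selecting items that are direct instances (P31) of the given classes."""
    values = " ".join(f"wd:{qid}" for qid in class_qids)
    return (
        "SELECT ?item ?itemLabel\n"
        "WHERE\n"
        "{\n"
        f"  VALUES ?classofart {{ {values} }}\n"
        "  ?item wdt:P31 ?classofart.\n"
        '  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }\n'
        "}"
    )


def build_query_service_link(sparql):
    """Percent-encode the query into the URL fragment, as in the link above."""
    return "https://query.wikidata.org/#" + quote(sparql, safe="")


if __name__ == "__main__":
    query = build_tracking_query(ART_CLASS_QIDS)
    print(query)
    print(build_query_service_link(query))
    # unquote() reverses the encoding, which helps when reading an existing link.
    print(unquote(build_query_service_link(query).split("#", 1)[1]) == query)
```

Keeping the QIDs in a list makes it easier to add or drop a class when checking which ones produce false positives.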

apparently missing wikidata entries

https://taginfo.openstreetmap.org/keys/name:etymology:wikidata:missing#values lists cases where a feature is named after a person, object, or thing that has no Wikidata entry.

Presumably these Wikidata entries should be created.

Would it be useful to report them somewhere?

Try Wikidata:Bot requests. Swpb (talk) 18:00, 15 May 2024 (UTC)

Items that may be correct as they are

Kappa Kappa Kappa (Q6367049) is an object that exists outside physical reality, according to Wikidata ontology

en: fraternity (collegiate social organization for men) [36]

en: fraternities and sororities (social organizations at colleges and universities) [37]
en: social organization (pattern of relationships between and among individuals and social groups) [38]
en: pattern (discernible spatial or temporal regularity in the world or in a man-made design) [39]
en: regularity (abstract concept describing something that appears more than once with a certain space or time between the occurances) [40]
en: abstract entity (entity that exists outside physical reality, including abstract objects and properties) [41] this was unexpected here as it indicates an object that exists outside physical reality !!!!

This is correct; it is an organization without physical substance. Swpb (talk) 16:26, 23 April 2024 (UTC)

Santa Casa da Misericórdia de Soure (Q64555514) is an object that exists outside physical reality, according to Wikidata ontology

en: confraternity (generally a Roman Catholic or Orthodox voluntary association of lay people) [42]

en: Sodality (also a lay organization in the Roman Catholic Church) [43]
en: sodality (non-kin group organized for a specific purpose and frequently spanning villages or towns) [44]
en: social organization (pattern of relationships between and among individuals and social groups) [45]
en: pattern (discernible spatial or temporal regularity in the world or in a man-made design) [46]
en: regularity (abstract concept describing something that appears more than once with a certain space or time between the occurances) [47]
en: abstract entity (entity that exists outside physical reality, including abstract objects and properties) [48] this was unexpected here as it indicates an object that exists outside physical reality !!!!!!
  • Organizations are platonic objects and not physical objects, so I don't see the problem here. ChristianKl❫ 16:13, 21 April 2024 (UTC)
    • "Nonphysical object" would be fine, but "entity that exists outside physical reality" is not true. "Number 1" or "NGO" (as a concept) is an "abstract entity - entity that exists outside physical reality", while a specific organisation is nonphysical but exists within reality. Mateusz Konieczny (talk) 09:19, 24 April 2024 (UTC)
      • I think that's a distinction without a difference. Organizations are abstract entities; you can quibble with the description of abstract entity (Q7048977) if you want but I think it's accurate as is. Swpb (talk) 16:05, 8 May 2024 (UTC)

Subject matter expertise needed

Jesus (Q302) is a fictional entity, according to Wikidata ontology

en: Salvator Mundi (title of Jesus and subject in Christian iconography) [49]

en: Salvator [50]
en: Messiah (saviour or liberator of a group of people, most commonly in the Abrahamic religions) [51]
en: fictional religious occupation (religious occupation which only exists in a work of fiction) [52]
en: fictional entity (entity that only exists in a work of fiction) [53] this was unexpected here as it indicates a fictional entity !!!!!!!!!!!!!!!!!!!!!!!!!!

en: historical character (character in works of fiction inspired by an actual person in history, often heavily romanticized) [54]

en: fictional character (fictional human or non-human character in a narrative work of art) [55]
en: fictional entity (entity that only exists in a work of fiction) [56] this was unexpected here as it indicates a fictional entity !!!!!!!!!!!!!!!!!!!!!!!!!!

en: film character (fictional character appearing in a film) [57]

en: fictional character (fictional human or non-human character in a narrative work of art) [58]
en: fictional entity (entity that only exists in a work of fiction) [59] this was unexpected here as it indicates a fictional entity !!!!!!!!!!!!!!!!!!!!!!!!!!

See https://en.wikipedia.org/wiki/Historicity_of_Jesus

This in general is a pretty problematic mess in many aspects.

Mateusz Konieczny (talk) 07:19, 19 July 2023 (UTC)

The solution would be the same as that discussed for other biblical figures with evidence of historicity: one item for the historical person, and one for their biblical representation. It would be best for someone with topic expertise to sort out which statements and links belong to which of those two items. Swpb (talk) 19:04, 19 July 2023 (UTC)

Andorra (Q228) classified as goods and services

principality (Q208500)

manorialism (Q1550557)
landed property (Q845132)
property (Q1400881)
goods (Q28877)
goods and services (Q2897903)

manorialism (Q1550557) currently conflates an economic system and properties held under that system. To the extent that principalities are or were manorial properties, they are goods. The question of whether all instances of principality (Q208500), and Andorra in particular, are also instances of manorialism (Q1550557) needs a subject-matter expert. Swpb (talk) 20:03, 6 July 2023 (UTC)

Well, nowadays Andorra cannot be simply bought. So at least some qualifiers would be needed Mateusz Konieczny (talk) 22:42, 6 July 2023 (UTC)
Yes, probably end time (P582) at least. Swpb (talk) 13:42, 7 July 2023 (UTC)

Posted to https://www.wikidata.org/wiki/Wikidata:Project_chat#Andorra_(Q228)_classified_as_goods_and_services Mateusz Konieczny (talk) 12:58, 28 November 2023 (UTC)

See https://www.wikidata.org/wiki/Wikidata:Project_chat/Archive/2023/11#Andorra_(Q228)_classified_as_goods_and_services - remained unfixed Mateusz Konieczny (talk) 05:54, 12 December 2023 (UTC)

Unsolvable conflations?

Conflated classes that would be impractical and/or counterproductive to deconflate.

Imprints vs. publishers

Verlag Ferdinand Schöningh (Q1298441) is an object that exists outside physical reality, according to Wikidata ontology

en: imprint (trade name under which works are published, often corresponding to a division of a publishing company) [60]

en: trade name (name which a business trades under for commercial purposes) [61]
en: wordmark (stylized text-only representation of a brand used for identification and branding) [62]
en: name (word or phrase used for identification) [63]
en: word or phrase (sequence of one or more words) [64]
en: linguistic form (any meaningful unit of speech such as word, phrase, sentence, morpheme, affix (prefix, suffix, etc.) and the like) [65]
en: unit of speech (whatever the English phrase "unit of speech" means, used e.g. in the definitions in Merriam-Webster and dictionary.com) [66]
en: linguistic unit (any of a range of units of language, whether a word, phrase, clause, sentence, paragraph, whole conversation or a story, morpheme, grapheme, phoneme and syllable) [67]
en: unit (entity regarded or used as an elementary structural or functional constituent to measure, analyse or describe another entity) [68]
en: abstract entity (entity that exists outside physical reality, including abstract objects and properties) [69] this was unexpected here as it indicates an object that exists outside physical reality !!!!!!!

This represents a whole class of tricky cases – it hinges on whether the entity continued as a defined organization within a parent company, or if it became simply a name that the parent company publishes certain works under, without any corresponding internal organization. In the former case, it wouldn't strictly be an imprint, and in the latter, we'd want to put an end time (P582) on instance of = publisher (Q2085381). However, in the grand scheme, I suspect it's best to leave this type of conflation alone: imprint (Q2608849) is an accepted value class for publisher (P123), which is what these entities are overwhelmingly used for, and I suspect the degree of organizational independence of entities identified as imprints is often both A) not easily determined and B) not important enough to justify the work to separate them, and the overhead of maintaining two Q-items for each such entity. The same issue applies to brand (Q431289) and company (Q783794) generally; they are often not worth separating. So I think this belongs in the "Unsolvable" section below. Swpb (talk) 19:51, 24 April 2024 (UTC)

Offices vs. professions

https://www.wikidata.org/wiki/Q449319 is classified as a profession via https://www.wikidata.org/wiki/Q3368517

But it is not a specific profession, and the article at https://en.wikipedia.org/wiki/Public_Prosecutor_General_(Germany) describes a government office instead

Mateusz Konieczny (talk) 05:44, 16 September 2023 (UTC)

https://www.wikidata.org/wiki/Q5166910 and https://www.wikidata.org/wiki/Q7583851 and https://www.wikidata.org/wiki/Q55499784 have a similar problem. Not sure what would be a proper fix... Mateusz Konieczny (talk) 05:46, 16 September 2023 (UTC)

Public Prosecutor General (Q449319) is a profession, according to Wikidata ontology

en: public prosecutor general (public office) [70]

en: prosecutor (legal representative of the state in criminal trials) [71]
en: government attorneys (type of professional employees in government) [72]
en: lawyer (legal professional who helps clients and represents them in a court of law) [73]
en: legal profession (profession of those who study, develop and apply law – as a lawyer, judge, etc.) [74]
en: profession (occupation requiring specialized training) [75] this was unexpected here as it indicates a profession !!!!!!!!!!!!!!!!!!!!!!!!!!

  Comment That's a tricky one because the label Generalbundesanwalt beim Bundesgerichtshof applies to both the prosecutor (=person) as well as the agency they are overseeing. Maybe these two meanings should be modelled as separate items.
The same issue applies to every instance of Federal Commissioner (Q1005815) and State Commission for Data Protection (Q1802121). --Nw520 (talk) 09:14, 30 November 2023 (UTC)

And what is worse, Wikipedia articles will tend to describe both in a single article, which is not unreasonable at all. But it will explode Wikidata modelling. I faintly remember we have something along the lines of "is instance of: Wikipedia article describing commingled topics" Mateusz Konieczny (talk) 14:35, 30 November 2023 (UTC)
Are you looking for ambiguous Wikidata item (Q122754124)? Swpb (talk) 17:51, 30 November 2023 (UTC)
Probably Mateusz Konieczny (talk) 23:26, 30 November 2023 (UTC)
Do you think that ambiguous Wikidata item (Q122754124) would work here? Or should I put it on the "Wikidata is incapable of handling it" pile? Mateusz Konieczny (talk) 21:29, 7 December 2023 (UTC)

FYI, there are currently at least 9500 items that are both instances of position (Q4164871) or its subclasses, and subclasses of occupation (Q12737077) or its subclasses. Each item in each of those 9500 subclass chains may be a proper occupation, or it may be a subclass of position. Take head of state (Q48352) - which is it? If it's a class of positions, then it shouldn't be a subclass of statesperson (Q372436) (assuming that statesperson is an occupation), but it could have occupation (P106) = statesperson instead (if you add Q4164871 to the subject-type constraint on P106). But you will surely get complaints that head of state is indeed an occupation, and a proper subclass of statesperson. The closer you look, the more the dividing line vanishes. I think you're going to have to just accept these conflations. I wouldn't apply ambiguous Wikidata item (Q122754124), as it suggests the possibility of a reasonable deconflation. Swpb (talk) 20:13, 4 January 2024 (UTC)

public prosecutor general (Q3368517) is a profession, according to Wikidata ontology

en: prosecutor (legal representative of the state in criminal trials) [76]

en: government attorneys (type of professional employees in government) [77]
en: lawyer (legal professional who helps clients and represents them in a court of law) [78]
en: legal profession (profession of those who study, develop and apply law – as a lawyer, judge, etc.) [79]
en: profession (occupation requiring specialized training) [80] this was unexpected here as it indicates a profession !!!!!!!!!!!!!!!!!!!!!!!!!!

Síndic de Greuges de Catalunya (Q7583851) is a profession, according to Wikidata ontology

en: ombudsperson (official representing the interests of the public) [81]

en: judge (official who presides over court proceedings) [82]
en: legal profession (profession of those who study, develop and apply law – as a lawyer, judge, etc.) [83]
en: profession (occupation requiring specialized training) [84] this was unexpected here as it indicates a profession !!!!!!!!!!!!!!!!!!!!!!!!!!

Projects vs. products

Project Riese (Q320076) classified as an intentional human activity

project (Q170584)

intentional human activity (Q451967)

It is. Swpb (talk) 20:10, 6 July 2023 (UTC)

I guess that it describes both the constructed structures and the project to build them... Not sure how to handle such a case on my side Mateusz Konieczny (talk) 12:14, 7 July 2023 (UTC)
Yeah, it's the same issue I raised here. I don't find the conflation satisfying either, but Vicarage is right that splitting all such entities is not practical. You might want to just write an exception for these into your tool. Swpb (talk) 13:39, 7 July 2023 (UTC)
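
The exception suggested in the reply above could be sketched roughly as follows. This is only an illustration, not the actual tool code: the report dictionaries, `ACCEPTED_CONFLATIONS`, and `filter_reports` are hypothetical names, and only the QIDs come from this page.

```python
# Hypothetical exception list: (item QID, unexpected class QID) pairs that were
# reviewed and accepted as conflations not worth splitting.
ACCEPTED_CONFLATIONS = {
    ("Q320076", "Q451967"),  # Project Riese / intentional human activity
}


def filter_reports(reports, accepted=ACCEPTED_CONFLATIONS):
    """Drop reports whose (item, unexpected class) pair is a known accepted conflation."""
    return [
        report
        for report in reports
        if (report["item"], report["unexpected_class"]) not in accepted
    ]


if __name__ == "__main__":
    reports = [
        {"item": "Q320076", "unexpected_class": "Q451967"},  # Project Riese
        {"item": "Q316053", "unexpected_class": "Q451967"},  # KATRIN
    ]
    for report in filter_reports(reports):
        print(report["item"], "still reported as", report["unexpected_class"])
```

Keying the exception on the pair, rather than on the item alone, means the same item would still be reported if it started reaching some other unexpected class.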

KATRIN (Q316053) is a human activity, according to Wikidata ontology

en: experiment (scientific procedure carried out to support, refute, or validate a hypothesis) [85]

en: research work (activity performed as part of scientific research) [86]
en: human activity (activity initiated by a human, intentionally or unintentionally) [87] this was unexpected here as it indicates a human activity !!!!!!!!!!!!!!!!!!!!!!!!!!
en: test (way of checking something by interacting with it) [88]
en: intentional human activity (human activity driven by purposeful motives) [89] this was unexpected here as it indicates an intentional human activity !!!!!!!!!!!!!!!!!

In the same way as Project Riese (Q320076) above, I'd consider this "unsolvable" in that it's probably counterproductive to try to de-conflate the experiment (activity) from the apparatus of the same name. Swpb (talk) 20:58, 29 November 2023 (UTC)

Halls of fame as lists vs. as buildings

International Tennis Hall of Fame (Q52454) is an award, according to Wikidata ontology

en: sports hall of fame (hall of fame for topics related to sports) [90]

en: hall of fame (list of outstanding individuals in a particular group, which may or may not be embodied in a literal physical structure) [91]
en: award (something given to a person or a group of people to recognize their merit or excellence) [92] this was unexpected here as it indicates an award !!!!!!!!!!!!!!!!!!!!!!!!!!

Change made: [93]. Lots of instances of hall of fame (Q1046088) are intangible lists, lots are buildings, and lots refer to both. It would be impractical to have separate "hall of fame (list)" and "hall of fame (building)" classes and attempt to separate the many, many instances into those two piles. Swpb (talk) 20:25, 2 May 2023 (UTC)

Walk of Fame of Cabaret (Q2345775) is an award, according to Wikidata ontology

en: walk of fame (sidewalk or similar construction that commemorates outstanding individuals in a particular group) [94]

en: award (something given to a person or a group of people to recognize their merit or excellence) [95] this was unexpected here as it indicates an award !!!!!!!!!!!!!!!!!!!!!!!!!!

Somewhat fixed - see section on International Tennis Hall of Fame (Q52454) above. Swpb (talk) 20:26, 2 May 2023 (UTC)

Blues Hall of Fame (Q258100) is an award, according to Wikidata ontology

en: Blues Hall of Fame (award by Blues Foundation, since 2015 also a music museum in Memphis, Tennessee) [96]

en: award (something given to a person or a group of people to recognize their merit or excellence) [97] this was unexpected here as it indicates an award !!!

mixing two things in one entry again

Other

The Bitches (Q878769) (set of rocks) is a physical process, according to Wikidata ontology

Part of the problem is that two entities are merged together, or that a single entity has two components.

Still, a location with a fast-moving tidal flow is not a physical process.

en: tidal race (fast-moving tidal flow passing through a constriction, forming waves, eddies and strong currents) [98]

en: ocean current (continuous, directed movement of ocean water) [99]
en: current (magnitude and direction of flow in a fluid) [100]
en: fluid flow (movement of fluid matter) [101]
en: motion (change in position of an object over time; a body is said to be in motion if it changes its position with respect to its immediate surroundings) [102]
en: movement (act of moving) [103]
en: behavior (actions by entities within a system) [104] this was unexpected here as it indicates a behavior !!!!!!!!!!!!!!!!!!!!!!!!!!
en: change (process, event or action that deviates from the present state) [105]
en: occurrence (occurrence of a fact or object in space-time; instantiation of a property in an object) [106] this was unexpected here as it indicates an event !!!!!!!!!!!!!!!!!!!!!!!!!!
en: physical process (process that can be described with physics) [107] this was unexpected here as it indicates a physical process !!!!!!!!!!!!!!!!!!!!!!!!!!

Nothing reasonable to be done. Probably not practical to have one item for the rocks and a separate item for the resulting tidal flow (process), which goes by the same name. A query for instances of tidal race (Q495844) should return this item. Swpb (talk) 19:42, 2 May 2023 (UTC)

List of detected issues in OpenStreetMap

(OSM issue listing is the main project, Wikidata problems are reported when they cause false positives)

more info about this list

Produced from https://github.com/matkoniecz/wikibrain/blob/master/test_wikidata_structure.py
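
The reports on this page all have the same shape: a chain of "subclass of" steps that ends at a class which was not expected for the item. A minimal sketch of that kind of check is given here. It is not taken from test_wikidata_structure.py: the tiny hard-coded graph, the `UNEXPECTED` set, and the function name are assumptions for illustration, with labels standing in for QIDs.

```python
from collections import deque

# Hypothetical miniature "subclass of" (P279) graph; labels stand in for QIDs.
SUBCLASS_OF = {
    "graffiti": ["street art"],
    "street art": ["urban art"],
    "urban art": ["genre"],
    "mural": ["painting"],
    "painting": ["work of art"],
}

# Classes that a physical, mappable object is not expected to reach (assumed list).
UNEXPECTED = {"genre", "profession", "award"}


def find_unexpected_chain(start_classes, subclass_of, unexpected):
    """Breadth-first search upward from the item's classes.

    Returns the shortest chain ending at an unexpected class, or None.
    """
    queue = deque([cls] for cls in start_classes)
    seen = set(start_classes)
    while queue:
        chain = queue.popleft()
        current = chain[-1]
        if current in unexpected:
            return chain
        for parent in subclass_of.get(current, []):
            if parent not in seen:
                seen.add(parent)
                queue.append(chain + [parent])
    return None


if __name__ == "__main__":
    # Mirrors the Mural Nelson Mandela (Q109331485) report on this page.
    print(find_unexpected_chain(["graffiti"], SUBCLASS_OF, UNEXPECTED))
    print(find_unexpected_chain(["mural"], SUBCLASS_OF, UNEXPECTED))
```

Returning the whole chain rather than a yes/no answer is what makes a report actionable, since the fix is usually one bad link somewhere in the middle.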

see also Wikidata talk:WikiProject Ontology (write to me if you solved everything posted there and here - or at least tried to solve it - and want more)

note to self

adding new cases to Wikidata talk:WikiProject Ontology is fine as long as they are no more than 20% of threads there and this page is overloaded (100 unsolved cases)

ad (to be used when linking it on Wikidata talk:WikiProject Ontology): BTW, if anyone wants listing of other issues like this - see [[User:Mateusz Konieczny/failing testcases]] (some may be easier to solve than this one) ~~~~

Wikidata:Pump - 2023 share done and archived thread linked in one of still unsolved cases

share on #wikimedia on US Slack done in 2023

Slack share text

I want to advertise my https://www.wikidata.org/wiki/User:Mateusz_Konieczny/failing_testcases

It is a listing of cases where I discovered bogus ontology on Wikidata while trying to find bad Wikipedia tags.

See https://matkoniecz.github.io/OSM-wikipedia-tag-validator-reports/ and https://maproulette.org/browse/projects/53065 if fixing OSM problems is more important for you.