Wikidata:Bot requests/Archive/2014/11

Template descriptions

The German description for templates is usually "Wikimedia-Vorlage". Unfortunately, many templates have the description "Wikipediavorlage" instead, which is inconsistent. Could a bot change these descriptions to the standard one? 79.217.130.43 20:51, 22 November 2014 (UTC)
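A minimal sketch of the replacement rule requested above, assuming descriptions are handled as a plain {language: description} dictionary. The function name is hypothetical, and writing the result back to Wikidata is deliberately left out.

```python
OLD_DESCRIPTION = "Wikipediavorlage"
STANDARD_DESCRIPTION = "Wikimedia-Vorlage"


def normalize_german_description(descriptions):
    """Return a copy of the descriptions with the German entry standardized.

    Only an exact match on the old wording is replaced; any other German
    description is left alone so that hand-written text is not overwritten.
    """
    updated = dict(descriptions)
    if updated.get("de") == OLD_DESCRIPTION:
        updated["de"] = STANDARD_DESCRIPTION
    return updated


if __name__ == "__main__":
    print(normalize_german_description({"de": "Wikipediavorlage"}))
```

Matching the old string exactly, rather than by substring, keeps the change limited to the case described in the request.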

  Done --Pasleim (talk) 18:08, 23 November 2014 (UTC)
This section was archived on a request by: --Pasleim (talk) 18:42, 6 December 2014 (UTC)

Bot request

On Wikidata:Database reports/Constraint violations/P1577 there is a long list of items. All manuscripts with GA numbers are definitely Greek manuscripts, so all of them lack language of work or name (P407) + Ancient Greek (Q35497). This is an easy job for bots.--Giftzwerg 88 (talk) 11:59, 26 November 2014 (UTC)

It's even an easy job for you: Open this link, enter P407:Q35497 in the controls box, and then click "Proceed commands". --Pasleim (talk) 13:27, 26 November 2014 (UTC)
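For a batch approach outside that tool, the same statement can be written as one command per item. The sketch below assumes the tab-separated item, property, value layout accepted by QuickStatements; the function name and the item IDs in the usage line are placeholders, not items from the constraint report.

```python
def add_language_commands(item_ids, prop="P407", value="Q35497"):
    """Build one tab-separated 'item, property, value' line per item.

    P407 (language of work or name) and Q35497 (Ancient Greek) are the
    property and value named in the request; item_ids is supplied by the caller.
    """
    return "\n".join(f"{item_id}\t{prop}\t{value}" for item_id in item_ids)


if __name__ == "__main__":
    # Placeholder IDs for illustration only.
    print(add_language_commands(["Q1", "Q2"]))
```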
Thank you, I did not know about this tool.--Giftzwerg 88 (talk) 16:15, 26 November 2014 (UTC)
This section was archived on a request by: --Pasleim (talk) 18:42, 6 December 2014 (UTC)

batch autopatrol

If it's possible, could somebody mark all of Special:Contributions/10.68.17.174 as patrolled? The edits come from tools run by autopatrolled users; the tools apparently get logged out momentarily because of recent high-level changes on the servers (HHVM or something). There are hundreds of them every hour and it's no fun to mark them all by hand. It's only possible to view RecentChanges for unpatrolled/IP users for a few hours into the past. Alternatively, could somebody please tell me how to set the URL for RecentChanges to start at a time before the present? I can't get the API flags to work at all. Thanks for any help --Haplology (talk) 00:25, 26 November 2014 (UTC)
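On the URL question, a sketch of a recent-changes API query that starts at a given timestamp and walks backwards. It uses the standard list=recentchanges parameters (rcstart, rcdir, rcuser, rcshow); the function name and the chosen rcprop fields are illustrative, and rcshow=!patrolled only works for accounts with the patrol right.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://www.wikidata.org/w/api.php"


def unpatrolled_changes_url(user, start_timestamp, limit=500):
    """Build an API URL listing unpatrolled changes by one user.

    With rcdir=older, results begin at start_timestamp and go back in time.
    """
    params = {
        "action": "query",
        "list": "recentchanges",
        "rcuser": user,
        "rcshow": "!patrolled",
        "rcstart": start_timestamp,
        "rcdir": "older",
        "rcprop": "ids|title|timestamp",
        "rclimit": limit,
        "format": "json",
    }
    return API_ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    print(unpatrolled_changes_url("10.68.17.174", "2014-11-25T00:00:00Z"))
```

This only builds the URL; it cannot reach further back than the recent-changes table itself retains.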

Haplology: These edits are happening because the tool isn't using assert to check if it's properly logged in. The flood seems to be over now. Next time I'll put a softblock on it. Multichill (talk) 18:18, 26 November 2014 (UTC)
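A sketch of the assert check mentioned here, assuming the tool sends its requests as parameter dictionaries to the MediaWiki API. The assert parameter and the assertuserfailed/assertbotfailed error codes are part of the API; the helper names are hypothetical.

```python
def with_login_assertion(params, require_bot=False):
    """Return a copy of the request parameters with a login assertion added.

    With assert=user (or assert=bot) the API rejects the request when the
    session is logged out, instead of saving the edit under the IP address.
    """
    guarded = dict(params)
    guarded["assert"] = "bot" if require_bot else "user"
    return guarded


def login_was_lost(response):
    """Tell whether an API response reports a failed login assertion."""
    error_code = response.get("error", {}).get("code", "")
    return error_code in ("assertuserfailed", "assertbotfailed")


if __name__ == "__main__":
    print(with_login_assertion({"action": "wbeditentity", "id": "Q4115189"}))
```

A tool that sees login_was_lost return True would log in again and retry rather than continue editing.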
@Multichill: Just a note: Wikidata:Administrators' noticeboard/Archive/2014/10#Please indef soft block 10.68.0.0/16.
I don't think it is necessary to mark them all as patrolled unless we use a better patrolling system... Matěj Suchánek (talk) 19:07, 26 November 2014 (UTC)
The last edits of 10.68.17.174 were in November 2014 so I don't think we still need a bot doing the autopatrol --Pasleim (talk) 19:56, 30 January 2015 (UTC)
This section was archived on a request by: --Pasleim (talk) 19:56, 30 January 2015 (UTC)

Populated places in Sweden by municipality

Last week, some categories about populated places in Sweden by municipality were created. However, many of them still have no Wikidata links to the Swedish Wikipedia, where such categories also exist.

example:

  • Category:Populated places in Kiruna Municipality

should have a Wikidata link to

  • Kategori:Orter i Kiruna kommun

J 1982 (talk) 15:20, 28 November 2014 (UTC)

seems to be done --Pasleim (talk) 14:03, 30 January 2015 (UTC)
This section was archived on a request by: --Pasleim (talk) 14:03, 30 January 2015 (UTC)

Add description of categories in Esperanto

Can someone add a description in Esperanto (language code "eo") to all category items where the description is empty? The description would be "Vikimedia kategorio". --Venca24 (talk) 11:16, 30 November 2014 (UTC)
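The "only where the description is empty" condition can be sketched as below, assuming descriptions are held as a {language: description} dictionary. The function name is hypothetical and the text is passed in by the caller, so either proposed wording can be used.

```python
def fill_missing_description(descriptions, language, text):
    """Add a description for one language only when none exists yet.

    An entry that is absent or contains only whitespace counts as empty;
    an existing description is never overwritten.
    """
    if descriptions.get(language, "").strip():
        return dict(descriptions)
    updated = dict(descriptions)
    updated[language] = text
    return updated


if __name__ == "__main__":
    print(fill_missing_description({"en": "Wikimedia category"}, "eo", "Vikimedia kategorio"))
```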

Would "kategorio en Vikimedio" be acceptable? If so, please see my post in wrong category item descriptions, above. Popcorndude (talk) 00:25, 5 January 2015 (UTC)
Yes, it will. --Venca24 (talk) 20:50, 5 January 2015 (UTC)
This section was archived on a request by: Popcorndude (talk) 22:59, 26 February 2015 (UTC)

Fill out "symmetric" statements

When X is tagged as father of Y, a bot should really add "Y: child of X". Currently this is done by hand, which is not the best use of our time. Besides, it is usually done in a somewhat sloppy fashion: people copy the main part of the statement to remove the "constraint violation" but do not copy the qualifiers and sources, because that is so time-consuming.

Ideally, the bot should be time-conscious. For instance, if someone removes a wrong "son of X" statement and not the corresponding "father of Y", then the bot should remove "father of Y" rather than readd "son of X".
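The basic "add the missing half" step can be sketched as follows, assuming statements are available as (subject, property, value) tuples. The pairing of father (P22) with child (P40) is the one discussed in this thread; the function name and the sample item IDs are placeholders. The sketch does not copy qualifiers or sources and does not implement the time-conscious removal described above.

```python
# Property on the child item mapped to the property expected on the parent item.
INVERSE_OF = {"P22": "P40"}  # father (P22) implies child (P40) on the father's item


def missing_inverse_statements(statements, inverse_of=INVERSE_OF):
    """List inverse statements implied by the input but not present in it."""
    existing = set(statements)
    missing = []
    for subject, prop, value in statements:
        inverse = inverse_of.get(prop)
        if inverse is None:
            continue  # no documented inverse for this property
        candidate = (value, inverse, subject)
        if candidate not in existing and candidate not in missing:
            missing.append(candidate)
    return missing


if __name__ == "__main__":
    # Placeholder items: Q2 has father Q1, but Q1 has no child statement yet.
    print(missing_inverse_statements([("Q2", "P22", "Q1")]))
```

Keeping the property pairs in one explicit table means each pair can be documented and agreed on before the bot acts on it.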

user:Magnus Manske had proposed to do it once, but gave up during the bot approval process. See Wikidata:Requests for permissions/Bot/ImplicatorBot, apparently, some code is still available. -Zolo (talk) 15:30, 5 November 2014 (UTC)

Perhaps, now that some time has passed, User:Magnus Manske would reconsider? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:08, 5 November 2014 (UTC)
I suggest waiting for arbitrary access in WP and then deleting one half of the symmetric statements. Having redundant information in a database always causes trouble, no matter how intelligently the bots are programmed.--Pasleim (talk) 17:53, 5 November 2014 (UTC)
Arbitrary access alone will not solve the issue. If you want to know who is the father of X, it is of little use that item Y contains the statement "father of: X", because you would not know that you have to look in item Y. --Zolo (talk) 21:14, 5 November 2014 (UTC)
I agree with Zolo here. Emw (talk) 02:58, 6 November 2014 (UTC)
I would support a program like ImplicatorBot that adds certain types of entailed statements (e.g. symmetric, inverse, etc.), as long as how it did so was documented and agreed to by the community for each type of statement. Constraint templates on Property talk pages are likely the best shim for statements on properties for now. Ideally the source code would be publicly available. Emw (talk) 02:58, 6 November 2014 (UTC)
An example for father (P22) / child (P40) is at http://wdq.wmflabs.org/stats?action=missingpairs&prop1=22&prop2=40
It was processed the other day so the number reduced quite drastically.
However, we found that the constraints on these properties weren't set up correctly, so we propagated a few errors that would already have been visible on the constraint reports.
The advantage of them being made symmetric is that the errors became more visible and easier to fix (mostly done). --- Jura 03:52, 7 November 2014 (UTC)
@Jura: the tool shows the list a little more nicely than the in-wiki constraint violation report, but there is no "process it" option, is there?--Zolo (talk) 10:07, 8 November 2014 (UTC)
You need to reformat it yourself to paste it into http://tools.wmflabs.org/wikidata-todo/quick_statements.php ----- Jura 10:13, 8 November 2014 (UTC)
OK, I do not think it is a really good solution. The bot should really keep the qualifiers and sources. Plus, rethinking it, it seems important that when the two items have different values, it should somehow give priority to the more recent one. --Zolo (talk) 11:03, 8 November 2014 (UTC)
The reasoning is probably different for each type of "symmetric" item.
As source I indicated the item it was derived from. This has the advantage that when people fix one, they might also fix the other. --- Jura 11:48, 8 November 2014 (UTC)
Example: Q3260846 --- Jura 11:51, 8 November 2014 (UTC)
Clearly we need to start with properties where the logic is simple (like father/son).
Providing the other item as a source may be better than nothing, but this is clearly suboptimal. It is hard to reuse in Wikipedia, for example. Also, it seems conceptually incorrect: "imported from English Wikipedia" means imported from the website en.wikipedia.org, not "stated in item English Wikipedia (Q328)", so "imported from Ludovic-François Douillard" should mean imported from the human called Ludovic-François Douillard, not "stated in item Q17486077". A consistency-checking tool would rightfully find that a bit strange.
There will certainly be tricky cases, but this is the reason why we would need a single bot that we can document, discuss and monitor, rather than various humans hacking various semi-automated stopgaps. --Zolo (talk) 12:23, 8 November 2014 (UTC)

Before starting a bot to fill out symmetric statements, please check carefully whether the statements are really symmetric according to their definitions. E.g. present in work (P1441) and characters (P674) are not symmetric, although they may seem to be. They may become symmetric, but first we would have to change one or both of their definitions and the constraints based on them.--Shlomo (talk) 16:10, 9 November 2014 (UTC)

Minerals

Is it possible to get following data:

Adding UNII identifiers

I've noticed that UNII (P652) doesn't have much data, while a canonical list from the Food and Drug Administration (thus PD and freely reusable, cf http://www.nlm.nih.gov/copyright.html) does exist.

The data is tabulated as follows and has several existing identifiers for cross checking:

Legend (column: example value)
UNII: 007C07V77Z
PT: BIBENZYL
RN: 103-29-7
MF: C14H14
INCHIKEY: QWUWMCYKGHVNAV-UHFFFAOYSA-N
EINECS: 203-096-4
NCIt: (empty)
ITIS: (empty)
NCBI: (empty)
PLANTS: (empty)
SMILES: c1ccc(cc1)CCc2ccccc2
INN_ID: (empty)
UNII_TYPE: Ingredient Substance

It is available at: http://fdasis.nlm.nih.gov/srs/jsp/srs/uniiListDownload.jsp
What I think would be nice is to import the UNII identifiers, to add the data for the other properties where it is not yet present, and to create items for any entries that do not have one.
But I am not a chemist and while the data seems trustworthy, there might be issues.
I'm interested in this since, aside from Wikidata, I contribute to Open Food Facts, a cross-language open database (ODbL and CC) of food products, and some of the items on this list end up as ingredients, additives and sometimes Wikipedia articles in various languages :-)

Teolemon (talk) 13:31, 21 November 2014 (UTC)

We're talking 60k+ items --Teolemon (talk) 13:36, 21 November 2014 (UTC)
If the licence is compatible, I think it would be a great idea to import the UNII identifiers. If possible we should also link to Open Food Facts via an identifier. Some food items are notable enough and have a Wikipedia page and/or a Wikidata item. --Tobias1984 (talk) 16:43, 21 November 2014 (UTC)
Maybe Magnus can add it to Mix'n'Match so id's can be matched to items here. Multichill (talk) 19:28, 21 November 2014 (UTC)
Added (bottom of page). Currently trying to auto-match items. Also working on importing the other data in the set (e.g. SMILES, CAS) to match/import later. --Magnus Manske (talk) 20:18, 22 November 2014 (UTC)
Update: Have ~10K identified through name and/or CAS number. Will update Wikidata with the UNII IDs tomorrow. --Magnus Manske (talk) 23:55, 22 November 2014 (UTC)
@Magnus Manske: Hi, can you explain which parameter from which data source you used to match UNII IDs with Wikidata items? Thanks. Snipre (talk) 15:24, 24 November 2014 (UTC)
I used RN, which is CAS (for above example). --Magnus Manske (talk) 15:29, 24 November 2014 (UTC)
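A sketch of matching on the RN column, assuming the FDA download is tab-separated with a header row that uses the column names from the legend above, and that a lookup from CAS number to Wikidata item is already available. The function name and the item ID in the usage example are hypothetical.

```python
import csv
import io


def match_unii_by_cas(unii_tsv, cas_to_item):
    """Map Wikidata item IDs to UNII codes using the CAS number (RN column).

    Rows without an RN value, or whose RN is not in cas_to_item, are skipped.
    """
    reader = csv.DictReader(
        io.StringIO(unii_tsv), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    matches = {}
    for row in reader:
        cas = (row.get("RN") or "").strip()
        item_id = cas_to_item.get(cas)
        if cas and item_id:
            matches[item_id] = row["UNII"].strip()
    return matches


if __name__ == "__main__":
    sample = "UNII\tPT\tRN\n007C07V77Z\tBIBENZYL\t103-29-7\n"
    # "Q_EXAMPLE" is a placeholder, not the real item for bibenzyl.
    print(match_unii_by_cas(sample, {"103-29-7": "Q_EXAMPLE"}))
```

The result is only as reliable as the CAS numbers already stored on the items, so a match is a candidate to review rather than a confirmed identity.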
Hmm, maybe. It would be better to first check the CAS numbers of the items by cross-checking them between EN, DE and FR WP. Snipre (talk) 13:56, 27 November 2014 (UTC)
Open Food Facts has individual pages for additives (http://world.openfoodfacts.org/additives), food categories (http://world.openfoodfacts.org/categories) and brands (http://world.openfoodfacts.org/brands) in addition to products (which aren't the best option available, since there can be dozens of variations and different versions of Coke, with identical or different barcodes). If you see fit, I can open property proposals for some of those page slugs. Teolemon (talk) 20:21, 28 November 2014 (UTC)

Astronomical objects in the Index Catalog

There are hundreds of astronomical objects in the Index Catalog which have no label or description in English. I did a hundred or so of these by hand to see what's involved, and it seems to me a bot could generate labels and descriptions fairly easily with this logic:

This would give a description like "galaxy in the constellation Virgo" which is essentially what I was doing by hand (except I was looking them up so I could say spiral galaxy, elliptical galaxy, etc., but I don't think that's necessary for basic disambiguation). Thoughts? - PKM (talk) 03:16, 23 November 2014 (UTC)

PS most of these items don't have articles in EN wiki, but do have articles and infoboxes in BS, UK, SR, SH, KK, RU, etc. - PKM (talk) 20:21, 24 November 2014 (UTC)
I have made a script to do this (code here), but I have no idea how to obtain a list of items to edit. Any suggestions? Popcorndude (talk) 22:51, 28 February 2015 (UTC)
All items with catalog code (P528) you can get with WDQ: http://wdq.wmflabs.org/api?q=claim%5B528%5D --Pasleim (talk) 23:03, 28 February 2015 (UTC)
Thanks! It seems the problems I was having getting a list were due to including a 'P' before the property numbers. Popcorndude (talk) 02:58, 1 March 2015 (UTC)
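A small sketch of building that WDQ query so the 'P' prefix cannot slip in, assuming the endpoint and the claim[...] syntax from the reply above. The function name is hypothetical.

```python
from urllib.parse import quote

WDQ_API = "http://wdq.wmflabs.org/api"


def wdq_claim_url(property_id):
    """Build a WDQ URL for all items that have a claim with the given property.

    Accepts "P528", "p528", "528" or 528; WDQ expects the bare number.
    """
    number = str(property_id).upper().lstrip("P")
    if not number.isdigit():
        raise ValueError(f"not a property ID: {property_id!r}")
    query = quote(f"claim[{number}]")  # brackets become %5B and %5D
    return f"{WDQ_API}?q={query}"


if __name__ == "__main__":
    print(wdq_claim_url("P528"))
```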
thanks for picking this up! I'm not really a programmer, but I looked at your script and didn't see where it is inserting the phrase "in the constellation" between the P31 and P59 in the description. - PKM (talk) 07:45, 1 March 2015 (UTC)
Line 15, right in the middle:
item.editDescriptions({'en':item.claims['P31'][0].get()['labels']['en'] + " in the constellation " + item.claims['P59'][0].get()['labels']['en']})
Popcorndude (talk) 14:16, 1 March 2015 (UTC)
Doh! Thanks! - PKM (talk) 04:11, 5 March 2015 (UTC)
This section was archived on a request by: --Pasleim (talk) 20:28, 12 June 2015 (UTC)