Wikidata:Bot requests/Archive/2014/10

Importing IETF language code from French Wikipedia

Hello,

Could some bot import a few thousand data from French Wikipedia?

Recipe: For each page in w:fr:Catégorie:Inventaire de langues:

  • if it use the infobox template w:fr:Modèle:Infobox Langue
  • then if it use the parameter ietf
  • then if the parameter ietf is not empty
  • then if it does not include "," or "<"
  • then import it in the corresponding Wikidata item as IETF language tag (P305)

Visite fortuitement prolongée (talk) 20:22, 13 October 2014 (UTC)

With Catscan2 is easy extract the page that you want. If anyone can do it, I'll do it this week end --ValterVB (talk) 21:42, 13 October 2014 (UTC)
  Done. Added on 2218 item. --ValterVB (talk) 15:19, 18 October 2014 (UTC)
Thank you very much. Visite fortuitement prolongée (talk) 21:16, 19 October 2014 (UTC)
This section was archived on a request by: --Pasleim (talk) 19:51, 22 October 2014 (UTC)

Badges for Wikivoyage

Now e got the new badges, and the previous consensus was that in Wikivoyage, a star corresponds to FA, a guide to GA, and a usable to a recommended article. I would very much appreciate if someone will take

  • All the articles from this category and add a Wikidata badge of a recommended article; then do the same with all interlinked Wikivoyage categories (de, nl, pl, pt, ru, uk);
  • All the articles from this category and subsequently all interlinked categories, and add a Wikidata badge of a good article;
  • All the articles from this category and subsequently all interlinked categories, and add a Wikidata badge of a featured article.

If the article already have a correct badge, nothing should be done; if an article has an incorrect badge, it might be a good idea to create a log for manual treatment.

I will appreciate even more if such a job can be undertaken on a regular basis, similarly to how Wikipedia interwiki bots work.

Thanks a lot.--Ymblanter (talk) 07:32, 15 October 2014 (UTC)

@Ladsgroup: A job for you? --Pasleim (talk) 14:56, 15 October 2014 (UTC)
@Pasleim:Sure, consider it done Amir (talk) 18:21, 15 October 2014 (UTC)
Great, thank you.--Ymblanter (talk) 18:34, 15 October 2014 (UTC)
Done Amir (talk) 12:34, 17 October 2014 (UTC)
Thanks.--Ymblanter (talk) 14:05, 17 October 2014 (UTC)
This section was archived on a request by: --Pasleim (talk) 19:48, 22 October 2014 (UTC)

Bot Run for Opera Singers?

On this page [1] there are listed over a thousand items with constraint violations. There is missing occupation (P106) = singer (Q177220) however all or most of all of this items do have a subclass of singer (Q177220) like singer-songwriter (Q488205) or opera singer (Q2865819). Can somebody change the rules to singer (Q177220) or any subclass of singer (Q177220) ? Otherwise we need a bot to add additional singer (Q177220) to all opera singers .--Giftzwerg 88 (talk) 15:34, 24 October 2014 (UTC)

This doesn't really look like a bot request, more like a request for Ivan to improve the bot. Not sure what you're asking is possible in the current implementation. Would be a good feature if it's not already in there. Multichill (talk) 16:23, 24 October 2014 (UTC)
  Done. {{Constraint:Item}} does not allow specify class. But it allows to use item list: [2]. Please add more acceptable items if needed. — Ivan A. Krestinin (talk) 17:24, 24 October 2014 (UTC)
Great!--Giftzwerg 88 (talk) 18:22, 24 October 2014 (UTC)
This section was archived on a request by: Multichill (talk) 09:57, 25 October 2014 (UTC)

german Wiktionary: sorting of text modules, completion of similar words

I know, that this page is only for bot requests for Wikidata, but in our Wiktionary we don't have anyone who could do this. We need a bot who sorts the linked text modules in this order.
Furthermore we need a bot who adds similar (in wiktioanry existing) words (in german and other languages) to the last text module "Ähnlichkeiten" (english: similarities). If anyone could do this please respond to me and we could talk about the differences the words should have (of course with the feedback of my community). Please respond in an easy english :) Greetings Impériale (talk) 14:52, 10 October 2014 (UTC)

This section was archived on a request by: Impériale (talk) 14:17, 27 October 2014 (UTC)

Cultural monuments of Slovenia

I'd like to kindly ask a bot operator to populate the new property Slovene Cultural Heritage Register ID (P1587) with values already entered in :slwiki. The task is relatively straightforward, the monuments have values entered in infoboxes from where they should be extracted, and there are two variants:

  • refšt = x in one template or
  • designation1_number = x in another,

where x is a string with 1-5 digits not preceded by 0. Either one template or the other is used in an article. First, the following 111 objects should be done for testing, because for those I'm sure that they are well labelled (nationally significant monuments in sl:Kategorija:Kulturni spomeniki državnega pomena).

If this goes without problem, we can talk about locally significant monuments which are many more (around 500) described in :slwiki, but not all have corresponding Wikidata items yet. Thank you in advance,Yerpo Eh? 18:52, 31 October 2014 (UTC)

I withdraw my request, got tired of waiting for a response and did it semi-automatically myself. — Yerpo Eh? 09:04, 8 November 2014 (UTC)

This section was archived on a request by: --Pasleim (talk) 13:22, 11 November 2014 (UTC)

Remove repeated statements

Some items have Sandrart.net person ID (P1422) and also again the same code but written using catalog code (P528)+catalog (P972):sandrart, could you please remove the second group? With Sandrart.net person ID (P1422) is enough. Example: Aristotle (Q868).--Micru (talk) 21:01, 21 October 2014 (UTC)

  Done --Pasleim (talk) 23:21, 13 November 2014 (UTC)
This section was archived on a request by: --Pasleim (talk) 18:42, 6 December 2014 (UTC)
This section was archived on a request by: Sjoerd de Bruin (talk) 12:46, 6 January 2015 (UTC)

I have made several list, User:GZWDer/temp19  Done, User:GZWDer/temp20  Done, User:GZWDer/temp21]]  Done, User:GZWDer/temp22  Done, User:GZWDer/temp23  Done, User:GZWDer/temp24  Done, User:GZWDer/temp25  Done, User:GZWDer/temp26  Done. Each of them have links to enwiki and values of GeoNames ID (P1566). I tried to import some, but it is a huge work. Note there're some page deleted or moved without redirect.--GZWDer (talk) 06:10, 21 October 2014 (UTC)

@GZWDer: I can do it , but there is something of wrong: example Takht-e Qeysar (Q5846480) from your list I have added 12 like value, but I can't found it on en page, and I can't found the page on Geonames.org. What's the correct link on Geoname? And what's the source of the value? --ValterVB (talk) 18:20, 21 October 2014 (UTC)
@ValterVB: See [3] and [4]. If you don't trust the data you can just download and uncompass http://download.geonames.org/export/dump/alternateNames.zip , and get all lines contains "wikipedia.org" (Note the URL is encoded, and there're 450k+ such records). Note if you meet a disambigion page, you should get the move log and put the claim to the page that was moved to (e.g. [5]).--GZWDer (talk) 05:16, 22 October 2014 (UTC)
@GZWDer: My misunderstanding sorry, I thinked that source was enwiki, so no problem. For the second problem is necessary to know how format the URL, because AuthorityControl gadget isn't correct. So for value 12 what's the full url? --ValterVB (talk) 19:28, 22 October 2014 (UTC)
@ValterVB: http://sws.geonames.org/12/

P.S. Please don't put claims on disambigion pages. Instead your bot should put it to where the page moved to.--GZWDer (talk) 05:10, 23 October 2014 (UTC)

@GZWDer: Started, follow my bot if you want check :) . Rules: If P1566 exist, I skip the item, if is a disambiguation on en.wiki, I skip the item, if is a redirect I add P1566 to redirected page. Source: imported from Wikimedia project (P143)= GeoNames (Q830106) --ValterVB (talk) 17:32, 25 October 2014 (UTC)
Great to see Geonames here on Wikidata! A lot of GLAMs make use of it so this is a good way to get everything better connected.
Are you working on all Wikipedias? Multichill (talk) 19:23, 25 October 2014 (UTC)
I use the lists provided by GZWDer, so is only en.wiki. --ValterVB (talk) 19:36, 25 October 2014 (UTC)
Ok, downloaded the dump. Some numbers:n Moo
  • Total number of lines: 9529290
  • Wikipedia links: 470321
  • en.wikipedia links: 458757
  • ru.wikipedia links: 10041
  • Other Wikipedia links: 1523
So the dataset is rather biased towards English and someone invested some time in Russian. The 450.000-ish links should keep your bot busy for a while. If you manage to do 5000 edits a day (I doubt it), this will take you about 90 days (3 months). It will make it one of the most popular properties. Multichill (talk) 21:11, 25 October 2014 (UTC)
@ValterVB, Multichill: User:GZWDer/temp27 is the list for non-en wiki (not decoded). Because it is not very long, I can do it myself.--GZWDer (talk) 05:56, 26 October 2014 (UTC)
@Multichill: «5000 edits a day» The BOT is a bit faster, just now is 22 edits/min and probably more late I can double the speed. --ValterVB (talk) 09:18, 26 October 2014 (UTC)

I've got some questions about the linked items... Possibly this is a problem at Geonames, not here, but still. Item on Krylatskoe metro station in Moscow has been linked to Krylatskoye at Geonames, which is classified as "section of populated place" (PPLX) and has coordinates that are a long way from the station, appearing to be those of the center of the Krylatskoye District, but is linked to the WP article about the station. Same for Maryina Roshcha: edit, Geonames item – it even has "population : 66000", but links to en:Maryina Roshcha (Moscow Metro) instead of en:Maryina roshcha District). (I also tried searching Geonames for all items with "Station" in their names around Moscow, and many of them are several kilometers away from their actual locations – but that's another question.) It there a possibility to sort such things out? YLSS (talk) 23:10, 26 October 2014 (UTC)

And now also with Konkovo metro station: edit linked it to Geodata item on "section of populated place" which is marked in a totally different area of Moscow than en:Konkovo District actually is, while there is also another Geodata item on "railroad station (RSTN)" Kon'kovo, marked close to where the metro station actually is (though still not using the proper code "metro station (MTRO)"). YLSS (talk) 23:31, 26 October 2014 (UTC)
@YLSS, ValterVB: Now just skip part of (P361)=Moscow Metro (Q5499). For sorting such things out, there're a script to remove such wrong data.--GZWDer (talk) 06:10, 27 October 2014 (UTC)
Since I used to live in Konkovo, I know that the metro station is located in the district, and the railway station does not exist (at least not in or around Moscow).--Ymblanter (talk) 15:47, 2 November 2014 (UTC)
@GZWDer: The problem persists: [6][7]. YLSS (talk) 20:09, 6 November 2014 (UTC)

Type geographic location (Q2221906)

I set up a constraint violation report to track items that are not instance of (P31) geographic location (Q2221906) (or one of it's subclasses). This turned up a lot of items that should have instance of (P31) added. Easy way is of course to grab the data from Geonames, but this data is the wrong license (cc-by-sa instead of CC0) so I don't think we can touch that. Anyone want to run a bot to update these items? Most of the items seem to have an English Wikipedia article. So probably get the item, look for categories. If it's "Populated places in...." add human settlement (Q486972), if it's "villages in ..." add village (Q532), etc. Multichill (talk) 20:04, 29 October 2014 (UTC)

Doing it myself now. Have to work through 38.000 items so that might take a while. Multichill (talk) 13:41, 2 November 2014 (UTC)
Already processed quite a few items. Restarting with some more choices and now working on about 45.000 items. Let's see how many the bot is able to identify. Multichill (talk) 09:44, 8 November 2014 (UTC)

shwiki

@ValterVB: Dcirovic created 52615 articles about places in Italy based on Geonames data. Probably we should import the ID from sh:Šablon:Насеље у Италији. There should not be any false positives.--GZWDer (talk) 05:02, 7 November 2014 (UTC)

Now added country (P17) to all such pages.--GZWDer (talk) 13:59, 6 December 2014 (UTC)
@Pasleim: Probably you can help me to extract data from infoboxes in shwiki. Also coordinate location (P625) can be imported from infoboxes.--GZWDer (talk) 14:33, 6 December 2014 (UTC)
started my bot. --Pasleim (talk) 16:16, 6 December 2014 (UTC)
@Pasleim: coordinate location (P625) is also present in infoboxes, гшир and гдуж parameters.--GZWDer (talk) 10:19, 8 December 2014 (UTC)
Finished the import of geoname id, now started with geocoordinates. Can someone confirm that all items with this infobox are about human settlements? In this case I will also add P31 claims. --Pasleim (talk) 09:01, 11 December 2014 (UTC)

Done

All   Done --ValterVB (talk) 07:48, 8 November 2014 (UTC)

Great. Now that the automatic work is done, maybe User:Magnus Manske can at Geonames to Mix'n'Match? What do you think Magnus, dump is available. Multichill (talk) 09:48, 8 November 2014 (UTC)
Nice idea, but 9 million entries might overstretch the system. --Magnus Manske (talk) 21:46, 29 November 2014 (UTC)

Freebase identifiers

According to Wikidata:Database reports/Popular properties we currently have only about a million Freebase identifiers here (the next most common identifier is VIAF). It makes little sense to have this property half used; please import the remaining million items from [8] (CC0). --Nemo 14:31, 27 April 2014 (UTC)

Don't have sense to have freebase propriety is unuseful, don't is an ufficial source, don't is an Authorities --Rippitippi (talk) 23:40, 2 July 2014 (UTC)
It's useful for cross-referencing with other databases, as for all the other identifiers. The usefulness of the property should be discussed on its talk page, IMHO; we already have a million items using it, so it only makes sense to do it properly and finish the job. Half-baked choices don't help anything. --Nemo 11:20, 16 October 2014 (UTC)

Corporations infobox data of it.wiki

it:Template:Azienda is the infobox for corporations. I'm trying to integrate it with Wikidata a bit (currently it's only fetching the ISIN), but let's see what can be imported immediately, looking at Wikidata:List of_properties/Organization. I suggest that you start testing with the category for publishers, it:Categoria:Case editrici italiane, on which I'm keeping an eye now. Main ones:

  • data_fondazione -> P571
  • data_chiusura -> P576
    • causa_chiusura -> free text qualifier?
  • sede -> P159
  • industria -> P452

Others:

  • logo -> P154
  • foto -> P18?
  • tipo -> P31?
  • borse -> P414
  • luogo_fondazione -> P740
  • fondatori -> P112 (one statement per link?)
  • gruppo -> P749
  • filiali -> P355
  • prodotti -> P1056
  • fatturato, margine d'intermediazione, risultato operativo, utile netto -> ?
    • anno_fatturato, anno_margine d'intermediazione, anno_risultato operativo, anno_utile netto -> qualifier date
  • dipendenti -> ? (integer)
    • anno_dipendenti -> qualifier date
  • slogan -> P1451??
  • sito -> P856

--Federico Leva (BEIC) (talk) 12:04, 16 October 2014 (UTC)

Import Persondata from English Wikipedia

This is a huge collection of highly valuable human-curated metadata about 1.2 million articles on English Wikipedia (which is in danger of being deleted). Please see the discussion at Project Chat for more details. Kaldari (talk) 18:17, 23 October 2014 (UTC)

Moin Moin, I think it will be make sense to import the data for german Wikipedia, too. --Crazy1880 (talk) 05:30, 24 October 2014 (UTC)
One thing that there is in Personadata is the name in the order it should be sorted in. (Which can be non-trivial). This is something we really ought to have in Wikidata, to be able to easily sort the output of WDQ queries. It would also be useful to have, eg for c:Commons:Structured data, to be able to sort search hits or a category by artist. Can we please import this as a string. This may need the creation of a new property to store it in. Jheald (talk) 09:23, 24 October 2014 (UTC)
Proposal posted; see Wikidata:Property_proposal/Generic#Sort_key. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:47, 24 October 2014 (UTC)
Support: This is useful data, but clearly belongs in Wikidata, not hidden from view (and sometimes duplicating what we already have), in Wikipedia. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:40, 24 October 2014 (UTC)

Not completely feasible: If entered properly, the birth and death dates in Persondata follow WP:Manual of Style; the details are contained in WP:Manual of Style/Dates and numbers (MOSNUM) which calls for using the Julian calendar on and before 4 October 1582. After that, the Gregorian calendar is used, if it was in force in the location(s) discussed in the article. If the Julian calendar was in force in the relevant location(s), it is used. MOSNUM is not explicit about what to do if the relevant location used some other calendar, and it is in the period where the Gregorian calendar had not fully supplanted the Julian calendar. Persondata does not have any flag to indicate if the Gregorian or Julian calendar was used. This makes it essentially impossible for a bot to import dates before 1924, the first full year Greece adopted the Gregorian calendar for secular use. I will repeat this comment in the bot request. Jc3s5h (talk) 14:58, 24 October 2014 (UTC)

Please can you give specific examples of en.WP and de.WP articles where this is an issue, so that we can examine them in order to work on a solution? One answer may be to write such cases (based on date ranges and locations of birth/death) to a list for manual checking. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:34, 24 October 2014 (UTC)
Consider en:Johann Rudolph Ahle; the English Wikipedia asserts his birth and death dates are 24 December 1625 – 9 July 1673. The article does not state, at least in any obvious place, whether these dates are Julian or Gregorian dates. According to Explanatory Supplement to the Astronomical Ephemeris and the American Ephemeris and Nautical Almanac (1961, pp. 414-415), Catholic German states adopted the in Gregorian Calendar 1583 or 1584, but Protestant German states adopted it in 1700. It would require research to find which of these apply to Ahle. Indeed, the answer for the birth date might be different from the death date. I don't read German, so won't attempt to provide an example from the German Wikipedia. Jc3s5h (talk) 17:21, 24 October 2014 (UTC)
Thank you. Incidentally, I note that Johann Rudolph Ahle (Q69440) already has his DoB and DoD, both marked as Gregorian. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:41, 24 October 2014 (UTC)
First, does anyone know the name of whatever it is that displays information when you click on a link to a Wikidata item, or enter the number of a Wikidata item in the search box? Next, when you do that, you see the birth and death dates, with the word "Gregorian" as a superscript next to the dates. But it is misleading to say "both marked as Gregorian." The interface I just described always displays Gregorian dates. What the superscript means is that when the data is presented to the user, it should be displayed as-is. It could have been set to "Julian", which would mean the date should be converted to the Julian before presenting to the reader. Jc3s5h (talk) 18:07, 24 October 2014 (UTC)
No; the dates on, for example, Tim Berners-Lee (Q80) are not marked "Gregorian" (and nether, of course, are they marked "Julian"). Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:13, 24 October 2014 (UTC)
For whatever reason, the calender in which the date is to be displayed to readers is not stated for Tim Berners-Lee. Nevertheless, the date displayed, 8 June 1955, is almost certainly a Gregorian calendar date, since that is the calendar in force in 1955 in the United Kingdom. Jc3s5h (talk) 18:23, 24 October 2014 (UTC)
So, now that we have established that "the interface... always displays Gregorian dates" is not true (in the logic sense; no dishonesty implied), I can reiterate my statement that "Johann Rudolph Ahle (Q69440) already has DoB and DoD, both marked as Gregorian". We still need examples of en.WP (or de.WP) articles, where the import of ambiguous persondata dates to Wikidata would cause an issue. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 20:42, 24 October 2014 (UTC)

────────────────────────────────────────────────────────────────────────────────────────────────────I disagree. Lets take this step by step. The interface sometimes shows the name of a calendar as a superscript, and sometimes it doesn't. The superscript is advice to the person or software that has extracted the item from Wikidata that the date ought to be displayed to the reader in the stated calendar. But the interface does not follow its own advice. Whatever has been entered as the date into the database is displayed as-is.

According to date of birth, it contains Data type "time". The associated talk page contains a link to Dates and times which says "The calendar model used for saving the data is always the proleptic Gregorian calendar according to ISO 8601...." Since the interface does not make any conversions when it displays the date, the date displayed is always a Gregorian calendar date (or else the date is wrong). For an example of a birth date that is stored in the database as a Gregorian date but is suggested to be displayed as a Julian date, see Q935. Jc3s5h (talk) 21:27, 24 October 2014 (UTC)

It is patently true, and visible to all, that next to each of the two dates in question is a label which says "Gregorian". Whether those dates and/or labels are correct or not, and whether the method of storing dates in MediaWiki is optimum, are separate issue; the dates are thus labelled. And none of this is relevant to the question at hand; about importing data from persondata in Wikipedias. What is pertinent is that we still need examples as described in my earlier posts. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 22:12, 24 October 2014 (UTC)
The subscripted words are not labels. You have two English examples. Perhaps a German-speaker will provide some examples from the German Wikipedia. Jc3s5h (talk) 22:17, 24 October 2014 (UTC)
An example of an incorrect Wikidata entry is Johann Sebastian Bach (Q1339), who was born 21 March 1685 (Julian calendar). This may be verified by reading this article from the Guardian. The correct Julian calendar date is contained in the Persondata of the English Wikipedia article. The German Wikipedia article has the German analog of the Persondata template, and contains this: "GEBURTSDATUM=21. März 1685". I don't know what the rules are for filling in values in the German Personendaten template, but the copy from German Wikipedia to Wikidata is wrong; the German and Russian Wikipedias are given as references for the Wikidata value. Jc3s5h (talk) 22:41, 24 October 2014 (UTC)
I have corrected the birth date data for Johann Sebastian Bach (Q1339). Jc3s5h (talk) 18:20, 25 October 2014 (UTC)
My guess is that it is not "in danger of being deleted", but rather "in danger of being moved somewhere else", such as to wikidata. The way wikipedia works is it is almost impossible to delete anything (that is pretty inherent to the Internet too), and without looking at the discussion of deleting it I would guess that the proposal was made "because it could better be centrally located on wikidata". 76.24.193.7 18:08, 26 October 2014 (UTC)
@Jc3s5h: (about the UI): dates are always stored in Gregorian, but when calendarvalue is set to Julian, it it supposed to (and used to) be displayed in Julian. There is a bad bug at bugzilla:70398.--Zolo (talk) 15:01, 28 October 2014 (UTC)
@Zolo: The bug 70398 has been marked as a duplicate of bugzilla:70395. Jc3s5h (talk) 17:07, 20 November 2014 (UTC)

Clarity of scope

Discussion of such things as the DoB of Johann Sebastian Bach (Q1339), which was not - and would never be - imported from Persondata, is irrelevant to the issue at hand.

We need to decide whether or not to import Persondata; and if so which data items to import (or discard). If we decide not to import any, or when we are done, we need to let the en.WP and de.WP communities know, so that thy may proceed accordingly. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:37, 27 October 2014 (UTC)

Agreed, I'll split this up into separate requests that can be discussed separately... Kaldari (talk) 23:50, 28 October 2014 (UTC)
Good move. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 19:56, 29 October 2014 (UTC)

Categories of data

Import NAME from Persondata

This is supposed to be the sortable name, i.e. "surname, firstname", but isn't 100% reliable. We probably only want to import this if the DEFAULSORT value isn't set (since it's more reliable). Currently there is no property to import this into, but see Wikidata:Property proposal/Generic#Sort_key. See en:Wikipedia:Persondata#Name and titles for more info. Kaldari (talk) 23:50, 28 October 2014 (UTC)

  • Most I see are "firstname surname", not in sortable order. Maybe just write as an alias, if not already present as one, or as the label?

Import ALTERNATIVE NAMES from Persondata

This is a comma separated list of aliases. Should be pretty straightforward to import as Wikidata aliases, just be sure to check for existing duplicates. See en:Wikipedia:Persondata#Alternative names for more info. Kaldari (talk) 23:50, 28 October 2014 (UTC)

  Support. Check they don't match the existing en-label, too. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 19:50, 29 October 2014 (UTC)

Import SHORT DESCRIPTION from Persondata

This is a short description of the person, i.e. 'German physicist'. Should be pretty straightforward to import as a Wikidata description, although we may want to exclude any longer than 12 words or so. See en:Wikipedia:Persondata#Short description for more info. Kaldari (talk) 23:50, 28 October 2014 (UTC)

  Support, where no description already exists. Otherwise, discard. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 19:51, 29 October 2014 (UTC)

  Support, as the only Persondata field that doesn't seem to have any major complaints about accuracy.—Msmarmalade (talk) 12:09, 20 November 2014 (UTC)

  Support, although I am really here to mention (in case anyone missed it) that the Short Description missing category is now empty ie all Persondata has a short description. Periglio (talk) 00:43, 15 December 2014 (UTC)

  Done --Pasleim (talk) 15:58, 11 February 2015 (UTC)

Import DATE OF BIRTH and DATE OF DEATH from Persondata

These follow WP:Manual of Style/Dates and numbers (MOSNUM) and may include link syntax (which should be stripped out). The dates are assumed to be Julian calendar dates if they are on or before 4 October 1582. For dates between 1582 and 1923, there is no way to reliably tell which calendar applies, although there are lots of ways to guess. See en:Wikipedia:Persondata#Dates of birth and death for more info. Kaldari (talk) 23:50, 28 October 2014 (UTC)

A lot of data was already imported by Reinheitsgebot, like these.--GZWDer (talk) 05:04, 8 January 2015 (UTC)

Import PLACE OF BIRTH and PLACE OF DEATH from Persondata

Usually formatted as 'City/Village, State/Province, Country'; or 'City/Village, Country'; or 'State/Province, Country'. May include link syntax. See en:Wikipedia:Persondata#Places of birth and death for more info. Kaldari (talk) 23:50, 28 October 2014 (UTC)

Even with link markup there might be problems. Take en:Keshorn Walcott as an example. The place of birth value in persondata is [[Saint Catherine Parish]], Trinidad and Tobago. However, en:Saint Catherine Parish is located in Jamaica and not Trinidad and Tobago. How to proceed? --Pasleim (talk) 12:08, 28 February 2015 (UTC)

Conversation continued

My personal analysis seems to be showing that there is only a need for the short description parameter to be copied to the Wikidata description. Combined with the fact that this parameter was successfully completed last month, what do I need to do to make this happen? I can easily provide a complete Persondata extract matching "Q-Code" and "Description" in any format if someone is able to do the updating Wikidata side. Periglio (talk) 02:16, 18 January 2015 (UTC)

If anyone needs convincing a description is needed, try searching for "Charles Robinson" to find a potential match for w:Charles Robinson (cricketer) one of my latest no link to Wikidata flags Periglio (talk) 23:43, 18 January 2015 (UTC)
started my bot, see [9] --Pasleim (talk) 19:40, 19 January 2015 (UTC)
I have suggested some tracking categories to enable easier transfer of information by bots. George Edward CTalkContributions 17:17, 23 January 2015 (UTC)
I have performed a run of 1000 random Persondata records. 25 of these had dates not available on Wikidata. Taking a liberty with my small sample size, this indicates about 30,000 dates that could be copied across from wikidata. (only where no Wikidata dates exist). Having said that, if someone is extracting data, the Birth and Date templates would be more reliable than the Persondata template. Periglio (talk) 17:12, 25 January 2015 (UTC)
Extracting dates from templates is still a major problem as in Wikipedia normally no calender is indicated. In Wikidata, however, we need to assign a calender (Julian/Gregorian) to all dates --Pasleim (talk) 19:59, 30 January 2015 (UTC)

Final conclusion

My personal view is that Wikidata has extracted all it can from the Persondata template and I am hoping to convince everyone that this bot request may be closed. The EN-description has pretty much been copied over now, and frankly, this is the only useful data that could be extracted.

The discrepancies I have been finding are largely down to dodgy dates in Persondata that were copied to Wikidata. Mainly different years and dates, but sometimes as dates of death on living people. As it is hidden, Persondata was not a reliable source. My experience with Persondata shows that there is a lot of hidden vandalism lurking. The good news is that the correct data is available in other templates - birth/death templates, DEFAULTSORT and Infobox (place of birth/death). This are subject to public scrutiny and hopefully a much better source for any Wikidata bots.

My conclusion is to say to Wikipedia, "Thank you for Persondata, we have extracted all we need and no longer require access". Periglio (talk) 20:00, 13 March 2015 (UTC)

@Pasleim, Kaldari, GZWDer, Periglio, Msmarmalade, Jheald: We should now implement this. But how? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:28, 18 April 2015 (UTC)

@Pigsonthewing: (I hope I've understood your question) I've got a rough plan (here), which you're welcome to take a look at and see if it's of any use. I don't know how to make TfD/RfCs (or bots), but I think we should confirm the deprecation (from this TfD). Then we can move on to another RfC for methodical deletion. —Msmarmalade (talk) 09:39, 21 April 2015 (UTC)
Yes, that was what I meant. What, if anything, is currently blocking progress? How can I help? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:10, 21 April 2015 (UTC)
@Pigsonthewing: We only really need community consensus to move on to the next step. I've made a draft RfC in the same style as the last RfC. I'd appreciate any advice.—Msmarmalade (talk) 11:41, 22 April 2015 (UTC)
@Msmarmalade: Thank you. I have made some (minor) edits there. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:49, 22 April 2015 (UTC)
@Pigsonthewing:I did not quite understand your question either. As far as Wikidata goes, and as this is a Wikidata talk page, all that is required is to remove the bot request. Further discussion on Persondata should be at Wikipedia. Periglio (talk) 11:57, 21 April 2015 (UTC)

I'm going to archive this discussion as the bot request is completed. Please hold further discussions about Persondata on enwiki --Pasleim (talk) 11:29, 25 April 2015 (UTC)

This section was archived on a request by: --Pasleim (talk) 11:29, 25 April 2015 (UTC)

Please remove erroneous/unreliable country of citizenship (P27) statements

Can somebody remove all changes by User:GerardM from July 9th to 10th, 2014, which add a statement country of citizenship (P27) Poland (Q36). They are unsourced, many of them are obviously erroneous and the method of deducing this statements given by the author is (1) not correct and (2) doesn't correspond with all changes he made. Apparently, the author is not going to help with correcting this errors, therefore I suggest to remove all statements of the "infected" group.--Shlomo (talk) 10:17, 23 October 2014 (UTC)

The method is as advertised and yes, there are errors. Errors that exist in the source to the same extend. I am happy to collaborate with people who are civil towards me. I am not willing nor able to check every individual edit. GerardM (talk) 10:32, 23 October 2014 (UTC)
I am not willing nor able to check every individual edit - then please do not edit at all.--Mad melone (talk) 11:38, 23 October 2014 (UTC)
@GerardM:
  1. Even if the method were as advertised, the method is bad. Place of birth in today's Polish republic doesn't indicate polish nationality - not even in these days, the less centuries ago.
  2. The method apparently is not as advertised, at least doesn't explain edits like Special:Diff/143458975, Special:Diff/143489927, Special:Diff/143489685 a.s.o.
  3. I asked for your help in a quite civilized manner twice, and the only respond I got was something like Do It Yourself If You Dare. That's why I'm asking for help here.
  4. Other editors are not willing nor able to check every your individual edit either. What do you suggest to do?
  5. This thread is not about whether to remove your edits from the described set, it's about how to do it.
Regards,--Shlomo (talk) 13:08, 23 October 2014 (UTC)