Wikidata:Bot requests/Archive/2016/03


Linking Mexican municipalities

Hello! Would it be possible to link all the articles in this category to their corresponding item in Wikidata? All of the articles have an interwiki link to English Wikipedia, but some links may not be consistent, as the naming system isn't the same for all municipalities. -Theklan (talk) 14:57, 26 March 2016 (UTC)

  Done (at ~99%). --XXN, 15:18, 29 March 2016 (UTC)
This section was archived on a request by: XXN, 15:18, 29 March 2016 (UTC)

Requesting bot flag on wikidata

Hi, my name is Ziyad and I'm running User:ZiyadBot, a pywikibot bot that is already working on AR-Wikipedia. I'm going to use it on Wikidata mainly to add coordinates from Arabic Wikipedia. Here is an example. --Ziad (talk) 15:40, 30 March 2016 (UTC)

Go to Wikidata:Requests for permissions/Bot. Matěj Suchánek (talk) 15:49, 30 March 2016 (UTC)
This section was archived on a request by: --Pasleim (talk) 17:51, 5 April 2016 (UTC)

Import missing coordinates from dewiki

Hi, apparently there are lots of pages in de.wikipedia that have coordinates, but the corresponding Wikidata items don't. For example for the administrative district of de:Category:Regierungsbezirk Stuttgart, user Edgars2007 has generated a list of ~530 such articles/items. See also Wikidata_talk:Bot_requests#Question_to_operators_of_Coordinate_bots. Can someone on a regular basis import missing coordinates to Wikidata, covering new articles? The use of having complete coordinates in Wikidata that I see is that it makes Magnus' wikishootme more useful - certainly there are other benefits too. Thanks --Dealerofsalvation (talk) 19:57, 27 March 2016 (UTC)

I'll have a look; anyone who has a working import bot please go ahead, I might take a while! FWIW, some entries on that list are biographies, without coordinates... --Magnus Manske (talk) 17:28, 28 March 2016 (UTC)
Magnus Manske, of course, I understand, that you didn't look very carefully (I wouldn't either), but this guy, for example, has coordinate link in lead, so theoretically article has coords :) If somebody is taking this, maybe the user could also take this. I could do that myself, but not sure Multichill's coordinate import.py is adding the right precision (see test edits for Aizpurve (Q16350572), Q16350730 and Aizpurve (Q16350572) and lvwiki), but maybe that is Wikipedia's problem? --Edgars2007 (talk) 17:47, 28 March 2016 (UTC)
Update: Importing ~22K coordinates from dewiki now. --Magnus Manske (talk) 19:59, 28 March 2016 (UTC)
@Magnus Manske: Oh, skip any items that already have coordinate location (P625) as a qualifier or you'll have a lot of angry people on your talk page. If you're using pywikibot, then the bot is already doing that. Multichill (talk) 20:37, 28 March 2016 (UTC)
And isn't the precision too high? See decimal coordinates here. --Edgars2007 (talk) 20:47, 28 March 2016 (UTC)
@Multichill: Only using items that do not have coordinate location (P625) as statement or qualifier. @Edgars2007: Using the precision I am given. Why would "too high" be a problem? --Magnus Manske (talk) 13:53, 29 March 2016 (UTC)
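The skip rule discussed above (no import when P625 already appears as a statement or as a qualifier) can be sketched as follows. This is a minimal illustration, assuming the item is available as a dict in the standard Wikibase entity JSON layout; the function name `has_coordinate` is hypothetical and is not taken from any bot mentioned in this thread.

```python
COORDINATE_PROPERTY = "P625"  # coordinate location, as named in the thread


def has_coordinate(entity, prop=COORDINATE_PROPERTY):
    """Return True if `prop` is used as a main statement or as a qualifier.

    `entity` is assumed to be a dict in the Wikibase entity JSON layout:
    entity["claims"][property_id] is a list of statements, and each
    statement may carry a "qualifiers" dict keyed by property ID.
    """
    claims = entity.get("claims", {})
    # Case 1: the property is used directly as a statement.
    if claims.get(prop):
        return True
    # Case 2: the property is used as a qualifier on any other statement.
    for statements in claims.values():
        for statement in statements:
            if prop in statement.get("qualifiers", {}):
                return True
    return False
```

An importer would call this once per item and skip the item whenever it returns True.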
This section was archived on a request by: --Edgars2007 (talk) 07:55, 7 April 2016 (UTC)

10,000 csv file of Latin / Welsh (cy) species of birds.

This csv file has just been released, after 20 years of work on it, by Uni of Wales / Llen Natur. Is there a tool to add the Welsh names to WD (similar to QuickStatements perhaps)? Llywelyn2000 (talk) 23:09, 1 March 2016 (UTC)

Second question on this. I would like to use the above database to create articles on cywiki. We have no auto taxobox templates, yet, on cywiki. Should I start creating them (for above birds articles) with AWB or is there a way to bring them in automatically from WD? In theory, if the hierarchy is on WD, then all I need to include in the infobox is the name of the species and WD would populate the rest automatically? Llywelyn2000 (talk) 07:24, 2 March 2016 (UTC)
Regarding the first question, how would the names be matched to existing items? Would matching the Latin name to taxon name (P225) work? Regarding the second question, some wikis do have modules/templates which can fill in entire infoboxes using Wikidata data, but they have to be set up locally first, it doesn't just happen automatically. :) I'm not sure if anyone has anything for taxon items yet though (and I don't have time to look for examples right now). - Nikki (talk) 08:14, 2 March 2016 (UTC)
Yes, could you give us a link to that csv file? We could then take a look and give a better answer. About taxoboxes, see Module:Taxobox (Q18091359). --Edgars2007 (talk) 08:20, 2 March 2016 (UTC)
Great! Many thanks! Here's a link to the db Here's the link to version 2 of the db. You're welcome to get them on WD too if you like! C:Latin, D=English, E=Welsh singular, F=Welsh plural. Llywelyn2000 (talk) 14:09, 2 March 2016 (UTC)
Around 7,500 species of around 9,400 included in this dataset have a matching taxon name (P225). --Succu (talk) 15:28, 2 March 2016 (UTC)
Thanks Succu! So 7,500 of the Latin names have matched up to taxon name (P225). Can we fill the gap of the missing 2,500, somehow? Llywelyn2000 (talk) 16:21, 2 March 2016 (UTC)
The list contains some families and subspecies too. It's likely we have the missing species names under a species synonym. I have a list of them of course. Is there a website or publication which contains the information included in the list? If we want to add the information, we have to source it. --Succu (talk) 16:49, 2 March 2016 (UTC)
Yes. The website is Llen Natur, who I've been in contact with during the last 3 years. This db has been included in their Dictionary of Species. The whole dictionary has around 17,000 species with Welsh names. We have another project, to bring in the prefered image from Wikidata, with User:Magnus Manske. Type in 'buwch' in the search box and you will see the pilot, which was successful - 50 images of ladybirds. The whole db can bring in images from Commons, thus creating quite a large illustrated dictionary. Llywelyn2000 (talk) 11:10, 3 March 2016 (UTC)
@Succu, Edgars2007: As you suggest, the missing ones are Order and Family - both of which we can create articles on cywiki. Can we also improve the db (ie relevant info such as size of adult, geographical location etc) to create better articles, and can we please add the Welsh names to WD? Thanks! Llywelyn2000 (talk) 16:49, 4 March 2016 (UTC)
Llywelyn2000: Could you please create a new data object for this website and add some properties. I will use this item for references. The names will go to taxon common name (P1843). --Succu (talk) 17:18, 4 March 2016 (UTC)
All done: Gwefan Llên Natur (Q23002367). Thanks. Llywelyn2000 (talk) 17:33, 4 March 2016 (UTC)
Thx Llywelyn2000, but an English label would be helpful. The name for penguin (Q9147) and other families is capitalized, e.g. PENGWINIAID. Should I add Pengwiniaid or pengwiniaid as the property value? --Succu (talk) 09:13, 6 March 2016 (UTC)
I see that the Latin family name diverts to the common name on en! I'd keep two articles - family and species, as most other languages seem to do. Spheniscinae (Q3966628) has an upper case, so let's stick to that, and I do think we should have family (teulu) in brackets to differentiate if possible. So, either Pengwiniaid (teulu) or Spheniscinae (teulu). I'd go for the latter / Latin myself (and take the consequences...!) Llywelyn2000 (talk) 09:53, 6 March 2016 (UTC)

In your dataset I found the following errors at family group level (column latin name):

  • MEGAPODIAE => MEGAPODIIDAE
  • PARULIADAE => PARULIAIDAE
  • PARADISAEDAE => PARADISAEIDAE
  • A. => EMBERIZINAE
  • B. => CATAMBLYRHYNCHINAE
  • C. => CARDINALINAE
  • D. => THRAUPINAE
  • E. => TERSININAE

Please check the spelling of the following names: Stercoraridae, Pteroclididae, Loridae, Xenicidae, Eopsaltridae, Nectarinidae, Emberizinae, Catamblyrhynchinae, Cardinalinae, Tersininae and Callaeidae. Looks like sometimes the ending is wrong: -idae/-iidae (e.g. Loridae). --Succu (talk) 10:22, 7 March 2016 (UTC)

Thanks. I've passed on the information to the editors of the dictionary who will get back to us or leave a message directly here, shortly. Llywelyn2000 (talk) 13:54, 7 March 2016 (UTC)
David (ed) has come back to me and suggests the following:
Stercoraridae > Stercorariidae
Eopsaltridae (= Eopsaltriidae) > Petroicidae
Nectarinidae > Nectariniidae
Catamblyrhynchinae > Thraupidae
Cardinalinae > Thraupidae
Tersininae > Thraupidae
Many thanks. Llywelyn2000 (talk) 12:50, 8 March 2016 (UTC)
I've amended the db as per David's (editor) request. new version of the db. Llywelyn2000 (talk) 17:02, 8 March 2016 (UTC)
Then I don't have to ask. ;) Thanks. --Succu (talk) 17:31, 8 March 2016 (UTC)
Is there a tool to add the Welsh names to WD? Can I do it? Llywelyn2000 (talk) 12:45, 9 March 2016 (UTC)
Be patient. I'm currently reworking parts of my bot. Then I'll add the names. --Succu (talk) 16:51, 9 March 2016 (UTC)
Ah! Thanks! I thought you expected me to do it! Llywelyn2000 (talk) 17:09, 9 March 2016 (UTC)

Just realised that you could also upload/match columns F and G. F is the plural and G = grammatical gender (Q162378). Is this possible? Llywelyn2000 (talk) 15:43, 10 March 2016 (UTC)

Your dataset contains 9659 scientific names (178 (sub)families, 9399 species and 82 subspecies). My bot matched them against taxon name (P225) and found 7567 species (remaining 1832 species) and 17 subspecies (remaining 65 subspecies). The bot added a cy label only when no label was present (7311 labels for species and 17 for subspecies). Because of the uppercase problem I did nothing for the (sub)families. --Succu (talk) 12:56, 15 March 2016 (UTC)
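The matching step described above (look each scientific name up against taxon name (P225), and add a cy label only where none exists) can be sketched roughly as below. This is an illustration under assumptions, not Succu's bot: the column names `latin` and `welsh`, the function name, and the two lookup structures are all hypothetical, and the sample data in the usage note is invented placeholder data.

```python
import csv
import io


def plan_cy_labels(csv_text, items_by_taxon_name, existing_cy_labels):
    """Return (labels_to_add, unmatched_names).

    csv_text: CSV with a header row; the column names "latin" and "welsh"
        are hypothetical stand-ins for the spreadsheet columns.
    items_by_taxon_name: dict mapping a P225 value to an item ID.
    existing_cy_labels: set of item IDs that already have a Welsh label.
    """
    to_add = {}
    unmatched = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Collapse doubled or trailing spaces before matching.
        name = " ".join(row["latin"].split())
        item = items_by_taxon_name.get(name)
        if item is None:
            unmatched.append(name)  # candidate for a synonym lookup by hand
        elif item not in existing_cy_labels:
            to_add[item] = row["welsh"].strip()  # never overwrite a label
    return to_add, unmatched
```

With placeholder rows such as `Genus alpha,enw un`, the function returns the labels that are safe to add and the names left over for manual review.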
Oh that's absolutely brilliant! Next step is to get the Wikidata taxobox Module:Taxobox (Q18091359) working on cy-wiki. Many thanks!!! Llywelyn2000 (talk) 15:46, 15 March 2016 (UTC)
@Succu: I'm trying to create a list of birds with Welsh names on cy-wiki, and I'm using Gwefan Llên Natur (Q23002367) to do so. But it doesn't seem to work. Did you add Gwefan Llên Natur (Q23002367)? Any help would be appreciated. Llywelyn2000 (talk) 17:47, 15 March 2016 (UTC)
I didn't add taxon common name (P1843) today. So I wasn't using Gwefan Llên Natur (Q23002367). I'll have to write some lines of code to make sure the monolingual value isn't present. Maybe I can do this tomorrow. --Succu (talk) 18:29, 15 March 2016 (UTC)
'monolingual value'? sorry, to what are you referring? Llywelyn2000 (talk) 10:22, 16 March 2016 (UTC)
See Datatypes. --Succu (talk) 12:16, 16 March 2016 (UTC)
I understand the word 'monolingual' but not what you say, as all those added will be in both Latin and Welsh (as well as a number of other languages) therefore bilingual? By the way, I have uploaded all species with Welsh names (including ferns, vertebrates, fungus...) here from same dictionary (Llen Natur). No families, subfamilies etc this time. Shall I make a new application or do you want to take it under this discussion? Llywelyn2000 (talk) 12:38, 16 March 2016 (UTC)
We can do it here, but your list contains all sorts of names. What does "is-rh." mean in "Asparagus officinalis is-rh. prostratus"? That's not part of a scientific name. Your "latin name" is the scientific or taxon name. You'll find them in P225. Local names such as those in your list go to taxon common name (P1843) with the language information. --Succu (talk) 13:03, 16 March 2016 (UTC) BTW: The bot is nearly done. --Succu (talk) 13:03, 16 March 2016 (UTC)
@Succu: Great! "is-rh" = sub-species. Llywelyn2000 (talk) 13:47, 16 March 2016 (UTC)
Do you have an updated list? I found some more issues (e.g. missing spaces). Maybe you should double check the list. Regards --Succu (talk) 22:41, 19 March 2016 (UTC)
No, that's the one sent to me by Bangor University, and that's on the website now. Not sure why there are 'missing spaces', or why that is a problem. Unless, maybe the upload to Google drive can cause that problem? OR: circumflex, diaeresis etc in Welsh need UTF-8. Llywelyn2000 (talk) 15:54, 22 March 2016 (UTC)
Llywelyn2000: I'll do this in the next days. But be aware that some possible matches are omitted. --Succu (talk) 19:39, 30 March 2016 (UTC)

@Succu: Hi. Did you manage to do these? Llywelyn2000 (talk) 19:10, 10 April 2016 (UTC)

Yes, it's done. --Succu (talk) 19:34, 10 April 2016 (UTC)
This section was archived on a request by: Jura 08:14, 8 April 2016 (UTC)

References for ISO-639-3 and Glottolog language codes

It would be nice if a bot could add references for the ISO 639-2 code (P219), ISO 639-3 code (P220), ISO 639-5 code (P1798) and Glottolog code (P1394) in the language elements (eg. English (Q1860), individual language or Germanic languages (Q21200), language family)

  • For ISO 639-2/3/5, the reference is: http://www-01.sil.org/iso639-3/documentation.asp?id=<ISO 639-X code>
  • For glottolog code, the reference is: http://glottolog.org/resource/languoid/id/<glottolog code>
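As a small sketch, the two URL patterns above can be filled in per property like this. The property IDs and URL templates come from the request; the function name `reference_url` and the sample Glottolog-style code in the usage note are hypothetical.

```python
from urllib.parse import quote

SIL_URL = "http://www-01.sil.org/iso639-3/documentation.asp?id={code}"
GLOTTOLOG_URL = "http://glottolog.org/resource/languoid/id/{code}"

# Property IDs as listed in the request above.
URL_PATTERNS = {
    "P219": SIL_URL,         # ISO 639-2 code
    "P220": SIL_URL,         # ISO 639-3 code
    "P1798": SIL_URL,        # ISO 639-5 code
    "P1394": GLOTTOLOG_URL,  # Glottolog code
}


def reference_url(prop, code):
    """Build the reference URL for an identifier value of `prop`."""
    # Percent-encode the code so unexpected characters cannot break the URL.
    return URL_PATTERNS[prop].format(code=quote(code, safe=""))
```

For example, `reference_url("P220", "eng")` yields the SIL documentation URL with `id=eng`, and `reference_url("P1394", "abcd1234")` yields the Glottolog languoid URL for that placeholder code.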

Regards, Şÿℵדαχ₮ɘɼɾ๏ʁ 17:19, 7 March 2016 (UTC)

This seems unnecessary to me, see Help:Sources/Items_not_needing_sources#When_the_item_has_a_statement_that_refers_to_an_external_source. We can define URLs for identifiers as part of the property itself. - Nikki (talk) 18:42, 7 March 2016 (UTC)
This section was archived on a request by: Jura 08:14, 8 April 2016 (UTC)

desc: "sv:Italiens kommuner" -> "sv:kommun i Italien"

A grammatically awkward description in Swedish (sv) has found its way into many items about Italian municipalities. -- Innocent bystander (talk) 06:46, 28 March 2016 (UTC)

Like this? Edoderoo (talk) 20:17, 29 March 2016 (UTC)
  • @Edoderoo (if you plan to work on this), I just realised that Russian description in Montebello sul Sangro (Q3562) (item from your diff) is wrong: "Коммуны Италии" – is plural, but singular needed; correct description is "коммуна Италии" (like is in Alghero (Q166282) and others), and it's necessary to correct this. Moreover, maybe you want to add at the same time Romanian description for these items: "comună din Italia"? :) --XXN, 21:02, 29 March 2016 (UTC)
See this diff ... when I get confirmation that the sv/Swedish text is fine, I can run it tomorrow. Edoderoo (talk) 21:36, 29 March 2016 (UTC)
@Edoderoo: Run like Forrest Gump (Q134773)! Swedish looks fine in your example! --- Innocent bystander (talk) 16:00, 31 March 2016 (UTC)
Done ... Let me know if there are more requests for setting a description based on a wd-query. Those scripts are relatively easy to make, but can save a lot of painful manual work. Edoderoo (talk) 06:50, 1 April 2016 (UTC)
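The description clean-up discussed in this section can be sketched as a pure function that decides which descriptions to change for one item. This is only an illustration, not Edoderoo's script: the function name and the two lookup tables are assumptions, while the description strings are the ones quoted in the thread.

```python
# Old description -> corrected description, per language (from the thread).
REPLACEMENTS = {
    "sv": ("Italiens kommuner", "kommun i Italien"),
    "ru": ("Коммуны Италии", "коммуна Италии"),
}
# Descriptions to add only when the language has none yet.
ADD_IF_MISSING = {"ro": "comună din Italia"}


def plan_description_edits(descriptions):
    """Return {language: new_description} for the edits one item needs.

    `descriptions` maps language codes to the item's current descriptions.
    """
    changes = {}
    for lang, (old, new) in REPLACEMENTS.items():
        # Replace only an exact match, so hand-written text is left alone.
        if descriptions.get(lang) == old:
            changes[lang] = new
    for lang, text in ADD_IF_MISSING.items():
        if not descriptions.get(lang):
            changes[lang] = text
    return changes
```

An item whose descriptions are already correct produces an empty dict, so the bot can skip the edit entirely.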
This section was archived on a request by: Jura 08:14, 8 April 2016 (UTC)

Values with units

While we don't have this functionality in QuickStatements, I need some bot help to add a few hundred length (P2043) values. If somebody specifies the format needed, I can generate the list in a suitable format (that isn't a problem for me). --Edgars2007 (talk) 17:30, 15 March 2016 (UTC)

@Edgars2007: I can help with this. CSV, JSON or XML with entity, value, unit would be fine for me. -- T.seppelt (talk) 18:44, 7 April 2016 (UTC)
@T.seppelt: TSV also will be fine? :) Here they are (not so much, but for first time will be fine). So they all are metre (Q11573) (third column). --Edgars2007 (talk) 07:01, 8 April 2016 (UTC)
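A rough sketch of how such a three-column TSV (entity, value, unit) could be turned into quantity values is given below. It assumes the standard Wikibase quantity layout (a signed decimal string plus a unit given as an entity URI); the function name and the item IDs in the usage note are hypothetical, and this is not T.seppelt's actual import code.

```python
import csv
import io
from decimal import Decimal

ENTITY_URI = "http://www.wikidata.org/entity/"


def tsv_to_quantities(tsv_text, prop="P2043"):
    """Parse 'entity<TAB>value<TAB>unit' rows into quantity datavalues."""
    rows = []
    for row in csv.reader(io.StringIO(tsv_text), delimiter="\t"):
        if not row:
            continue  # skip blank lines
        entity, value, unit = row
        amount = Decimal(value)  # raises on malformed numbers
        rows.append({
            "entity": entity,
            "property": prop,
            "datavalue": {
                "type": "quantity",
                "value": {
                    # Wikibase stores amounts as signed decimal strings.
                    "amount": f"{amount:+f}",
                    "unit": ENTITY_URI + unit,
                },
            },
        })
    return rows
```

A line such as `Q1001<TAB>12.5<TAB>Q11573` becomes a length value of `+12.5` with the metre item as its unit.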
@Edgars2007:   Done -- T.seppelt (talk) 07:56, 8 April 2016 (UTC)

@T.seppelt: Could you add also these? You can add imported from Wikimedia project (P143)=Latvian Wikipedia (Q728945) to all of them. --Edgars2007 (talk) 16:29, 8 April 2016 (UTC)

Yes, sure. I'll do it probably tomorrow. – T.seppelt (talk) 17:31, 8 April 2016 (UTC)
@Edgars2007: everything's   Done. – T.seppelt (talk) 13:07, 10 April 2016 (UTC)
This section was archived on a request by: Edgars2007 (talk) 17:44, 10 April 2016 (UTC)

So apparently a bot has created ~half a million pages on https://ceb.wikipedia.org since November. At least some of them have language links. Could someone please run the "usual bot" to add them to the Wikidata item? --Magnus Manske (talk) 18:19, 15 March 2016 (UTC)

@Ladsgroup: Maybe something for your bot? --Pasleim (talk) 17:01, 25 March 2016 (UTC)
Definitely, I'm running my bot now. It's cleaning this wiki ATM. Amir (talk) 17:57, 26 March 2016 (UTC)
This section was archived on a request by: Innocent bystander (talk) 17:30, 26 July 2016 (UTC)

Integrate languages

There are probably tens of thousands of minority language wikipedia articles that are not integrated into the wider interwiki linkage of more prominent languages. This is because several non-tech savvy editors find it too technical to figure out how to integrate the languages. I have even had such difficulties myself a while ago. This is a problem especially prominent among IP page creators. Therefore I propose some measures be taken to help novice editors to integrate the languages. My suggestion is to create a bot that automatically converts the old form (i.e. "en.articletitle") into the newer format. Example problem page.  – The preceding unsigned comment was added by 92.6.184.213 (talk • contribs) at 16:21, 16 March 2016‎ (UTC).
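The conversion proposed above starts with finding old-style interlanguage links in the page text. A minimal sketch of that step, assuming the old form is wikitext like `[[en:Article title]]`, is shown here; the function name and the `known_codes` parameter are hypothetical, and a real bot would still need to resolve each link to an item before editing anything.

```python
import re

# Matches [[xx:Title]] but not [[:xx:Title]] (an inline link) or
# [[Category:...]] (a namespace, which starts with an uppercase letter).
INTERWIKI_RE = re.compile(
    r"\[\[\s*([a-z]{2,3}(?:-[a-z]+)*)\s*:\s*([^\[\]|]+?)\s*\]\]"
)


def extract_interwiki_links(wikitext, known_codes):
    """Return {site_id: title} for old-style interlanguage links.

    known_codes: language codes accepted as interwiki prefixes.
    """
    links = {}
    for code, title in INTERWIKI_RE.findall(wikitext):
        if code in known_codes:
            # Site IDs use underscores where language codes use hyphens.
            site = code.replace("-", "_") + "wiki"
            links.setdefault(site, title.replace("_", " "))
    return links
```

The resulting site-to-title mapping is what would be handed to whatever step attaches the page to an existing item or creates a new one.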

@Ladsgroup: Maybe something for your bot? --Pasleim (talk) 17:01, 25 March 2016 (UTC)
Yes, but can you give me list of language code for those wikis? Amir (talk) 18:02, 26 March 2016 (UTC)
I have created items from all unlinked articles created before ~June 2016 in all open Wikipedias except nlwiki, svwiki, cebwiki, warwiki and wikis whose language code contains a hyphen. Pages still not connected are candidates to link to other items and should be linked manually.--GZWDer (talk) 17:21, 26 July 2016 (UTC)
This section was archived on a request by: GZWDer (talk) 17:21, 26 July 2016 (UTC)

Talk pages of constraint reports

It would be useful to have the talk page of each constraint report redirected to the talk page of the respective property.

For example, I have just redirected Wikidata talk:Database reports/Constraint violations/P2611 to Property talk:P2611. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 14:17, 25 March 2016 (UTC)

@ValterVB: Is this something your bot could do? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:30, 15 April 2016 (UTC)

@Pigsonthewing: You mean that my bot must create the discussion page with a redirect for all these pages? It is probably possible, I must check, but first it is necessary to fix these talk pages @Pigsonthewing: --ValterVB (talk) 12:12, 15 April 2016 (UTC)
@Jura1, Pigsonthewing: Yes I can do it. I can start? --ValterVB (talk) 12:33, 15 April 2016 (UTC)
Is it necessary? There are also another ways how to prevent creating new discussions there. Matěj Suchánek (talk) 13:25, 15 April 2016 (UTC)
Yes, it is necessary. It is not just about preventing discussion on under-watched pages, but also directing people to the best place for those discussions. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:27, 15 April 2016 (UTC)
[ec] Thank you. I have cleared all the existing pages (only ten, which shows how little call there is for them). So far as I am concerned, this should proceed ASAP, but it may be best to get third parties opinions. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:26, 15 April 2016 (UTC)

Ten pages, on most of which old comments or questions went unanswered. We can do better. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:38, 15 April 2016 (UTC)

If there is one thing which I really abhor about Wikipedia projects, it is users who have nothing useful to do, and then go around enforcing their views of what they find to be pretty, completely disregarding how much this disrupts the project. - Brya (talk) 17:52, 15 April 2016 (UTC)
You acted first and moved the subpages. Afterwards you added a „hint” to justify your action. No you didn't talk to the people using this page. --Succu (talk) 18:06, 15 April 2016 (UTC)
The "work" with homonyms is quite questionable: "violations of Wikipedia policy (at least 50% fictitious taxa)"... --Averater (talk) 05:32, 18 April 2016 (UTC)
The natural place to discuss constraint reports is on the Talk pages of the constraint reports, while the natural place to discuss properties is on the Talk pages of the properties. These are quite different topics. - Brya (talk) 05:50, 18 April 2016 (UTC)
Allow me to remind you where the constraints, which determine what is in the constraint reports, are set: on the property talk pages. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:22, 19 April 2016 (UTC)
Which probably is a bad place for these constraints in the first place. Here, I expect to see discussions, but most of these edits are changes to templates. Even Nikkis +8000-edit is a change of a template, not the reply in a discussion I expected it to be. -- Innocent bystander (talk) 10:49, 19 April 2016 (UTC)
Indeed - a better solution would be to have pages like Property:P:496/Constraints (or Property:P:496/Documentation), with Property talk:P:496/Constraints redirected to Property talk:P:496. But we are where we are. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:24, 19 April 2016 (UTC)
And then, I suppose I have to point out that the constraint reports are at the constraint report pages ... - Brya (talk) 11:19, 19 April 2016 (UTC)
Brya, please stop feeding it. This is more of a to-do list and, as Multichill noted, not a place for big debates.
--- Jura 11:58, 19 April 2016 (UTC)
@Jura: That was an unnecessary insult. Please stop that. Innocent and Andy are giving constructive comments of how to improve the situation which should be taken seriously. --Averater (talk) 06:28, 22 April 2016 (UTC)
There is no such thing as a necessary insult. Besides, there is nothing insulting about saying that a debate that isn't at its place shouldn't be fed any further.
--- Jura 07:27, 22 April 2016 (UTC)
I was reacting since you and Andy have been kind of harsh towards each other and thought you were referring to him with your comment. But my apologies since you meant the discussion. --Averater (talk) 05:32, 23 April 2016 (UTC)
By now, it is clear enough that Averater is looking for opportunities to disrupt the project. Just ignore him. - Brya (talk) 10:42, 22 April 2016 (UTC)

  Not done no consensus --Pasleim (talk) 13:28, 11 July 2016 (UTC)

As noted above, no substantive arguments against this proposal have been made. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:58, 12 July 2016 (UTC)
Close this as   Not done -- Innocent bystander (talk) 17:31, 26 July 2016 (UTC)
This section was archived on a request by: Innocent bystander (talk) 17:31, 26 July 2016 (UTC)

Canadian lakes

@Laurianna2, VIGNERON: As mentioned on WD:Bistro#Liens interlangues, user:Lsj has created more than 50k items about Canadian lakes in svwiki and they need to be linked to Wikidata. Relevant category: sv:Kategori:Insjöar i Kanada. Sample article: sv:Étang de Hart.

We should

  • link articles:
  1. create a new item when the article does not correspond to an existing label (parentheses excluded)
  2. when the title matches an item, either list them to check it by hand, or given that there will be hundreds of them, devise an algorithm to determine if it refers to the same lake based on coordinates.
  • add data
  1. P31: lake (Q23397) and P17:Q16
  2. P131 from the "region" param and GeoNames ID (P1566) from the geonames param of sv:Template:Geobox.
  3. add labels at least in English and French that should be equal to the Wikipedia article. When the title starts with "Lac .. ", "Baie ", "Bassin " or "étang" the first letter should be lower-cased, at least in French.
  4. elevation above sea level (P2044) and coordinate location (P625) from the Infobox, GeoNames or wherever.
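Two of the steps above lend themselves to a short sketch: deciding from coordinates whether an article and an existing item describe the same lake, and deriving the French label from the article title. The code below is one possible approach, not an agreed method: the haversine formula, the 1 km threshold, and all function names are assumptions chosen for illustration.

```python
import math
import re

EARTH_RADIUS_KM = 6371.0
# Title prefixes whose first letter is lower-cased in the French label.
LOWERCASE_PREFIXES = ("Lac ", "Baie ", "Bassin ", "Étang ")


def distance_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points (haversine formula)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def probably_same_lake(a, b, max_km=1.0):
    """Compare two (lat, lon) pairs; max_km is an arbitrary threshold."""
    return distance_km(*a, *b) <= max_km


def french_label(title):
    """Drop a trailing parenthetical and lower-case generic first words."""
    label = re.sub(r"\s*\([^()]*\)\s*$", "", title)
    if label.startswith(LOWERCASE_PREFIXES):
        label = label[0].lower() + label[1:]
    return label
```

Pairs that fall outside the threshold would go to the list for manual checking rather than being linked automatically, and large lakes may need a wider threshold than small ponds.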

Most of that can be done with creator.html, autolist, and harvesttemplates, but I think adding labels requires a real bot. If someone can do the whole thing in one go, that would probably be best. -Zolo (talk) 09:34, 30 January 2016 (UTC)

Add Names as labels (Q21640602) can work for labels. It seems that svwiki prefers that we wait a month or so after they created such stubs. Apparently it could happen that they delete entire bot created sets.
--- Jura 09:39, 30 January 2016 (UTC)
Even if the article ends up being deleted in svwiki, I think it makes sense to have the items.
I didn't know Add Names as labels (Q21640602), that could do the job. So, I guess I can do it with the standard tools, but that will require more than 10 edits per item, so a flood of more than 500k edits in all. Maybe it is better to wait for a bot that does it in fewer edits? --Zolo (talk) 09:57, 30 January 2016 (UTC)
Well, if they delete it, I don't think we want to have it either. It's something Innocent bystander mentioned on some of the other series. As for the number of edits, I'm not sure if it matters.
--- Jura 10:31, 30 January 2016 (UTC)
Yes, I thought about using robots for this project. Where did you see that svwiki wanted to delete this robot's stubs? Btw, I've noticed a lot of mistakes in GeoNames, and it seems that the site has not been updated since December.--Laurianna2 (talk) 19:41, 4 February 2016 (UTC)
If pages like these are deleted on svwiki, it is most likely because they have found mistakes in the database or because the quality of the data is poor. Pages are also deleted when Lsj finds mistakes in the bot code. It is then sometimes easier to delete the pages and restart the bot. That is why I recommend you wait a month. One problem we have detected in Canada is that there are often duplicate items in GeoNames: one item with an English name and one item with the French name. -- Innocent bystander (talk) 07:15, 2 March 2016 (UTC)

Import names in Latin script from kowiki

There are a few items for persons that link only to kowiki and don't have labels in English. Samples:

The articles for these in kowiki have names in Latin script (or other scripts) defined in the introduction.

This could be imported to Wikidata as label or alias.

The two samples were already merged as they appeared on the report for identical birth and death dates. --- Jura 10:54, 14 November 2015 (UTC)

All samples have been merged now. Maybe these items were all redundant? --Pyfisch (talk) 19:41, 27 December 2015 (UTC)
Does it matter? I'd assume there are other items without labels and articles that include names in Latin script at kowiki. No need for a bot for 3 items ;) --- Jura 10:37, 28 December 2015 (UTC)

Maybe the following could work for this:

  • Generate a list of items for people that don't have labels in a series of languages (including en), but (e.g.) kowiki
  • Maybe exclude items that already meet some other criteria
  • Scan these articles for names in Latin script at predefined places
  • Present the result in a browser like the ones for dates of birth/death to confirm by a user.
    --- Jura 09:06, 16 March 2016 (UTC)
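The "scan for names in Latin script at predefined places" step could look roughly like the sketch below. It assumes the predefined place is the first parenthetical in the article's opening sentence, which is only one plausible convention; the function name is hypothetical and the Korean sample sentence in the usage note is invented for illustration.

```python
import re

# A run of Latin letters, optionally with spaces, hyphens, dots or
# apostrophes inside, that starts with a letter and ends with a letter or dot.
LATIN_NAME_RE = re.compile(
    r"[A-Za-zÀ-ÖØ-öø-ÿ][A-Za-zÀ-ÖØ-öø-ÿ.'\- ]*[A-Za-zÀ-ÖØ-öø-ÿ.]"
)


def latin_name_candidates(intro):
    """Return Latin-script name candidates found inside parentheses."""
    candidates = []
    for inner in re.findall(r"\(([^()]*)\)", intro):
        match = LATIN_NAME_RE.search(inner)
        if match:
            candidates.append(match.group())
    return candidates
```

For an invented intro such as `홍길동(Hong Gil-dong, 1900년 ~ 1950년)은 예시 인물이다.` this returns `["Hong Gil-dong"]`. The candidates are meant to feed the confirmation step in the last bullet, not to be imported blindly.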

Import number of state representatives in Congress

Please import the total number of seats in the US House of Representatives, as listed in wikipedia:List_of_U.S._states_and_territories_by_population#States_and_territories table, into each US state. I think Property:P1410 is a perfect candidate for that, as it requires a qualifier stating that this is related to the US House of Reps. Also, it would be amazing to do the same for other similar legislatures, like the European Parliament. And lastly, historical data is always amazing, if one could find it (this data is connected to the US census). Having this data would allow interesting political visualizations like these demos. --Yurik (talk) 05:05, 17 February 2016 (UTC)

Property:P1410 does not seem suitable to me in this context because the most straightforward interpretation is the number of seats in the state legislature, not the US House of Representatives. It could also be interpreted as the number of seats the state has in the US Senate and US House of Representatives combined. Further confusion results because each state has its own name for its legislature and the houses that make up the legislature. Jc3s5h (talk) 13:07, 6 March 2016 (UTC)
Jc3s5h, I think that's why that property has a mandatory Property:P194 qualifier. So for my request, you can make 3 values in each state: US Congress, US Senate (2 each), and US House of representatives. --Yurik (talk) 09:59, 13 March 2016 (UTC)
I'm not familiar with mandatory qualifiers. Will the UI or the API prevent the storage of entries that lack the mandatory qualifier? Jc3s5h (talk) 13:05, 13 March 2016 (UTC)
Jc3s5h, seems that even though it is "required", the only enforcement comes from the bots at this point. I added a sample entry Q99 - seems to look good. Would be great to automate the import, plus it would be amazing if the historical numbers are also added (they kept changing throughout the history based on the population) --Yurik (talk) 21:39, 13 March 2016 (UTC)
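Since nothing in the UI or API enforces the qualifier, a bot doing this import would have to enforce it itself. The sketch below builds a P1410 statement that always carries a P194 qualifier and checks existing statements for it. It assumes the standard Wikibase statement JSON layout; the function names are hypothetical, and the body item ID is left as a parameter because the thread does not name it.

```python
def seats_statement(seats, body_item):
    """Build a P1410 statement qualified with P194 (the legislative body).

    body_item: item ID of the legislature the seats belong to; it must be
    supplied by the caller.
    """
    return {
        "type": "statement",
        "mainsnak": {
            "snaktype": "value",
            "property": "P1410",
            "datavalue": {
                "type": "quantity",
                "value": {"amount": f"+{seats}", "unit": "1"},
            },
        },
        "qualifiers": {
            "P194": [{
                "snaktype": "value",
                "property": "P194",
                "datavalue": {
                    "type": "wikibase-entityid",
                    "value": {"entity-type": "item", "id": body_item},
                },
            }],
        },
    }


def lacks_mandatory_qualifier(statement, qualifier="P194"):
    """Return True when the statement has no `qualifier` at all."""
    return not statement.get("qualifiers", {}).get(qualifier)
```

Each state would then get one such statement per body, which keeps the House, Senate and combined figures distinguishable.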

Import date of birth (P569)/date of death (P570) from Wikipedia

Lang    2007
→ ja    334
[en]    326 (~5%)
⇒ ru    124
⇒ uk    121
→ zh    116
pt      115
es      103
→ ar    100
fr      91
hu      83
tr      79
→ ko    56
id      52
et      49
fi      44
→ el    40
→ th    34

Wikidata:Database reports/Deaths at Wikipedia lists items with dates of death at Wikipedia (10-15% of all). Some dates in articles of most languages are likely to be suitable for import by bot. For other dates, the only formatted part may be the year of death category. --- Jura 08:06, 2 August 2015 (UTC)

@Multichill: didn't you have a script for this? It only works when there is a strict format, yes. Sjoerd de Bruin (talk) 18:24, 3 August 2015 (UTC)
  Strong oppose to importing the same data from the same Wikipedia a second time. Any kind of automatic and repeatable Wikipedia->Wikidata copying makes all other Wikipedias vulnerable to mistakes (and vandalism) in a single one. -- Vlsergey (talk) 19:55, 3 August 2015 (UTC)
None of these pages currently have P570 defined, thus it's not a matter of re-import. Many articles may only exist in 1 language. --- Jura 21:11, 3 August 2015 (UTC)
1. "Reimport" is not about statements, but about project+property. Having p570 imported from any wiki, it shall not be reimported. Especially not on scheduled/automated basis. Arguments above. 2. I'm okay with single time import of P569/P570 from those projects. -- Vlsergey (talk) 15:03, 4 August 2015 (UTC)
I agree that it shouldn't be done for the current year on an automated basis. If you look at "date added" column on the lists, you will notice that most entries are fairly old. --- Jura 08:30, 5 August 2015 (UTC)
Looking at en:Patri J. Pugliese it seems that the formatted version is fairly recent (2014), en:Victoria Arellano has persondata since 2010, pt:Joaquim Raimundo Ferreira Chaves since 2011. en:Mark Abramson since February 2013, but only the DOB got imported. tr:Yasemin Esmergül has the dates in the article lead. In any case, we can validate the year for P570. Maybe someone can assess ja,zh,uk, etc. To the right, the most frequent ones on the list for 2007. --- Jura 21:11, 3 August 2015 (UTC)
Persondata in en.WP is deprecated and should not be relied on. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:06, 4 August 2015 (UTC)
Can you provide references for your claims? Thanks. --- Jura 10:26, 4 August 2015 (UTC)
Discussion of persondata: RfC: Should Persondata template be deprecated and methodically removed from articles? Jc3s5h (talk) 11:33, 4 August 2015 (UTC)
The conclusion mentioned in the link only supports Pigsonthewing's first claim. How about the second? --- Jura 11:37, 4 August 2015 (UTC)
Q8078 refers. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:41, 4 August 2015 (UTC)
Funny. Wasn't it deprecated because Wikidata could hold the data rather than for data quality reasons? --- Jura 08:30, 5 August 2015 (UTC)
No. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:43, 6 August 2015 (UTC)
Any progress on the missing reference? --- Jura 04:34, 8 August 2015 (UTC)
In reply to @Sjoerddebruin: yes, I imported dates of birth and death in the past, and I was certainly not the only one. I'm quite confident the persondata template on the English Wikipedia was scraped into Wikidata quite some time ago, so I don't think there is much data left to scrape from that corner. My focus was on items about humans with a link to the Dutch Wikipedia but without a date of birth. I used regular expressions to extract the date of birth from the introduction of the article. You could do that for other languages too. You just need to start conservatively and expand a bit in each iteration. I was able to import thousands of birth dates this way. Multichill (talk) 17:17, 4 August 2015 (UTC)
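A minimal sketch of the "start conservative" regex approach described above, assuming a Dutch lead of the form "Name (Place, day month year ...". The pattern, the function name `extract_birth_date` and the sample sentence are illustrative assumptions, not the script that was actually used.

```python
import re
from datetime import date

# Dutch month names mapped to month numbers.
NL_MONTHS = {
    "januari": 1, "februari": 2, "maart": 3, "april": 4,
    "mei": 5, "juni": 6, "juli": 7, "augustus": 8,
    "september": 9, "oktober": 10, "november": 11, "december": 12,
}

# Conservative on purpose: only accept "(<place>, <day> <month> <year>"
# directly after an opening parenthesis. Anything else is skipped.
BIRTH_RE = re.compile(
    r"\(\s*[^,()0-9]+,\s*(\d{1,2})\s+("
    + "|".join(NL_MONTHS)
    + r")\s+(\d{4})\b"
)


def extract_birth_date(intro):
    """Return the birth date found in the lead, or None if unsure."""
    match = BIRTH_RE.search(intro)
    if match is None:
        return None
    day = int(match.group(1))
    month = NL_MONTHS[match.group(2)]
    year = int(match.group(3))
    try:
        return date(year, month, day)
    except ValueError:
        # Impossible calendar dates such as "31 februari" are rejected.
        return None


# Made-up sample lead, for illustration only.
sample = "Jan Jansen (Amsterdam, 3 maart 1950 – Utrecht, 5 mei 2010) was een schrijver."
print(extract_birth_date(sample))
```

Leads that do not fit the pattern return `None` and are simply left for a later, slightly broader iteration.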
Thanks for your helpful feedback. enwiki might indeed be mostly done. For the sample year 2007 in the table above, it's just 5%. BTW nl is not on the reports as there are no nl categories for persons by year of death. --- Jura 08:30, 5 August 2015 (UTC)
Actually of the 326 for enwiki, 300 do have persondata. --- Jura 08:36, 5 August 2015 (UTC)

Today I imported the birth and death dates of people who died in 2000 by parsing the introductory sentence of the English article. If the edits [1] are okay, I could continue with other years and other languages. I take care not to import dates before 1924, and I will not run the script twice on the same article. --Pasleim (talk) 18:42, 12 August 2015 (UTC)
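The two safeguards mentioned above (no dates before 1924, never processing the same article twice) could be sketched as follows. The names `CUTOFF_YEAR`, `should_import` and `already_processed` are hypothetical, and how the set of processed titles is stored between runs is left open.

```python
from datetime import date

# Dates before this year are skipped (cutoff taken from the comment above).
CUTOFF_YEAR = 1924


def should_import(title, parsed_date, already_processed):
    """Decide whether a parsed date for an article may be imported.

    title             -- article title (used as the de-duplication key)
    parsed_date       -- datetime.date, or None if parsing failed
    already_processed -- set of titles handled in earlier runs
    """
    if title in already_processed:
        return False  # never run twice on the same article
    if parsed_date is None:
        return False  # nothing reliable was parsed
    if parsed_date.year < CUTOFF_YEAR:
        return False  # skip early dates
    return True


processed = {"Example A"}
print(should_import("Example A", date(1950, 1, 1), processed))  # already handled
print(should_import("Example B", date(1910, 6, 2), processed))  # before cutoff
print(should_import("Example C", date(1950, 1, 1), processed))  # eligible
```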

Thanks! I checked 10 and they were all fine. All but 2 or 3 had the same dates in the infobox and/or persondata too.
I noticed that many trwiki articles have a person infobox, maybe this could be imported as well. --- Jura 11:04, 15 August 2015 (UTC)
That was quick. Good work! It did reduce the numbers a bit. It might be worth applying the same method to some of the templates mentioned for enwiki.
The infobox in trwiki doesn't seem that frequent, but for ptwiki, I found that many use pt:Template:dni/pt:Template:Nascimento and pt:Template:Morte or pt:Template:morte e idade/pt:Template:Falecimento e idade. This is done in infoboxes or the article text. --- Jura 07:11, 16 August 2015 (UTC)
I did some from pt:Template:Morte. --- Jura 07:21, 17 August 2015 (UTC)
pt:Template:morte e idade/pt:Template:Falecimento e idade done as well. --- Jura 09:21, 17 August 2015 (UTC)
  • I had a look at 2009: Most frequent languages are: ar 125, uk 116, en 114, es 109, ru 99, hu 86
For ukwiki, of 10 articles, 6 had an infobox (5 different ones: the uk ones from Template:Infobox ice hockey biography (Q5650114), Template:Infobox scientist (Q5624818), Template:Infobox architect (Q10973090), Template:Infobox person (Q6249834), Template:Infobox artist (Q5914426) normally in the format dd.mm.yyyy), the other 4 had the dates in the beginning of the text in Cyrillic. --- Jura 10:37, 31 August 2015 (UTC)
For ukwiki, I just imported the dates from uk:Template:Особа. --- Jura 13:33, 7 September 2015 (UTC)
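For the dd.mm.yyyy infobox values mentioned above, a strict parse is one way to avoid importing anything ambiguous. This is only a sketch; the function name `parse_infobox_date` is hypothetical and it says nothing about how the actual import was done.

```python
from datetime import datetime


def parse_infobox_date(value):
    """Parse a 'dd.mm.yyyy' infobox value; return a date or None."""
    try:
        return datetime.strptime(value.strip(), "%d.%m.%Y").date()
    except ValueError:
        # Year-only values, free text, or impossible dates are skipped.
        return None


print(parse_infobox_date("05.03.1948"))
print(parse_infobox_date("1948"))
```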

Given that we might have exhausted the bot approach, I made a request at Topic:Spgr35wayo8zy15y. --- Jura 06:20, 24 September 2015 (UTC)

Is there any way to import date of birth (P569) and date of death (P570) from the Slovenian Wikipedia? We are at the halfway point in updating our infoboxes with Wikidata. We have 2 tracking categories that include articles with birth and death dates that are not yet in Wikidata (birth: sl:Category:Lokalnega datuma rojstva še ni v Wikipodatkih and death: sl:Category:Lokalnega datuma smrti še ni v Wikipodatkih). Our biographical articles have a special introductory phrase (examples: * (?), ....., † 29. marec 1770, ... or * 7. junij 1707, ...., † 2. januar 1770, or just the year † 1770, or unknown † ?). We first want to transfer the dates to Wikidata and then continue with cleaning our infoboxes. Afterwards we will update the other half of our infoboxes, and for that a subsequent data import will be needed. --Pinky sl (talk) 11:11, 17 March 2016 (UTC)
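The introductory phrase described above is regular enough that it could be parsed with a small regular expression. The following is a sketch under that assumption; `parse_marker` and the sample sentence (including its place names) are made up for illustration, and the result carries no calendar information.

```python
import re

# Slovenian month names mapped to month numbers.
SL_MONTHS = {
    "januar": 1, "februar": 2, "marec": 3, "april": 4,
    "maj": 5, "junij": 6, "julij": 7, "avgust": 8,
    "september": 9, "oktober": 10, "november": 11, "december": 12,
}

# Optional "<day>. <month> " followed by a 3- or 4-digit year.
DATE = r"(?:(\d{1,2})\.\s*(" + "|".join(SL_MONTHS) + r")\s+)?(\d{3,4})"


def parse_marker(intro, marker):
    """Return (year, month, day) after '*' or '†', or None if unknown.

    month and day are None when only the year is given.
    """
    match = re.search(re.escape(marker) + r"\s*" + DATE, intro)
    if match is None:
        return None  # covers "* (?)" and "† ?"
    day, month, year = match.groups()
    if day is None:
        return (int(year), None, None)
    return (int(year), SL_MONTHS[month], int(day))


# Made-up sample in the format quoted above.
intro = "* 7. junij 1707, Ljubljana, † 2. januar 1770, Dunaj"
print(parse_marker(intro, "*"))
print(parse_marker(intro, "†"))
```

Year-only values come back with `None` for month and day, so the caller can choose a matching precision, and unknown dates ("?") are skipped entirely.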

You could try to do part of it with Harvesttemplates, e.g. [2]
--- Jura 11:29, 17 March 2016 (UTC)
You mention dates from the period during which Europe was transitioning from the Julian to the Gregorian calendar, but you don't mention the data having any calendar indication. Thus I would suggest you not import any dates before 1924. Jc3s5h (talk) 12:08, 17 March 2016 (UTC)
Ok, thanks, will see what we can do. --Pinky sl (talk) 16:20, 18 March 2016 (UTC)