Hello, I see that you added some Bibliothèque nationale de France ID (P268) values, like here https://www.wikidata.org/w/index.php?title=Q20895083&diff=prev&oldid=246961749 It's nice, but there is a syntax problem with this identifier: we have to remove the "cb". The previous edit gives this http://catalogue.bnf.fr/ark:/12148/cbcb120526478 instead of http://catalogue.bnf.fr/ark:/12148/cb120526478 . I have corrected this one. Best regards. --Shonagon (talk) 22:35, 31 August 2015 (UTC)

Correction done for the others. Regards --Shonagon (talk) 22:47, 31 August 2015 (UTC)
Thanks for the note. It appears that User:KrBot fixes them. I will remove the cb when I add new ones. --Bamyers99 (talk) 22:54, 31 August 2015 (UTC)
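
A minimal sketch of the fix described in this thread, assuming the stored value should be the identifier without its "cb" prefix and that the link is built by prepending http://catalogue.bnf.fr/ark:/12148/cb (as in the URLs quoted above). The function names are hypothetical and are not part of any Wikidata tool.

```python
BNF_URL_PREFIX = "http://catalogue.bnf.fr/ark:/12148/cb"


def normalize_bnf_id(raw: str) -> str:
    """Return the identifier without a leading 'cb' (hypothetical helper)."""
    value = raw.strip()
    # Strip the prefix once only; the formatter URL adds it back.
    if value.lower().startswith("cb"):
        value = value[2:]
    return value


def bnf_catalogue_url(raw: str) -> str:
    """Build the catalogue link from a value that may or may not carry 'cb'."""
    return BNF_URL_PREFIX + normalize_bnf_id(raw)
```

With this, both "cb120526478" and "120526478" lead to the same catalogue URL, so the doubled "cbcb" form cannot occur.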

Invitation to Wikidata user study

Dear Bamyers99,
I am a researcher of the Web and Internet Science group of the University of Southampton.
Together with a group of other researchers from the same university, we are currently conducting research to discover how newcomers become full participants in the Wikidata community. We are interested in understanding how the usage of tools, the relationships with the community, and the knowledge and application of policy norms change from users' first approach to Wikidata to their full integration as fully active participants.
This study will take place as an interview, either by videotelephony, e.g. Skype, phone, or e-mail, according to the preference of the interviewees. The time required to answer all the questions will likely be about an hour. Further information can be found on the Research Project Page Becoming Wikidatians: evolution of participation in a collaborative structured knowledge base.
Any data collected will be treated in the strictest confidentiality; no personal information will be processed for the purpose of the research. The study, which has submission number 20117, has received ethical approval following the University of Southampton guidelines.
We aim to gather about 20 participants, chosen from among experienced Wikidata users who have authored a large number of contributions.
Should you be interested in taking part or wish to receive further information, you can contact us by writing to the e-mail address ap1a14+wikidata_user_study@ecs.soton.ac.uk.
Thank you very much, your help will be much appreciated!
--Alessandro Piscopo (talk) 09:07, 26 May 2016 (UTC)

Data behind Wikidata Class Browser

I tried out the Wikidata Class Browser, which is interesting, especially for the hierarchy of the classes. But I am more interested in the number of direct and indirect instances of the most used classes. Can you share the full list of all 1.3 million classes? Then I could filter out the most used ones myself, or maybe aggregate some of them into reasonable groups. --Zuphilip (talk) 15:11, 25 November 2017 (UTC)

@Zuphilip: The raw data is available at wdsubclasstotals.tsv (file size 28 M).
Layout: qid, root (Y/N), direct child count, indirect child count, direct instance count, indirect instance count, is list of count
--Bamyers99 (talk) 18:05, 25 November 2017 (UTC)
Thank you very much! I extracted from these data the list I wanted: https://www.wikidata.org/wiki/User:Zuphilip/wd-statistics --Zuphilip (talk) 22:26, 25 November 2017 (UTC)
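
For anyone else wanting to filter the file, here is a rough sketch that reads rows in the layout given above (qid, root Y/N, direct child count, indirect child count, direct instance count, indirect instance count, is list of count). It assumes tab-separated rows of exactly seven fields; the class and function names are hypothetical, and ranking by direct plus indirect instances is only one possible choice, since the thread does not say whether the indirect count already includes the direct one.

```python
import csv
from typing import Callable, Iterable, Iterator, List, NamedTuple, Optional


class ClassTotals(NamedTuple):
    qid: str
    is_root: bool
    direct_children: int
    indirect_children: int
    direct_instances: int
    indirect_instances: int
    list_of_count: int


def parse_rows(lines: Iterable[str]) -> Iterator[ClassTotals]:
    """Yield one record per well-formed TSV row; skip anything else."""
    for rec in csv.reader(lines, delimiter="\t"):
        if len(rec) != 7:
            continue
        qid, root, *counts = rec
        # Skips a header row (if the file has one) and malformed counts.
        if not all(c.isdigit() for c in counts):
            continue
        yield ClassTotals(qid, root == "Y", *map(int, counts))


def most_used(
    rows: Iterable[ClassTotals],
    n: int = 10,
    key: Optional[Callable[[ClassTotals], int]] = None,
) -> List[ClassTotals]:
    """Return the n classes with the highest score (default: all instances)."""
    if key is None:
        key = lambda r: r.direct_instances + r.indirect_instances
    return sorted(rows, key=key, reverse=True)[:n]
```

Typical use would be `most_used(parse_rows(open("wdsubclasstotals.tsv", encoding="utf-8")), n=100)`, passing a different `key` to rank by direct instances only.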

Find A Grave cemetery ID

Are you adding the missing ones by hand, or using an automated tool? Have you thought about us importing every cemetery from Findagrave that we do not have already? --RAN (talk) 04:43, 12 February 2018 (UTC)

@Richard Arthur Norton (1958- ): I was just working on the Find a Grave memorial ID constraint violations and Find A Grave cemetery ID constraint violations. I do, however, plan on adding P2025 to items that have a Find A Grave Cemetery link in enwiki (and other large wikis) but lack P2025, based on this quarry query. --Bamyers99 (talk) 15:04, 12 February 2018 (UTC)
I would love to construct a bot to add every cemetery not represented in Wikidata; adding them one at a time is tedious. How would I go about starting that project? Do I suggest it somewhere? 10 years ago Findagrave had a lot of duplicates under various synonyms; now they are all sorted out. --RAN (talk) 15:15, 12 February 2018 (UTC)
@Richard Arthur Norton (1958- ): Here is a good place to start: Wikidata:Data donation. --Bamyers99 (talk) 19:44, 12 February 2018 (UTC)

CWGC data editing

Hi there. I left a message on your talk page on en-Wikipedia, but realised that here might be a better place to discuss. See here. Carcharoth (talk) 15:11, 5 March 2018 (UTC)

@Carcharoth: I am working from two different lists to add CWGC ids to Wikidata. 1) I downloaded the CWGC website casualty list and matched on name and date of death with Wikidata name and date of death. That task is complete. 2) I am now matching on external links from enwiki to CWGC. I have about 1000 more links to check. Thanks for fixing the CWGC property mixups. The mixups would have eventually shown up in the CWGC burial ground ID constraint report or the CWGC person ID constraint report. --Bamyers99 (talk) 15:50, 5 March 2018 (UTC)
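
As an illustration only, matching on name and date of death could look roughly like the sketch below. This is not the script that was actually used; the record shapes, the normalization rule and all names and identifiers in it are assumptions. Only unique matches are accepted, and ambiguous ones are set aside for manual review.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Each record is (identifier, name, date of death as an ISO string).
Record = Tuple[str, str, str]


def normalize_name(name: str) -> str:
    """Collapse whitespace and ignore case (assumed normalization)."""
    return " ".join(name.split()).casefold()


def match_records(
    cwgc_rows: Iterable[Record], wikidata_rows: Iterable[Record]
) -> Tuple[Dict[str, str], Dict[str, List[str]]]:
    """Return (unique matches, ambiguous matches) keyed by CWGC id."""
    index: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for qid, name, died in wikidata_rows:
        index[(normalize_name(name), died)].append(qid)

    matches: Dict[str, str] = {}
    ambiguous: Dict[str, List[str]] = {}
    for cwgc_id, name, died in cwgc_rows:
        qids = index.get((normalize_name(name), died), [])
        if len(qids) == 1:
            matches[cwgc_id] = qids[0]
        elif len(qids) > 1:
            ambiguous[cwgc_id] = qids
    return matches, ambiguous
```

A record whose date of death differs between the two sources simply produces no match here, so such cases would have to be found another way.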
Thanks for explaining. I will now know to ask around on Wikidata before taking on big mix-n-match tasks that are better done by those with the tools for it. :-) About the CWGC external links, some care is needed as the links will in many cases not match directly with a Wikidata entry. If you have time to look at en:w:User talk:Carcharoth/Pages using CWGC links and think about how to handle the uses where the match is not one-to-one, maybe we could ensure a consistent approach is taken? In many cases the cemeteries are not mentioned in articles on the cemeteries, but instead in articles on the general location. There are also list articles and various other categories of articles. On my attempt at mix-n-match, from the 1626 and 1708 (total: 3334) articles in those lists, there are around 693 that are direct matches, 934 that have a CWGC external link but no Wikidata value (these will mostly be articles on relatives of the casualties), and 1015 Wikidata values that don't have a CWGC link in the corresponding article (or at least no article in en-Wikipedia). This is not exact, as the article titles don't always match the Wikidata page name. One thing to be wary of is that the CWGC sometimes get the death date wrong, so there will be a few (rare) cases where the matches won't be found. Carcharoth (talk) 16:30, 5 March 2018 (UTC)
I am running into limitations with my ability to extract the data. Can you advise on how to get the names of the Wikipedia articles for the Wikidata pages listed here? There are too many cases of disambiguation for me to be able to match easily with the list of Wikipedia articles I have. Carcharoth (talk) 17:50, 5 March 2018 (UTC)
@Carcharoth: Is this query what you are looking for? Just press the blue button with the right facing triangle in it to run the query. --Bamyers99 (talk) 18:17, 5 March 2018 (UTC)
It might be. But the link doesn't work for me. I'll try and recreate the query from scratch. Carcharoth (talk) 18:23, 5 March 2018 (UTC)
# CWGC person ID with enwiki link
SELECT DISTINCT ?s ?sLabel ?article WHERE {
  ?s wdt:P1908 ?cwgcid .
  ?article schema:about ?s .
  ?article schema:inLanguage "en" .
  ?article schema:isPartOf <https://en.wikipedia.org/> .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
ORDER BY ?sLabel


--Bamyers99 (talk) 18:36, 5 March 2018 (UTC)

That works! Many thanks for that. May I ask if: (a) you have finished your latest batch of data addition; and (b) what further plans you have to work with this data? I have some ideas (particularly related to identifying the relatives of the deceased who have articles, and to including gravestone inscriptions and memorials/burial locations where appropriate), but don't want to overlap or duplicate efforts too much. Would one aim be to end up with a list generated from the data at some point? I also have plans to update WP articles which lack a CWGC reference (but you may be able to do that quicker than I can). I do also (based on the UK Parliamentary War Memorials) have a list (in my userspace) of bereaved MPs and bereaved peers from both World Wars that I intend to return to at some point, with notes on which ones are lacking a citation to the relevant CWGC page. Carcharoth (talk) 14:59, 6 March 2018 (UTC)
@Carcharoth: I am done adding CWGC ids to Wikidata. I have no plans to do any more CWGC related tasks. --Bamyers99 (talk) 17:55, 6 March 2018 (UTC)
Understood. Many thanks for the edits, which will be incredibly useful for working out where work is needed. I may come back with more queries (about the queries like the one you set up above), or I may ask at the help area here. Carcharoth (talk) 18:07, 6 March 2018 (UTC)


Hi Bamyers99,

The link to Gresset's CTHS page doesn't seem to work. What do you think is wrong?

Regards. Nomen ad hoc (talk) 18:42, 15 June 2018 (UTC).

@Nomen ad hoc: Wrong link, it is JUGLAR Joseph Clément for Clément Juglar (Q921074). --Bamyers99 (talk) 18:49, 15 June 2018 (UTC)
Yep, but curiously, [1] redirects towards Gresset's page... Nomen ad hoc (talk) 18:51, 15 June 2018 (UTC).
@Nomen ad hoc: I am not fluent in French, so I don't know what the difference is between the two types of pages. I am just working on constraint violation reports: Wikidata:Database_reports/Constraint_violations/P2383#"Unique value" violations. --Bamyers99 (talk) 19:00, 15 June 2018 (UTC)
Ok, no matter. Good continuation. Nomen ad hoc (talk) 19:02, 15 June 2018 (UTC).

Adding of StadiumDB identifier (P5288) to wikidata-entities for stadiums

Hello Bamyers. Thanks for adding several values of StadiumDB identifier (P5288). See also World Stadium Database identifier (P5289). Best regards Migrant (talk) 19:26, 5 October 2018 (UTC)

Navel Gazer by language?

Is there a way to use Navel Gazer or similar to identify users by the language(s) they are editing in? When I come across an edit in a language/ script I can't handle myself, I would like to have a possibility to contact accounts that are editing in this language (or at least script), preferably recently and (if the edit was about a language-specific property) ideally have used the same property themselves. --Daniel Mietchen (talk) 23:13, 3 February 2019 (UTC)

@Daniel Mietchen: Navel Gazer can now show user language addition counts. It has total counts of label, description, alias additions. I couldn't add 'language-specific property' counts because the language is not saved in the edit comment. Navel Gazer gets its information from edit comments. Sample language addition count query: French (fr) addition counts. --Bamyers99 (talk) 02:05, 5 February 2019 (UTC)
@Bamyers99: That looks great and addresses my use case, thanks! Could you add the addition of sitelinks into the count? There is also a kind of inverse use case: if I'd like to discuss something with a given user whom I do not know yet, I'd like to pick a language close to their comfort zone if possible, but if they have no babel set and their most recent edits are all language-neutral, then I have to manually search for things like multilingual properties they may have used, or pick in the dark. Usually, this means resorting to English, which may not be the best option for me to discuss with them. So is there a way to include language info in the Navel Gazer results for a given user? I understand that language-specific properties do not leave a language trace in the edit summary that you are parsing, but what about adding a column for the data type of a property to the results page? This would allow looking specifically at things like Monolingual text or Sense. There is also a minor bug: if a user name contains a space, the tool replaces it with a "+", which causes the links to break (see your example). I would have tried to fix this one myself but don't know where the code sits. --Daniel Mietchen (talk) 07:35, 5 February 2019 (UTC)
@Daniel Mietchen: Everything has been implemented/fixed. --Bamyers99 (talk) 00:14, 8 February 2019 (UTC)
Thanks, Bamyers99 — this looks great! --Daniel Mietchen (talk) 15:31, 14 February 2019 (UTC)

Hi! I really love Navel Gazer and I absolutely have to thank you for it. I have just a little report about Label, description, alias, sitelink additions: in my case I have 43915 for "it", 11903 for "en", 6581 for "pl" and 5980 for "de" ... but, if I search by Language code, I appear in the "top 50" only for "it" and "pl", not for "en" or "de" ... is there any reason? I'm asking not just for myself, as it may not be an isolated case. Can you check? Thank you very much again, --Epìdosis 16:10, 2 May 2019 (UTC)

@Epìdosis: I have added the rank to the report to indicate which 50 are included because of total count and which 50 because of last month's count. --Bamyers99 (talk) 20:38, 2 May 2019 (UTC)
Perfect, thank you! --Epìdosis 20:39, 2 May 2019 (UTC)

Navel Gazer


First of all, I must say that I love Navel Gazer and use it quite often. I was also very pleased to see that you added "Pseudo properties" for Lexemes. But is it me who doesn't understand something, or are the numbers wrong for P-7 or P-10? (I'm pretty sure I did more edits for form representations or sense glosses.)

Cheers, VIGNERON (talk) 13:08, 19 April 2019 (UTC)

@VIGNERON: I have performed a detailed analysis of your edit history. I hacked my dump scanner to dump out the comments from just your edits (1712294 edits as of 1 April 2019). I then ran the comments through sed (Q305876) to extract the action performed (first column below). Then I sorted and aggregated to get total counts. To relate the action to the user contributions list, the second column below is the comment template for English that is used to format comments. MediaWiki translation files: Wikibase en translation file (GitHub), Lexeme en translation file (GitHub)
Based on the analysis, I believe that Navel Gazer is properly reporting totals using the actions found in the comment strings.
| Action | Contributions comment template (en) | Navel Gazer | Count |
| wbsetdescription-add | Added [$2] description | -2 | 654472 |
| wbcreateclaim-create | Created claim | Property added | 402687 |
| wbsetlabel-set | Changed [$2] label | Not reported | 268474 |
| wbsetdescription-set | Changed [$2] description | Not reported | 121728 |
| wbsetreference-add | Added reference to claim | Not reported | 118409 |
| wbsetlabel-add | Added [$2] label | -1 | 74361 |
| wbsetclaim-create | Created {{PLURAL:$3|claim|claims}} | Property added | 19419 |
| wbremoveclaims-remove | Removed {{PLURAL:$1|claim|claims}} | Not reported | 13199 |
| wbsetsitelink-add | Added link to [$2] | -4 | 9249 |
| wbsetclaim-update | Changed {{PLURAL:$3|claim|claims}} | Not reported | 4714 |
| undo | ? | Not reported | 4658 |
| wbsetqualifier-add | Added {{PLURAL:$1|qualifier|qualifiers}} | Not reported | 4265 |
| wbeditentity-create | Created a new Item | Not reported | 4242 |
| wbsetaliases-add | Added [$2] {{PLURAL:$1|alias|aliases}} | -3 | 1002 |
| wbeditentity-update | Changed an Item | Not reported | 969 |
| wbmergeitems-to | Merged Item into $3 | Not reported | 633 |
| add-form | Added new Form $3 with {{PLURAL:$1|representation|representations}} | -6 | 626 |
| wbmergeitems-from | Merged Item from $3 | -5 | 587 |
| wbcreateredirect | Redirected to $4 | Not reported | 564 |
| clientsitelink-update | Page moved from [$3] to [$4] | Not reported | 496 |
| wbsetsitelink-remove | Removed link to [$2] | Not reported | 482 |
| wbeditentity-create-item | Created a new Item | Not reported | 460 |
| wbeditentity-override | Cleared an Item | Not reported | 218 |
| wbsetdescription-remove | Removed [$2] description | Not reported | 202 |
| wbsetaliases-remove | Removed [$2] {{PLURAL:$1|alias|aliases}} | Not reported | 186 |
| restore | ? | Not reported | 150 |
| wbsetlabel-remove | Removed [$2] label | Not reported | 111 |
| add-sense | Added new Sense $3 with {{PLURAL:$1|[$2] gloss|glosses}} | -9 | 111 |
| wbsetaliases-set | Set [$2] {{PLURAL:$1|alias|aliases}} | Not reported | 108 |
| wbsetsitelink-set | Changed link to [$2] | Not reported | 101 |
| add-form-grammatical-features | Added grammatical {{PLURAL:$1|feature|features}} of Form $3 | -8 | 97 |
| wbremovereferences-remove | Removed {{PLURAL:$3|reference|references}} from claim | Not reported | 45 |
| wbsetaliases-update | Updated [$2] {{PLURAL:$1|alias|aliases}} | Not reported | 37 |
| wbsetreference-set | Changed reference of claim | Not reported | 29 |
| wbsetlabeldescriptionaliases | Changed [$2] label, description and aliases | Not reported | 24 |
| wbsetclaim-update-qualifiers | Changed {{PLURAL:$4|one qualifier|$4 qualifiers}} of {{PLURAL:$3|claim|claims}} | Not reported | 18 |
| update-form-grammatical-features | Changed representations and/or grammatical features of Form $3 | Not reported | 15 |
| add-sense-glosses | Added {{PLURAL:$1|[$2] gloss|glosses}} to Sense $3 | -10 | 14 |
| update-form-representations | Changed representations and/or grammatical features of Form $3 | Not reported | 10 |
| add-form-representations | Added {{PLURAL:$1|representation|representations}} of Form $3 | -7 | 3 |
--Bamyers99 (talk) 00:36, 21 April 2019 (UTC)
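
The extract-and-aggregate step described above was done with sed plus sorting; the following is only a Python stand-in for the same idea, not the actual dump scanner. It assumes edit comments begin with `/* action:... */`, as Wikibase auto-comments in the revision dumps do, and the regular expression and function names are hypothetical. Comments that do not match the pattern are ignored here.

```python
import re
from collections import Counter
from typing import Iterable, Optional

# Captures the action name between "/*" and the first ":".
ACTION_RE = re.compile(r"/\*\s*([A-Za-z0-9-]+):")


def extract_action(comment: str) -> Optional[str]:
    """Return the action from a comment like '/* wbsetlabel-add:1|fr */ ...'."""
    match = ACTION_RE.match(comment.lstrip())
    return match.group(1) if match else None


def count_actions(comments: Iterable[str]) -> Counter:
    """Aggregate total counts per action, skipping non-matching comments."""
    actions = (extract_action(c) for c in comments)
    return Counter(a for a in actions if a is not None)
```

Calling `count_actions(comments).most_common()` gives the actions sorted by descending count, which is the ordering used in the table.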

Thank you! re: schools and districts

Hi! I see how much work you've been doing adding schools and districts from the Common Core of Data. Thank you so much! I deeply appreciate your effort and how much you're improving coverage of educational institutions on Wikidata, especially adding NCES identifiers and relationships between schools and districts. Runner1928 (talk) 13:36, 17 May 2019 (UTC)


I certainly have nothing against Merriam-Webster, but why not use a Wikimedia project? Arlo Barnes (talk) 21:31, 11 December 2019 (UTC)

@Arlo Barnes : Great idea. I have updated the link. --Bamyers99 (talk) 22:03, 11 December 2019 (UTC)

Unit converter script

Hello! I love your unit converter script, but I've noticed that errors build up quite rapidly as the converted figures get larger. I'm wondering if you'd be willing to extend the significant figures beyond three for greater accuracy. Perhaps to six decimal places in the conversion ratio? Huntster (t @ c) 20:01, 12 January 2021 (UTC)

@Huntster: I have increased the conversion unit precision. --Bamyers99 (talk) 21:06, 12 January 2021 (UTC)
Fantastic! Everything looks much better at first glance. Thank you. Huntster (t @ c) 21:56, 12 January 2021 (UTC)
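
To illustrate why a ratio rounded to three significant figures drifts on large values, here is a small sketch using the mile-to-kilometre factor (1 mile is exactly 1.609344 km). The choice of unit and the function names are assumptions for illustration; the gadget's actual conversion table is not shown in this thread.

```python
KM_PER_MILE = 1.609344  # exact by definition of the international mile


def rounded_factor(decimals: int) -> float:
    """The conversion ratio rounded to the given number of decimal places."""
    return round(KM_PER_MILE, decimals)


def conversion_error_km(miles: float, decimals: int) -> float:
    """Absolute error, in km, caused by rounding the ratio."""
    return abs(miles * rounded_factor(decimals) - miles * KM_PER_MILE)
```

With two decimal places the ratio becomes 1.61, which is off by 0.000656 km per mile, so the absolute error grows in proportion to the input (about 656 km for an input of one million miles). With six decimal places this particular ratio is reproduced exactly.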

Regarding your currency conversion, I see it trying to account for year (such as at Phobos (Q104922609)), but it's having a hard time for some reason. Notice different conversions for the same input at the example. Is there a preferred way to lay out the values so it converts properly? Thanks! Huntster (t @ c) 20:30, 22 January 2021 (UTC)

@Huntster: I assume you are referring to the 2006 cost property not displaying an inflation adjusted amount, while the 2006 significant event cost qualifier is displaying one. I have fixed a bug so that they both display the adjusted amount. The 2020 ship acquired amount is not adjusted because I have not entered 2020 adjustments yet. --Bamyers99 (talk) 23:10, 22 January 2021 (UTC)
Understood! Thanks for being responsive, the script really is a fantastic accessory. :) Huntster (t @ c) 01:18, 23 January 2021 (UTC)
It's strange, today I was checking Deimos (Q40426797) and Phobos (Q104922609), and the mid-2000s cost properties are again defaulting to 2019 inflation adjusted amount. Any ideas? Huntster (t @ c) 16:14, 24 January 2021 (UTC)
@Huntster: Operating as designed. 2019 is the latest that I have entered inflation data for. I have not entered 2020 inflation data yet.--Bamyers99 (talk) 16:33, 24 January 2021 (UTC)
No no, I understand that. I'm speaking of the construction costs from 2005 and 2006. Another interesting case is at Space Environments Complex (Q4049845); since the Euro wasn't around in 1968, the conversion seems...malformed? Huntster (t @ c) 18:49, 24 January 2021 (UTC)
@Huntster: Sorry, I assumed that you were using USD instead of EUR currency. I have added units conversion for the original year in addition to the inflation adjusted amount. I have inflation data that somebody synthesized for the combined countries of the Euro block going back to 1960. --Bamyers99 (talk) 20:27, 24 January 2021 (UTC)
I am using USD. As an example, the input is "312,000,000 United States dollar", conversion is "263,854,284 euro (2019)" but Point in Time is "2005" (and specifically "20 September 2005" in another location). I apologize if I am being unclear on anything. Huntster (t @ c) 20:52, 24 January 2021 (UTC)
@Huntster: Sorry, I deployed the updated gadget to the wrong place. Give it another try. --Bamyers99 (talk) 06:11, 25 January 2021 (UTC)
Looks mostly right, now. It's showing "312,000,000 United States dollar, (250,766,355 euro (2005)), (263,854,284 euro (2019))", so that 2019 is still sneaking in. On the other hand, the 2020 conversion looks good, assuming the ratio is taken from, what, the July 1 split? Or is it the high/low average for the year? Huntster (t @ c) 07:04, 25 January 2021 (UTC)

Proposal of improvement to Navel Gazer - tags

Hi! As I've already said, I consider Navel Gazer a fantastic tool and I always suggest that other users have a look at it. I'm thinking about the following addition to the statistics already present: as of now, I don't know of any way to find how many edits have been done by a user through semi-automatic tools; so, I suggest adding to the present sections ("Property additions", "Other actions", "Label, description, alias, sitelink additions") a fourth section named "Tools used" or "Tags" or something similar, listing the number of edits with a tag performed by the user (I would consider tags such as "Widar [1.4]", "QuickStatements [1.5]", "Widar [1.3]" and all others referring to tools, not ones such as "redirect created" or "new editor changing statement" and similar; otherwise, we can choose to show all tags) and finally the number of edits without tags; there could also be, in addition to "User", "Property"/"Pseudo-properties" and "Language code", a tab "Tools" or "Tags" in which it would be possible to see which users have done the most edits with a specific tag. Obviously this is just a proposal and it has no urgency at all, so feel free to have a look whenever you have time and if you find it useful and significant. Thanks and good night, --Epìdosis 00:03, 14 January 2021 (UTC)

@Epìdosis: I have done some preliminary analysis. The tag information is not in the database dump wikidatawiki-stub-meta-history.xml. It is stored in wikidatawiki-change_tag.sql.

Sample revision dump:

        <username>Kam Solusar</username>
      <comment>/* wbcreateclaim-create:1| */ [[Property:P5905]]: 4020-56944, #quickstatements; #temporary_batch_1558736841614</comment>
      <text bytes="149740" id="955875077" />

Sample tag data for above revision:

MariaDB [wikidatawiki_p]> select * FROM change_tag,change_tag_def WHERE ct_rev_id = 948407982 AND ct_tag_id = ctd_id;
| ct_id     | ct_rc_id  | ct_log_id | ct_rev_id | ct_params | ct_tag_id | ctd_id | ctd_name        | ctd_user_defined | ctd_count |
| 311583001 | 985250530 |      NULL | 948407982 | NULL      |       594 |    594 | OAuth CID: 1253 |                0 |  47000918 |

"OAuth CID: 1253" would need to be looked up in localization data somewhere to get "QuickStatements [1.3]". I am in the middle of a multi-week project so it will be a while before I have a chance to look at this more thoroughly. --Bamyers99 (talk) 02:09, 14 January 2021 (UTC)

@Epìdosis: Tool usage is now available in NavelGazer. It only includes tools using OAuth. --Bamyers99 (talk) 02:06, 19 January 2021 (UTC)
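
For reference, the tag lookup discussed in this thread can be reproduced in miniature as below. This is a sketch under assumptions, not Navel Gazer's code: it uses an in-memory SQLite database as a stand-in for the MariaDB replica, loads only the single sample row quoted above, and the `TOOL_NAMES` mapping is a hand-entered placeholder for the localization lookup that would turn "OAuth CID: 1253" into a tool name.

```python
import sqlite3
from typing import List


def build_sample_db() -> sqlite3.Connection:
    """Create the two tables with just the sample row from this thread."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE change_tag (
            ct_id INTEGER, ct_rc_id INTEGER, ct_log_id INTEGER,
            ct_rev_id INTEGER, ct_params TEXT, ct_tag_id INTEGER
        );
        CREATE TABLE change_tag_def (
            ctd_id INTEGER, ctd_name TEXT,
            ctd_user_defined INTEGER, ctd_count INTEGER
        );
        """
    )
    conn.execute(
        "INSERT INTO change_tag VALUES (?, ?, ?, ?, ?, ?)",
        (311583001, 985250530, None, 948407982, None, 594),
    )
    conn.execute(
        "INSERT INTO change_tag_def VALUES (?, ?, ?, ?)",
        (594, "OAuth CID: 1253", 0, 47000918),
    )
    return conn


def tag_names_for_revision(conn: sqlite3.Connection, rev_id: int) -> List[str]:
    """Same join as the query above, written with an explicit JOIN."""
    rows = conn.execute(
        "SELECT ctd_name FROM change_tag "
        "JOIN change_tag_def ON ct_tag_id = ctd_id "
        "WHERE ct_rev_id = ?",
        (rev_id,),
    )
    return [row[0] for row in rows]


# Placeholder for the localization lookup mentioned in the thread.
TOOL_NAMES = {"OAuth CID: 1253": "QuickStatements [1.3]"}
```

Counting tool usage per user would then be a matter of grouping these tag names by the revision's author, which requires the revision data as well.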