See User talk:ArthurPSmith/Archive for older discussions.


authordisambiguator

edit

Hi,

Is it possible that the option "reset error" of the tool isn't working ? I tried it a couple of time without success. Simon Villeneuve (talk) 11:13, 19 January 2024 (UTC)Reply

  • @Simon Villeneuve: It seems to work for me - you click on the "Reset errors?" link for a batch and the batch should now show as in "Ready" state, then you can click the "Restart batch?" link to actually run it again. However of course if the error condition is still there it will return to the error state again when it runs... ArthurPSmith (talk) 15:17, 19 January 2024 (UTC)Reply

Another thing : can the tool convert non-romanized letters ? As an example, for Aleš Bezděk (Q112392511), I have no result if I use "Ales Bezdek", and plenty of results if I use "Aleš Bezděk". Simon Villeneuve (talk) 12:04, 25 January 2024 (UTC)Reply
P.S. : same thing for Olivier Bienaymé (Q102715089) (Bienayme) Simon Villeneuve (talk) 02:37, 28 January 2024 (UTC)Reply

  • @Simon Villeneuve: On copying the link - great idea but there was some reason I thought it wouldn't work, I'll have to dig into it. Can't remember what the problem was...
  • Regarding non-romanized letters: the trouble is the code finds works for authors through a SPARQL search and that depends on an exact string match. Going from a non-roman letter to a roman letter generally works, so if you input "Aleš Bezděk" it should match both "Aleš Bezděk" and "Ales Bezdek" because the code is translating the non-roman to roman ones. But it would be infeasible to do that in the other direction - there are too many possible name variants changing every possible letter that could be accented. ArthurPSmith (talk) 18:09, 29 January 2024 (UTC)Reply
edit

Hi, I don't know if something happened to the tool, but whenever I try to follow a link for IMDb ID (P345) I get a page saying that the tool is taking too long to respond. Agabi10 (talk) 09:59, 5 March 2024 (UTC)Reply

Invitation to participate in the WQT UI requirements elicitation online workshop

edit

Dear ArthurPSmith,

I hope you are doing well,

We are a group of researchers from King’s College London working on developing WQT (Wikidata Quality Toolkit), which will support a diverse set of editors in curating and validating Wikidata content.

We are inviting you to participate in an online workshop aimed at understanding the requirements for designing effective and easy-to-use user interfaces (UI) for three tools within WQT that can support the daily activities of Wikidata editors: recommending items to edit based on their personal preferences, finding items that need better references, and generating entity schemas automatically for better item quality.

The main activity during this workshop will be UI mockup sketching. To facilitate this, we encourage you to attend the workshop using a tablet or laptop with PowerPoint installed or any other drawing tools you prefer. This will allow for a more interactive and productive session as we delve into the UI mockup sketching activities.

Participation is completely voluntary. You should only take part if you want to and choosing not to take part will not disadvantage you in any way. However, your cooperation will be valuable for the WQT design. Please note that all data and responses collected during the workshop will be used solely for the purpose of improving the WQT and understanding editor requirements. We will analyze the results in an anonymized form, ensuring your privacy is protected. Personal information will be kept confidential and will be deleted once it has served its purpose in this research.

The online workshop, which will be held on April 5th, should take no more than 3 hours.

If you agree to participate in this workshop, please either contact me at kholoud.alghamdi@kcl.ac.uk or use this form to register your interest https://forms.office.com/e/9mrE8rXZVg Then, I will contact you with all the instructions for the workshop.

For more information about my project, please read this page: https://king-s-knowledge-graph-lab.github.io/WikidataQualityToolkit/

If you have further questions or require more information, don't hesitate to contact me at the email address mentioned above.

Thank you for considering taking part in this project.

Regards Kholoudsaa (talk) 03:29, 19 March 2024 (UTC)Reply

Property P4280 for deletion

edit

Hi! I have just proposed for deletion the property P4280 (P4280) that you created. Please check Wikidata:Properties for deletion/P4280. Horcrux (talk) 18:08, 14 April 2024 (UTC)Reply

A . M. Zhang and author_strings gadget madness

edit

Hi @ArthurPSmith, thanks for reverting the name change for A . M. Zhang. I had no idea I'd done this! I briefly installed the author_strings gadget, but it seemed horribly glitchy, made page loading slow, and sometimes generated a big list of seemingly unrelated authors which would suddenly appear on my page why I was editing. I must have accidentally clicked something which generated a new item for "A . M. Zhang" and then matched that to a number of author strings. Bizarre! Anyway shortly after installing it I switched it off as it made the editor barely usable. I've looked at my contributions and found the rest of the "A . M. Zhang" changes and reverted them. Sorry for the spurious edits, it's a bit alarming that a gadget can create a bunch of edits seemingly without me knowing. Rdmpage (talk) 12:27, 23 April 2024 (UTC)Reply

@Rdmpage: Thanks for fixing! That is definitely alarming... ArthurPSmith (talk) 13:27, 23 April 2024 (UTC)Reply

Universités

edit

Bonjour User:ArthurPSmith, merci pour votre intérêt. Je fais un peu de ménage dans les éléments sur les organisations, qui sont effectivement parfois confondus avec leurs implantations. Wikidata est une base de données structurées où les concepts doivent être bien définis : une université n'est pas un campus, comme une entreprise n'est pas une usine et inversement. Le Campus adventiste du Salève est un cas un peu déroutant parce que l'organisation elle-même porte le nom de "campus", mais ça ne change pas sa nature. Cordialement, Arpyia (talk) 17:23, 3 May 2024 (UTC)Reply

Bonjour @Arpyia:! I hope you don't mind me staying in English. I agree that a company is not a factory, a university is not a campus, etc. So is your point that Adventist University of France – Collonges (Q2935621) is a campus, not a university? Because right now it says instance of (P31) university (Q3918). While your new Q125753974 says instance of (P31) organization (Q43229). university (Q3918) is a subclass of organization (Q43229) so to me they seem the same. Something here needs to be adjusted. ArthurPSmith (talk) 17:30, 3 May 2024 (UTC)Reply
You are right. Usually we have items about organisations which get mixed up with properties about campuses, but this one looks to have been intented for the place, that's why I created the item about the organisation. A whole other question would be: what qualifies as a university? I don't think anyone in France would call Campus adventiste du Salève a university. But I won't get into that! Arpyia (talk) 17:40, 3 May 2024 (UTC)Reply
While you're here, I am trying to include the registration number for all dangerous or polluting facilities in France. This could include some research facilities too. Could you help me here: Wikidata:Property proposal/numéro d'établissement d'une ICPE? Thank you! Arpyia (talk) 08:42, 4 May 2024 (UTC)Reply
Thank you a lot for your help with that! Arpyia (talk) 13:07, 22 June 2024 (UTC)Reply
Hi - I'm not clear what help you need? The property has been created - ICPE establishment ID (P12719) and can be used right away! ArthurPSmith (talk) 16:53, 22 June 2024 (UTC)Reply

Dowry in Islam

edit

Hi ArthurPSmith. I discovered this morning that all my additions in the mahr (Q902443) page (Dowry in Islam) have been canceled. In particular, these authority links have been targeted:

  1. LCAuth: https://id.loc.gov/authorities/sh2022006551
  2. LCAuth: https://id.loc.gov/authorities/sh85039242
  3. LCClass: https://id.loc.gov/authorities/classification/BP190.5.D69
  4. FAST: https://id.worldcat.org/fast/2060623
  5. FAST: https://id.worldcat.org/fast/897273
  6. IdRef: https://www.idref.fr/17523244X
  7. BnF: https://catalogue.bnf.fr/ark:/12148/cb16729423j
  8. BNE: https://datos.bne.es/resource/XX547746

All the beautiful and patient work that I have brought to this page has been canceled. I am afraid that all the other pages that I have improved and enriched will suffer the same fate. SOS! Soufiyouns (talk) 06:24, 4 July 2024 (UTC)Reply

Sorry about that - have you talked to the person who reverted your edits? The account appears to be كريم رائد - so you should go to their talk page and ask them why they made the change, or if possible to reverse their removal of your edits. If that doesn't get anywhere then you should bring this up at the Wikidata:Administrators' noticeboard. I have no authority on this myself. ArthurPSmith (talk) 17:30, 8 July 2024 (UTC)Reply
@ArthurPSmith: Thank you very much for your kind and compassionate response, you can check this user's response in Mike Peel's talk page. Regards. Soufiyouns (talk) 17:36, 8 July 2024 (UTC)Reply

instance and subclass of the same class

edit

QLever can run the general query for this. Could it replace the several queries and pages you have set up? I'm willing to make the changes if you can tell me how to generate the reports.

Here are the top few entries:

 ?metaclass	?metaclassLabel	?count
 Q7187	gene	975340
 Q8054	protein	761241
 Q4164871	position	99687
 Q277338	pseudogene	49392
 Q427087	non-coding RNA	45202
 Q2996394	biological process	28254
 Q14860489	molecular function	11243
 Q294414	public office	9909
 Q34770	language	7074
 Q5058355	cellular component	4198
 Q898273	protein domain	2643
 Q12136	disease	2270
 Q282	wine	2145
 Q618779	award	1947
 Q929833	rare disease	1919
 Q8187769	economic activity	1790
 Q55788864	developmental defect during embryogenesis	1720
 Q201448	transfer RNA	1153
 Q112965645	symptom or sign	1124 Peter F. Patel-Schneider (talk) 12:50, 5 October 2024 (UTC)Reply
@Peter F. Patel-Schneider: The reports are generated by a Magnus Manske tool called "Listeria" - see Wikidata:Listeria. It is called through the Template:Wikidata list template. I think it would be a great idea to either add qlever support to this tool or to fork it to use qlever instead of WDQS. But you'll have to talk to Magnus or work on the code yourself I think for that to happen. ArthurPSmith (talk) 20:14, 14 October 2024 (UTC)Reply
please share your code that lead to the above data. International Press Center (talk) 16:12, 23 November 2024 (UTC)Reply
I think this is the query I used:
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX wdt: <http://www.wikidata.org/prop/direct/>
SELECT DISTINCT ?metaclass ?metaclassLabel (COUNT(DISTINCT ?class) as ?count) WHERE {
?class wdt:P31 ?metaclass ;
wdt:P279+ ?metaclass .
OPTIONAL { ?metaclass rdfs:label ?metaclassLabel . FILTER ( lang(?metaclassLabel)='en' ) }
} GROUP BY ?metaclass ?metaclassLabel ORDER BY DESC(?count) Peter F. Patel-Schneider (talk) 17:43, 23 November 2024 (UTC)Reply
Thank you, that is https://qlever.cs.uni-freiburg.de/wikidata/AVcvMG - it says 2ms for resolving and sending, it my browser it took longer to see a result, maybe longer than 2s. International Press Center (talk) 17:54, 23 November 2024 (UTC)Reply
The "+" removed https://qlever.cs.uni-freiburg.de/wikidata/zQM4BJ :
?metaclass ?metaclassLabel ?count
1 Q8054 protein 751,644
2 Q7187 gene 430,982
3 Q277338 pseudogene 49,392
4 Q427087 non-coding RNA 44,857
5 Q201448 transfer RNA 1,153
6 Q284416 small nucleolar RNA 560
7 Q898273 protein domain 473
8 Q502048 gasoline engine 162
9 Q6979593 national association football team 107
10 Q618779 award 106
11 Q163727 bachelor's degree 95
Suggest to fix these first. International Press Center (talk) 18:00, 23 November 2024 (UTC)Reply

A.D.

edit

Hi,

Again about AuthorDisambiguator.

Do you know why I can't process the results for R J Laureijs‎ ? Simon Villeneuve (talk) 13:52, 3 December 2024 (UTC)Reply

@Simon Villeneuve: That's weird - it means somehow the name is matching on a search but not when it tries to match on the author list that's returned. I'll have to dig a little deeper to get to the bottom of it. Have you seen any other examples like this? ArthurPSmith (talk) 18:21, 3 December 2024 (UTC)Reply
Ah, I think I see what it is. You have a hidden unicode character after the s in the name. Cut and paste the name from one of the Wikidata author name string values and it should work. ArthurPSmith (talk) 18:24, 3 December 2024 (UTC)Reply
I remember another one example like this, but I can't find it. Strange this hidden unicode. Thank you, now, I'll know that this case exist. Simon Villeneuve (talk) 20:30, 3 December 2024 (UTC)Reply
It would probably be helpful if there was some clear way to display such characters to warn people... I'll have to look into it. ArthurPSmith (talk) 21:29, 3 December 2024 (UTC)Reply

NUKAT URL

edit

Hi! I write you briefly because the last issue of the URL of NUKAT ID (P1207) (Property talk:P1207#Formatter URL) is still open: e.g. in Annibale Balocco (Q109498768) there are two NUKAT IDs, for the first https://wikidata-externalid-url.toolforge.org/?p=1207&url_prefix=http://nukat.edu.pl/aut/&id=n2006055186 correctly resolves into http://katalog.nukat.edu.pl/lib/authority?lccn=n%202006055186, whilst for the second https://wikidata-externalid-url.toolforge.org/?p=1207&url_prefix=http://nukat.edu.pl/aut/&id=nx2023522126 incorrectly resolves into the inexistent http://katalog.nukat.edu.pl/lib/authority?lccn=n%20x2023522126 (whilst http://katalog.nukat.edu.pl/lib/authority?lccn=nx2023522126 exists). Briefly, for IDs with nx prefix no space should be added. Would it be solvable? Thanks and happy holidays! --Epìdosis 12:05, 28 December 2024 (UTC)Reply

Hi - sorry I've been vacationing from work and Wikidata :) - will try to take a look this coming week, it sounds straightforward. ArthurPSmith (talk) 21:23, 4 January 2025 (UTC)Reply
@Epìdosis: Ok, this is fixed now! ArthurPSmith (talk) 01:15, 10 January 2025 (UTC)Reply

Thanks for your assistance with my bot

edit

Here's a fake award: 🏆 David!! (talk) 21:03, 6 February 2025 (UTC)Reply

authordisambiguator 2025

edit

It wouldn't be a good year if I didn't have something to say about Authorship Disambiguator. ;)

So, I saw that "Le Fèvre O." gave me 0 results, but "Le Fevre O." gave me 101 results. Simon Villeneuve (talk) 21:28, 30 March 2025 (UTC)Reply

@Simon Villeneuve: Hmm, I'm not seeing anything for either one right now, but I assume that's because you already handled them. I tried "A. Le Fèvre" instead just now, and that gave far more results than "A. Le Fevre" (and included all the unaccented ones). In general it looks like the translation of accented characters to plain ASCII does work. It's possible this was thrown off by you reversing the family and given names - did you try "O. Le Fèvre"? ArthurPSmith (talk) 12:55, 12 April 2025 (UTC)Reply
Well, I'm elsewhere now.
I'll give you another one that I'll not treat : De Laverny P. (77 results) and de Laverny P. (20 results). Simon Villeneuve (talk) 09:42, 13 April 2025 (UTC)Reply
Ok, I think I know why that would happen, I'll look into it! ArthurPSmith (talk) 23:44, 13 April 2025 (UTC)Reply

New pull request

edit

Hello Arthur! Sorry for the inconvenience, could you please take a look at this request and approve it if possible? Thank you in advance. Kirilloparma (talk) 01:05, 11 April 2025 (UTC)Reply

@Kirilloparma: sorry for the delay, I will try to get to this in the next day or two! ArthurPSmith (talk) 13:40, 11 April 2025 (UTC)Reply
No worries. Regards Kirilloparma (talk) 02:32, 12 April 2025 (UTC)Reply
Ok, all done! I updated the formatter URL on the property but it will probably be a day or two before the Wikidata UI registers the change. ArthurPSmith (talk) 12:46, 12 April 2025 (UTC)Reply

Just a little suggestion

edit

When making new properties for external identifiers could you add property scope > as reference as well? Trade (talk) 03:34, 24 April 2025 (UTC)Reply

Oh, yes I guess they are used there a lot. Sure. ArthurPSmith (talk) 12:47, 24 April 2025 (UTC)Reply

Recent edit

edit

Thank you for your recent edit(s) to the Wikidata:Property proposal/TeamUSA.com athlete ID. Since I have very little familiarity with this site (I work much moreso on en.wiki), I had initially reached out to user Zyxw to get that initiated.

In a similar vein to the Team USA proposal above, it appears the Hellenic Olympic Committee's information has also changed. That is at Hellenic Olympic Committee athlete ID (archived) (P4489). It appears they've all changed to https://www.hoc.gr/en/athletes/XXX, for example Nikos Andriakopoulos.

Would it be possible for you to make that change also, or at least start that conversation as I'm not sure how to? Thanks! GauchoDude (talk) 11:16, 30 April 2025 (UTC)Reply

@GauchoDude: Probably best to have Zyxw create the proposal you are looking for. I'm one of the (limited number of) property creators here; this is just one of about 40 I've created in the last week, and I don't have any special knowledge on the topic itself. ArthurPSmith (talk) 13:10, 30 April 2025 (UTC)Reply