ArthurPSmith
See User talk:ArthurPSmith/Archive for older discussions.
Jewish Museum Berlin object ID
Thanks for fixing my typo at Jewish Museum Berlin object ID and not excoriating me for it! --RAN (talk) 21:35, 29 January 2021 (UTC)
I want my talk page to look like User talk:Belteshassar's talk page
Hi. I'm asking for your help (totally out of the blue): can my talk page look like User talk:Belteshassar's talk page? LotsofTheories (talk) 21:03, 9 February 2021 (UTC)
- @LotsofTheories: I believe this is a user preference called "Structured Discussions" - see Special:Preferences under "Beta features", check the box "Structured Discussions on user talk". This page may be helpful also. ArthurPSmith (talk) 22:03, 9 February 2021 (UTC)
L14698 and L184508
Hi! I just came across the two lexemes bow (L14698) and bow (L184508). I asked myself why these two lexemes were not combined into one lexeme. Since you are the creator of both lexemes, what was your intention behind them? --Gymnicus (talk) 22:07, 20 February 2021 (UTC)
- @Gymnicus: Because they are two distinct lexemes, pronounced differently even though they are spelled the same and have the same forms. See the differing senses on the two lexemes. ArthurPSmith (talk) 13:20, 22 February 2021 (UTC)
- In my opinion, the different senses are no justification for separate lexemes. As a counter-example to this thesis, I would cite the lexeme Bienenstich (L10226). In the German language, this lexeme describes both the sting of a bee and a sheet cake. These two senses are also completely different and yet it is one lexeme. There must be other differences. For example, for the lexemes Bank (L34723) and Bank (L34791) the reason for the separation is the different grammatical forms. In the case of bow (L14698) and bow (L184508) the different pronunciation you mentioned could be the reason for the separation into two lexemes. But then, from my point of view, the different pronunciation should also be specified. There are various properties that can be used for this in Wikidata. You can see an overview in the template Lexicographical properties. Perhaps you can incorporate at least one of the three properties into each of the two lexemes so that one can see the difference between them. --Gymnicus (talk) 14:44, 22 February 2021 (UTC)
- @Gymnicus: I think any English speaker would understand the difference by looking at the senses, that was my implication. Yes hopefully pronunciation properties will be added, but I've added thousands of lexemes without those statements, and working on other things right now, so it's unlikely I will get to it. ArthurPSmith (talk) 16:05, 22 February 2021 (UTC)
- One reference on this. ArthurPSmith (talk) 16:07, 22 February 2021 (UTC)
- or more humorously... ArthurPSmith (talk) 16:11, 22 February 2021 (UTC)
- “I think any English speaker would understand the difference by looking at the senses” - This assumption has already been refuted by me, as you have already noticed. I speak English, but I didn't know the difference between bow (L14698) and bow (L184508). That is why it could have been that I thoughtlessly put the two lexemes together because I saw no difference between them, except for the different meanings. But as already shown on the lexeme Bienenstich (L10226), this is no reason not to merge the two lexemes. I think it's a shame that you don't add the pronunciation. Then one must hope that the two lexemes are not carelessly put together by someone. Unfortunately, I cannot add the pronunciation either, because I have no idea about it. --Gymnicus (talk) 21:29, 22 February 2021 (UTC)
Strange merge
Hi Arthur, you've merged de:Universidad Santa María (Ecuador) (founded 1996) with en:Federico Santa María Technical University (founded 1931). [1] --Kolja21 (talk) 16:23, 26 February 2021 (UTC)
- @Kolja21: It is the same institution - however the inception date from GRID would appear to be incorrect (based on the dewiki text). It seems to be a branch of the Chilean university, Federico Santa María Technical University (Q457793); I'll link them together. ArthurPSmith (talk) 16:31, 26 February 2021 (UTC)
- Oh, wait, now I don't understand your comment. I didn't merge the de entry with the en entry, the de entry is for the university in Ecuador, the en one is from Chile. ArthurPSmith (talk) 16:35, 26 February 2021 (UTC)
Apps
Should Android and iOS versions of apps have separate items? I cannot for the life of me find anything regarding the modeling of apps --Trade (talk) 00:04, 1 March 2021 (UTC)
- @Trade: No, I think a single item is generally the right thing for a single piece of software - see Wikidata:WikiProject Informatics/Software/Properties and related pages. ArthurPSmith (talk) 18:53, 1 March 2021 (UTC)
- @Trade: @ArthurPSmith: You should create separate items for iOS and Android; otherwise it will get messy. Eurohunter (talk) 20:44, 15 March 2021 (UTC)
Message
I received a message from you, but I didn't understand whether there is any text... Did I do something?... – The preceding unsigned comment was added by Gabriele.badii (talk • contribs) at 18:41, April 16, 2021 (UTC).
- @Gabriele.badii: It's just a standard welcome template - I try to add it when I see somebody commenting on here who seems new. Nothing you did was a problem at all! ArthurPSmith (talk) 12:27, 19 April 2021 (UTC)
Thank you Gabriele.badii (talk) 12:34, 19 April 2021 (UTC)
- salam Yassine Oujda Maroc 105.71.132.91 23:10, 15 July 2023 (UTC)
Call for participation in the interview study with Wikidata editors
Dear ArthurPSmith,
I hope you are doing good,
I am Kholoud, a researcher at the King’s College London, and I work on a project as part of my PhD research that develops a personalized recommendation system to suggest Wikidata items for the editors based on their interests and preferences. I am collaborating on this project with Elena Simperl and Miaojing Shi.
I would love to talk with you to know about your current ways to choose the items you work on in Wikidata and understand the factors that might influence such a decision. Your cooperation will give us valuable insights into building a recommender system that can help improve your editing experience.
Participation is completely voluntary. You have the option to withdraw at any time. Your data will be processed under the terms of UK data protection law (including the UK General Data Protection Regulation (UK GDPR) and the Data Protection Act 2018). The information and data that you provide will remain confidential; it will only be stored on the password-protected computer of the researchers. We will use the results anonymized (?) to provide insights into the practices of the editors in item selection processes for editing and publish the results of the study to a research venue. If you decide to take part, we will ask you to sign a consent form, and you will be given a copy of this consent form to keep.
If you’re interested in participating and have 15-20 minutes to chat (I promise to keep the time!), please either contact me on kholoudsaa@gmail.com or use this form https://docs.google.com/forms/d/e/1FAIpQLSdmmFHaiB20nK14wrQJgfrA18PtmdagyeRib3xGtvzkdn3Lgw/viewform?usp=sf_link with your choice of the times that work for you.
I’ll follow up with you to figure out what method is the best way for us to connect.
Please contact me using the email mentioned above if you have any questions or require more information about this project.
Thank you for considering taking part in this research.
Regards
Kholoud
Apologies for the incorrect edit. The context is linking the particular form in Sphinx of black quartz, judge my vow! (L501732). I meant imperative (Q22716) but was confused; 'case' versus 'mood' should have tipped me off. Sorry again and thanks for catching it. Arlo Barnes (talk) 18:12, 11 June 2021 (UTC)
- No problem - English is in some ways a much simpler language than some others... ArthurPSmith (talk) 18:21, 11 June 2021 (UTC)
Property proposal: Tax Identification Number (Belarus)
Hi Arthur,
Could you take a look at Wikidata:Property proposal/Tax identification number when you have a chance? Aestrum (talk) 17:23, 3 August 2021 (UTC)
- Thank you! Aestrum (talk) 08:47, 4 August 2021 (UTC)
Reverting edits by Kinvidia
Is there a chance you could revert all changes to existing items yesterday by Kinvidia (talk • contribs • logs)? See my comments on his user page. --Hjart (talk) 06:46, 27 August 2021 (UTC)
- @Hjart: (1) I'm not an administrator so I don't think I can do that easily, and (2) there are thousands of changes from that user dated August 26 (almost all Quickstatements), many of them creating new items, and the ones I checked look ok to me. However, the user is not responding to comments on their talk page so maybe something should be brought up on the Administrators Noticeboard about this? ArthurPSmith (talk) 12:32, 27 August 2021 (UTC)
- I posted a note on the noticeboard. We'll see what comes of it... ArthurPSmith (talk) 12:41, 27 August 2021 (UTC)
music created for
Hi, thanks for marking "created for" ready for creation. Any chance that the related proposal music created for could also be ready? There's only been one objection and many supports, and I had hoped I had answered the objections. UWashPrincipalCataloger (talk) 18:59, 1 September 2021 (UTC)
Author Disambiguator stalled?
Hi Arthur, I'm not being impatient, but I noticed that the last few batches haven't been executing. There even seem to be some that are still at READY, but are no longer queued [2]. Maybe you could take a look? Thanks! --Azertus (talk) 17:38, 9 September 2021 (UTC)
- @Azertus: Thanks for letting me know! The background job runner had definitely crashed. I restarted the server - however it won't automatically restart those queued batches, you need to manually restart at least one of them (or submit a new batch) and then I think it will run the others; it doesn't save OAuth credentials in a way that it can use them after restart. ArthurPSmith (talk) 20:35, 9 September 2021 (UTC)
- @Alatalojm, GerardM, BugWarp, Daniel Mietchen, Infovarius: See above - sorry for the problem, but you need to check your batches on Author Disambiguator and restart at least one of them. Thanks! ArthurPSmith (talk) 20:46, 9 September 2021 (UTC)
Wikidata:WikiProject Energy - Are power outages relevant?
Do you know if it would be relevant to add power outages to Wikidata? I know of a power outage where around 10 thousand people lost power for about an hour. Would this be relevant to Wikidata or should I consider making my own Wikibase instance where I can add such information? Only asking for advice since I find Wikidata scary (the politics of Wikidata, not the database software itself). Oduci (talk) 14:03, 15 September 2021 (UTC)
- @Oduci: It may depend on the size of the dataset you are planning, and also whether the outages are notable in the sense that they are documented by some third party source (a newspaper article for instance). We already have things like 2009 Brazil and Paraguay blackout (Q32375), 2006 European blackout (Q1324840), and 2006 Queens blackout (Q4606840) - along with many more instances of power outage (Q828827). If you plan to add at most a few thousand such things here it's probably fine. If you have millions of them, then a separate wikibase would likely be a better choice, just for capacity reasons. ArthurPSmith (talk) 20:15, 15 September 2021 (UTC)
dissertation program ready but not appearing on the list of proposals ready for creation
Hi, thanks for marking the proposal for dissertation program ready for creation. But do you know why it is not appearing on the list of proposals that are ready for creation? UWashPrincipalCataloger (talk) 16:55, 16 September 2021 (UTC)
- @UWashPrincipalCataloger: It seems to be there now - maybe there's some delay? Not sure how that works. ArthurPSmith (talk) 17:57, 16 September 2021 (UTC)
Or maybe I am just going blind! Anyway, thanks again! UWashPrincipalCataloger (talk) 19:27, 16 September 2021 (UTC)
Wikidata:Property proposal/Smotrim.ru film ID, Wikidata:Property proposal/Channel One Russia show ID
Good evening. Could you summarize my suggested properties, which have been ready for a few weeks now? MasterRus21thCentury (talk) 20:54, 11 December 2021 (UTC)
Thank you for your recent merges
I guess I created some items based on redirecting DOIs. Thanks for merging them. I was aware that was happening, but I thought I repaired them. I'm not surprised I missed some, but I regret missing so many. Trilotat (talk) 01:45, 14 December 2021 (UTC)
- @Trilotat: I don't think you were the worst offender! I've been going through the list at User:Ivan A. Krestinin/To merge/Scholarly articles and done what I can so far; waiting on the next update to that list to see if there's a lot still left. Unfortunately there are also many non-duplicates in that list due to some journals listing several distinct articles under the same DOI, so just merging based on DOI isn't sufficiently reliable. ArthurPSmith (talk) 16:55, 14 December 2021 (UTC)
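The caveat above, that a shared DOI alone does not prove two items are duplicates, can be illustrated with a minimal sketch. It assumes each record is a plain `(item_id, doi, title)` tuple; the function names, the sample identifiers, and the title-matching rule are all hypothetical and are not how the merge list mentioned above is actually produced.

```python
from collections import defaultdict


def normalize_title(title: str) -> str:
    """Lowercase a title and collapse runs of whitespace."""
    return " ".join(title.lower().split())


def merge_candidates(records):
    """Split items that share a DOI into two groups.

    records: iterable of (item_id, doi, title) tuples (hypothetical shape).
    Returns (safe, needs_review), each a list of (doi, [item_ids]).
    Items sharing a DOI are "safe" only if their normalized titles also
    agree; otherwise the DOI may cover several distinct articles.
    """
    by_doi = defaultdict(list)
    for item_id, doi, title in records:
        # DOIs are case-insensitive, so compare them in lowercase.
        by_doi[doi.lower()].append((item_id, normalize_title(title)))

    safe, needs_review = [], []
    for doi, entries in by_doi.items():
        if len(entries) < 2:
            continue  # only one item uses this DOI, nothing to merge
        item_ids = sorted(item_id for item_id, _ in entries)
        titles = {title for _, title in entries}
        if len(titles) == 1:
            safe.append((doi, item_ids))
        else:
            needs_review.append((doi, item_ids))
    return safe, needs_review
```

Matching titles is only one possible extra check; anything placed in `needs_review` would still need a human look before merging.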
Hi Arthur,
Any chance you could take a look again at Wikidata:Property proposal/original catalog description? Other than Jura, there seems to be mostly support, and no additional comments in almost a month now. It would be great to be able to add these statements soon. Thanks! Dominic (talk) 15:39, 25 January 2022 (UTC)
- @Dominic: I am really not familiar with Commons or SDC at all; I'd prefer that a property creator more familiar with the situation do this. Check out the list on Wikidata:Property creators and see if you recognize anybody who might be more involved with Commons. ArthurPSmith (talk) 16:14, 25 January 2022 (UTC)
- Sure, no worries! I just saw you'd commented on it earlier. Thanks! Dominic (talk) 16:16, 25 January 2022 (UTC)
Hello. Would you help with this property? MasterRus21thCentury (talk) 15:43, 16 April 2022 (UTC)
- @MasterRus21thCentury: It looks ok to me? I guess somebody else helped? ArthurPSmith (talk) 13:32, 18 April 2022 (UTC)
Querying cross-wiki links via wikidata
Hello Arthur, I hope this is an appropriate and non-burdensome question. I saw you named on the [Wikiproject] and that you have developed many wikidata tools. In the way of due diligence, I have browsed resources like [[3]] and [[4]] (and I am a comfortable programmer and API-figure-outer) but I cannot find an answer to my question.
In service of an academic research project, I must compile all Wikipedia namespace articles in all language editions that have 10 or more cross-wiki links. From each I will need its language, title, link, article ID, and creation date. I am not asking you to build any queries for me, but I would be very appreciative if you could point me to the tool, link, or even jargon that I should be using to collect this dataset. Enfascination (talk) 00:54, 26 June 2022 (UTC)
- @Enfascination: Wikipedia articles use wikidata for cross-wiki links, but they may also use internal link text which would not be known to wikidata. But assuming you are ok with just the links provided by wikidata, then all you should need to do for this is to find all wikidata items that have 10 or more sitelinks. If you go to the query service (https://query.wikidata.org/) and click on "Examples" you can find several examples that count and rank items by number of sitelinks, for example try the "Authors, writers and poets ranked by sitelink" query. Let me know if you need further help on this. ArthurPSmith (talk) 13:04, 27 June 2022 (UTC)
- @ArthurPSmith this is tremendously helpful, thank you! That's a surprise that the sitelink system is still hybrid, with cross-wiki links stored either in wikidata or wikitext. I'm OK missing some links as long as I feel like I'm getting perhaps 90%, or all but the most recent. Where could I learn more about how these two systems coexist? If I can root around more I'll know what mental model to rely on (whether they are coequal representations, or one is trying (however imperfectly) to be the comprehensive store, etc). Knowing absolutely nothing, I guess my prior mental model would be that editors tend to add sitelinks as wikitext and those get detected, transferred to wikidata, and then removed by bots. Do you know who could tell me how far that is from the truth? Enfascination (talk) 20:58, 27 June 2022 (UTC)
- So the wikitext versions predate Wikidata, and when Wikidata got started (around 2013) there was an effort to import the sitelinks from wikitext, replacing it, so the vast majority of interlinking at that time was from Wikidata, not from the text. Since then the natural way to add links to other language versions goes through Wikidata, so I think it is still that way. But the wikitext option remains as a workaround for cases that don't work for some reason (for example only one sitelink is allowed per language in Wikidata, but some languages have multiple articles on almost the same topic, so only one of them can be linked via Wikidata and others would need direct links). Help pages like Help:Sitelinks may give you a bit more background on this, though I don't know if there's a general place it's all recorded. Wikidata:Project Chat is also of course a good place to ask general questions. ArthurPSmith (talk) 21:20, 27 June 2022 (UTC)
- Ah, I'll use the project chat for further questions. Thank you again! Enfascination (talk) 02:53, 29 June 2022 (UTC)
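As a rough illustration of the "items with 10 or more sitelinks" approach discussed above, here is a minimal sketch that only builds the SPARQL text to paste into the query service at https://query.wikidata.org/. The function name and its parameters are hypothetical. The query assumes the `wikibase:sitelinks` count and the `schema:about` article triples that the query service exposes; it does not return article IDs or creation dates, and without a small `LIMIT` it may well time out.

```python
def build_sitelink_query(min_sitelinks: int = 10, limit: int = 100) -> str:
    """Return SPARQL text listing Wikipedia articles of well-linked items.

    min_sitelinks and limit are illustrative parameters, not part of any
    Wikidata API; the result is plain text to run in the query service.
    """
    if min_sitelinks < 1 or limit < 1:
        raise ValueError("min_sitelinks and limit must be positive")
    return (
        "SELECT ?item ?sitelinks ?article ?lang ?title WHERE {\n"
        # Number of sitelinks recorded for the item.
        "  ?item wikibase:sitelinks ?sitelinks .\n"
        f"  FILTER(?sitelinks >= {min_sitelinks})\n"
        # One row per linked page: URL, site, language code, and title.
        "  ?article schema:about ?item ;\n"
        "           schema:isPartOf ?site ;\n"
        "           schema:inLanguage ?lang ;\n"
        "           schema:name ?title .\n"
        # Keep only Wikipedia editions, not Commons, Wikisource, etc.
        '  ?site wikibase:wikiGroup "wikipedia" .\n'
        "}\n"
        f"LIMIT {limit}"
    )


if __name__ == "__main__":
    print(build_sitelink_query(min_sitelinks=10, limit=20))
```

Note that the sitelink count includes links to non-Wikipedia projects, so an item passing the filter may have fewer than 10 Wikipedia articles. For the full dataset, working from the database dumps rather than the live query service would probably be more practical.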
Survey of Scottish Witchcraft
Hi Arthur, sorry for the message out of the blue, but I recall seeing your talk on using OpenRefine at WikidataCon and wondered whether you, or someone you knew, would be able to advise/train my student intern Maggie Lin to work with OpenRefine in terms of aiding her work taking very rich historical data from the MS Access database for the Survey of Scottish Witchcraft and modelling it correctly/reconciling it to wikidata. There is a great deal we can do to improve/enhance the data already added and a great many insights and visualisations we can glean once the data is in linked data format. Really not intending to ask for too much of your time or effort beyond making sure Maggie is cognisant of how to work with OpenRefine to get the best out of it. Any thoughts do let me know. Best, Stinglehammer (talk) 21:06, 27 June 2022 (UTC)
- Hi Stinglehammer, glad to hear from you, and I've enjoyed hearing about your project. I do still use OpenRefine regularly but I think I'm kind of behind-the-times on the best ways to use it with Wikidata; it's evolved a lot since I first got familiar with it and I'm a bit stuck on my initial use cases. If Maggie hasn't already done this, I do strongly recommend the tutorial videos at https://openrefine.org - also the documentation there generally is quite good. But there are parts I'm not very familiar with - particularly creating new Wikidata items, which is covered here for example: https://docs.openrefine.org/manual/wikibase/new-entities . It might be best to reach out to the OpenRefine community on their mailing list to find somebody more up-to-date on how to use it? ArthurPSmith (talk) 13:41, 28 June 2022 (UTC)
Merging of unordered author and ordered author
You'd better make new tools for merging ordered author and unordered author in Author Disambiguator. For Jelle Kaastra (Q58250692), we should merge more than 100 articles like Anatomy of the AGN in NGC 5548 (Q59973269). Sharouser (talk) 15:56, 7 July 2022 (UTC)
- @Sharouser: Hi - there's already some support for this - if you go to the work page and click "Auto-match unordered authors?" at the bottom it will suggest a number to set for the unordered author. But then there's another step to actually merge the two entries. If you have a suggestion for a better workflow for this please let me know! ArthurPSmith (talk) 16:03, 7 July 2022 (UTC)
- @ArthurPSmith: I want to merge all unordered authorlink in author (P50) and all ordered authorlink in author (P50) at one time. This way is more convenient and fast. You'd better upgrade author_item_oauth. Sharouser (talk) 16:15, 7 July 2022 (UTC)
- Ah, I see what you're saying. I'll check into it. ArthurPSmith (talk) 16:29, 7 July 2022 (UTC)
- OrcBot has a lot of edits adding authors without removing the author name string. If you find an article where this has happened, you’re very likely able to find that author appears in other articles with their presence as an author AND an author name string. I’m on board to help with a more effective cleanup of this bot’s good faith but certainly incomplete work. Trilotat (talk) 14:37, 8 July 2022 (UTC)
Hey, Wikidata:Property proposal/title match pattern was marked ready two weeks ago. Is there something I can do to get this created? Do you have any more questions? – Shisma (talk) 06:30, 28 July 2022 (UTC)
- @Shisma: Could you respond to the comment from Dhx1 there about the label? That may seem an unresolved issue for property creators looking at this. ArthurPSmith (talk) 13:44, 28 July 2022 (UTC)
- I agreed with Dhx1 from the start. I just thought the label could be changed at any time 😂. – Shisma (talk) 15:24, 28 July 2022 (UTC)
- another month has passed. Can I do something? Shisma (talk) 16:18, 26 August 2022 (UTC)
- @Shisma: I created it but it's bare-bones - can you add the examples and constraints? ArthurPSmith (talk) 16:57, 26 August 2022 (UTC)
- Sure — Shisma (talk) 19:14, 26 August 2022 (UTC)
- Could you please review my changes at web page title extract pattern (P10999). Thanks – Shisma (talk) 19:56, 26 August 2022 (UTC)
- Looks good. I fixed a couple of minor issues. I don't think the formatter URL will actually do anything (at least not with the regular Wikidata UI - it's applied only on external id's, not strings). Otherwise it seems fine. ArthurPSmith (talk) 20:05, 26 August 2022 (UTC)
off talkpage logging?
As you know, I am working on the isotopes. To be open to those interested, I am logging at talk:WP Chemistry#Charting the isotopes. But this could be too tiresome, and ineffective, for those following that Project page. Is there an option to do the logging elsewhere, while still posting the main issues (eg modeling proposals) at WDtalk:WP Chemistry? Could be a dedicated projecttalk subpage, or a page in my userspace. Ideas? -DePiep (talk) 08:17, 2 August 2022 (UTC)
- @DePiep: Either a dedicated project subpage or a userspace page would be fine - userspace probably better if you expect to be the only one editing it, project subpage if you're inviting or encouraging others to work on it too. ArthurPSmith (talk) 13:20, 2 August 2022 (UTC)
GRID ID
My mistake Palapa (talk) 21:01, 2 August 2022 (UTC)
Hello! This Wikidata property proposal page is currently in a difficult situation - the pages are no longer displayed, and there has been no consensus on 11 properties for 1.5 years. Could you solve this problem? MasterRus21thCentury (talk) 13:59, 4 August 2022 (UTC)
- @MasterRus21thCentury: I am not personally knowledgeable about sports at all so I haven't worked on those and don't really think I should. Also I'm not sure what you mean by "the pages are no longer displayed" - when I follow the link to the Sports property proposal page, all the proposals seem to be listed? ArthurPSmith (talk) 18:18, 4 August 2022 (UTC)
- I have a specific list about this and something needs to be done with it.
- Need to create:
- Demand immediate summing up due to lack of consensus since January 2021:
- Wikidata:Property proposal/two-pointers made
- Wikidata:Property proposal/two-pointers attempted
- Wikidata:Property proposal/three-pointers made
- Wikidata:Property proposal/three-pointers attempted
- Wikidata:Property proposal/free throws made
- Wikidata:Property proposal/free throws attempted
- Wikidata:Property proposal/field goals made
- Wikidata:Property proposal/personal fouls
- Wikidata:Property proposal/offensive rebounds
- Wikidata:Property proposal/total rebounds
- Need approval:
- Wikidata:Property proposal/National Olympic Committee of the Republic of Kazakhstan ID
- Wikidata:Property proposal/National Olympic Committee of Azerbaijan ID
- Wikidata:Property proposal/Belgian Olympic Committee ID
- Wikidata:Property proposal/Olympic Federation of Ireland ID
- Wikidata:Property proposal/Russian Football Union player ID
- Wikidata:Property proposal/All-Russian Sambo Federation ID
- Wikidata:Property proposal/Serbian Olympic Committee athlete ID (New)
- Wikidata:Property proposal/Singapore National Olympic Council athlete ID
- Wikidata:Property proposal/NOCNSF athlete ID
- Wikidata:Property proposal/British Paralympic Association athlete ID
- Wikidata:Property proposal/Canadian Paralympic Committee athlete ID
- Wikidata:Property proposal/Paralympics Australia athlete ID
- Wikidata:Property proposal/Paralympics New Zealand athlete ID
- This is because I've increased activity on this thread lately and Wikidata members aren't upvoting like they've been in the past months. However, the queue may continue to grow, which will be very unpleasant for the community. MasterRus21thCentury (talk) 18:40, 4 August 2022 (UTC)
- @MasterRus21thCentury: I see you've also commented on Project Chat - that would be the best place to recruit some people to help with this. But you can go ahead and retire the "no consensus" ones yourself; see my comments on Project Chat for how to do that. ArthurPSmith (talk) 19:06, 4 August 2022 (UTC)
- @ArthurPSmith: So I can close threads that don't have consensus without having the property creator flag? MasterRus21thCentury (talk) 18:41, 5 August 2022 (UTC)
- Don't you have that flag? Technically it's certainly not required to have the flag to close proposals, but I guess it's better for people with the flag to do it to avoid controversy here. ArthurPSmith (talk) 19:51, 5 August 2022 (UTC)
- @ArthurPSmith Yes, I was stripped of my flag three months ago as a result of a discussion on the admin forum. I was advised to reapply in six months, that is, in November. However, we need to strengthen action on Wikidata properties. MasterRus21thCentury (talk) 08:07, 7 August 2022 (UTC)
Properties ready
Hello! Would you be able to queue these properties for creation?
- Wikidata:Property proposal/Real Time IDs
- Wikidata:Property proposal/Knowledge portal ID
- Wikidata:Property proposal/Belarus in persons and events ID
- Wikidata:Property proposal/Yarus feed ID
- Wikidata:Property proposal/YAPPY profile ID
- Wikidata:Property proposal/CPRF person ID
- Wikidata:Property proposal/100.histrf.ru ID
- Wikidata:Property proposal/Business Online ID
- Wikidata:Property proposal/Odnoklassniki artist ID
- Wikidata:Property proposal/Odnoklassniki album ID
- Wikidata:Property proposal/Stihi.ru author ID
- Wikidata:Property proposal/Mariinsky Theatre person ID
- Wikidata:Property proposal/Culture.ru institutes ID
- Wikidata:Property proposal/Moscow Cultural Heritage ID
- Wikidata:Property proposal/Shooting Union of Russia person ID
- Wikidata:Property proposal/Russian Trampoline Federation ID
- Wikidata:Property proposal/Freestyle Federation of Russia ID
- Wikidata:Property proposal/Federation of Ski-Jumping and Nordic Combined of Russia ID
- Wikidata:Property proposal/Football 24 article ID
- Wikidata:Property proposal/Match TV people ID
- Wikidata:Property proposal/National Olympic Committee of the Republic of Kazakhstan ID
- Wikidata:Property proposal/AFC player ID
- Wikidata:Property proposal/izsambo.ru person ID
- Wikidata:Property proposal/Rugby Union of Russia athlete ID
- Wikidata:Property proposal/wrestdag.ru person ID
- Wikidata:Property proposal/Climbing Federation of Russia athlete ID
- Wikidata:Property proposal/Turkish Paralympic Committee athlete ID
- Wikidata:Property proposal/European Hockey Federation ID
- Wikidata:Property proposal/Hockey New Zealand ID
- Wikidata:Property proposal/South African Hockey Association ID
- Wikidata:Property proposal/Hockey Australia ID
- Wikidata:Property proposal/Field Hockey Canada ID
- Wikidata:Property proposal/Moroccan Olympic Committee ID
- Wikidata:Property proposal/UCI rider ID
- Wikidata:Property proposal/Polish Paralympic Committee ID
- Wikidata:Property proposal/Belgian Paralympic Committee ID
- Wikidata:Property proposal/Planeta Belarus sigh ID
MasterRus21thCentury (talk) 10:09, 22 August 2022 (UTC)
- @MasterRus21thCentury: I took a look and ok'd some. However, as I've mentioned before, I have no expertise regarding sports, which is what most of these are, and (1) I look for at least one other supporting person (aside from me and proposer) when setting to "ready" status, and many of these seem to have no other supporters at all, and (2) there have been some complaints about non-English descriptions - I don't personally have a strong feeling on that and I guess I could add an English translation of your Russian descriptions if I had the time, but I don't right now; if you can add English descriptions where missing then setting them to "ready" would be straightforward after that. ArthurPSmith (talk) 14:07, 22 August 2022 (UTC)
- @ArthurPSmith: Currently, I have put English descriptions everywhere. Therefore, you can continue your work on setting properties for creation. MasterRus21thCentury (talk) 17:17, 22 August 2022 (UTC)
Cancel a batch in authordisambiguator
Hi,
This batch is full of mistakes: https://author-disambiguator.toolforge.org/batches_oauth.php?id=f802b40f
Is it possible to revert an entire batch? Simon Villeneuve (talk) 23:38, 31 August 2022 (UTC)
- @Simon Villeneuve: Hi - I happened to catch it as it was happening, and I believe I've fixed all the problems (I've also been working on ATLAS author lists). Maybe you can check to confirm it looks ok now? In general though yes it is possible to revert a batch by going to the "details" link in the edit summary for one of the edits - that takes you to the "edit groups" app, which then has an option to "Undo entire group". ArthurPSmith (talk) 15:16, 1 September 2022 (UTC)
- Ok, thank you. It seems ok now.
How the hell did you do that? Simon Villeneuve (talk) 15:44, 1 September 2022 (UTC)
- @Simon Villeneuve: Good question! I've been playing with the "lists" feature - I think you should be able to see https://author-disambiguator.toolforge.org/author_lists.php?list_id=55 ? Then for an individual work I can try to match the authors with that list. However, it's a bit time-consuming as there are lots of cases of duplicate or ambiguous name strings (for ATLAS in 2017 I have 41 duplicates to be checked - still 41 out of 2900 or so is pretty good). So it's coming from the angle of replacing lots of author name strings on a single manuscript, rather than doing it on lots of manuscripts for a single author. I think this way will end up being somewhat faster in the end, but it's still not super fast. ArthurPSmith (talk) 15:59, 1 September 2022 (UTC)
- Another big advantage of this is having considerably fewer historical versions for each article item.
I'll let you work on ATLAS authors and focus on CMS. Let me know if you want to work on these too. Simon Villeneuve (talk) 17:47, 1 September 2022 (UTC)
- I haven't attempted CMS yet. I don't know if it'll be easier or harder - CMS groups the author list by affiliation, so hopefully it's clearer when you have duplicate name strings... Anyway, no plans to do CMS for a long time yet! Thanks for all you've done already, it's been a big help for ATLAS! ArthurPSmith (talk) 17:51, 1 September 2022 (UTC)
The English noun "in"
This question probably belongs in the Lexicographic project, but as I'm still a bit uncertain what I'm looking at, and I find your username in the edit histories, I'd like to begin by asking you what in (L322367) is all about. The English word "in" as a noun? How come? It was created in October of 2020, but you created the preposition in (L2987) yourself already in 2018.
I have been looking at the adpositions and tried to figure out how to define their senses; please take a look at User:SM5POR/Languages for a view of my approach to the problem. The other day I stumbled upon Q90219924, the practical purpose of which I don't understand, as it appears to represent the spatial relation of "in/within" and "into" (it's mapped to both senses) for English as well as Russian.
According to the Concise Oxford Dictionary mentioned in my references, the word "in" exists as a preposition with around 23 different senses, as an adverb with 14, and as an adjective with three. But a noun? As English is not my native language, I hope you can explain this to me. SM5POR (talk) 08:33, 26 September 2022 (UTC)
- @SM5POR: Huh, an effect of several people using bots or scripts without looking carefully at what they are doing. Nevertheless, the reference to "WordData" allowed me to trace the source, which is the use of "in" as an abbreviation for the length unit "inch". It does appear that way (singular and plural) in written technical language, so I guess it's legitimate. But maybe there's a better way to represent this. I replaced the incorrect sense with a corrected one. ArthurPSmith (talk) 15:45, 26 September 2022 (UTC)
- Thanks, that cleared things up for me. To further clarify the sense, I added an item for this sense (P5137) link to inch (Q218593), just like the one for inch (L10963). Your solution looks fine to me.
- What may be missing is a direct link between the two lexemes, indicating that one is an abbreviation of the other. For Main namespace items there is the property unit symbol (P5061) which spells out an abbreviation of the subject unit, but as its data type is a monolingual string, it can't be reused for linking to a lexeme sense. Maybe a sense-to-sense property could be defined to serve this purpose?
- I actually dislike the proliferation of language-specific properties in the Main namespace, as it makes the item display difficult to read through massive use of the language of work or name (P407) qualifier, and I would like to see all those claims transferred to the lexeme namespace instead.
- I don't use robots myself, precisely because I don't want to risk messing up thousands of items without seeing how they are actually used. I even think there is something fundamentally wrong with your database design approach if it's versatile enough to allow representing any kind of human knowledge, yet uniform enough that you can make robots perform millions of edits automatically. SM5POR (talk) 07:16, 27 September 2022 (UTC)
- @SM5POR: Thanks for the edits and ideas. Yes I don't use bots very much and largely share your philosophy there (most of my lexeme edits have been one by one). However, I do see their usefulness when importing or correlating info in Wikidata with external databases, I don't think that's a database design issue in that case. ArthurPSmith (talk) 12:52, 27 September 2022 (UTC)
- I must admit I have a tendency to generalize just to make a point, so please let me qualify my statement about robots a bit. Robots are ok to do really trivial things, such as searching the database for instances of a particular error, or even fixing some of them in items when all their properties are defined according to the expectations of the robot maintainer. My issue is with an overall design that in effect depends on systematic use of robots for database development and regular maintenance.
- You may remember the discussion on time zone representation two years ago, in which both of us took part? I wasn't convinced that adding individual located in time zone (P421) claims for millions of localities when there are only some 30 time zones in the world, most of them geographically contiguous, is a sensible solution even in the short-term perspective.
- As I understood it, current infobox implementations are ill-prepared to apply the heuristics to make use of property inheritance and transitivity. But they shouldn't! They should rely on standardized property retrieval library routines, rather than encode the rules of transitivity in each and every infobox. This is the design approach I'm concerned about.
- The aforementioned discussion therefore inspired me to start coding in Lua, to see if I could somehow help improving on the situation. I haven't written much code yet, not even a working prototype, but you may want to take a look at my initial attempts at documentation to see what I'm aiming at.
- Have I overlooked something, do you think (besides embarking on a project that will probably not be completed in my lifetime)? SM5POR (talk) 07:36, 28 September 2022 (UTC)
- Good luck! I haven't ventured into Lua programming myself but I can see the need. Maybe wikifunctions will be a new way to do this (better)? ArthurPSmith (talk) 21:13, 28 September 2022 (UTC)
Kvant-Telecom
Dear Arthur,
Quanttelecom LLC (Q115769956) is a very generic business name. https://kvant-telecom.ru/ seems to be much larger (AS43727) and bears the same name.
Kind regards Perkerdansk (talk) 11:58, 20 December 2022 (UTC)
- @Perkerdansk: Hmm, I can see the confusion. I updated the main label with the one from their English web page here; this does seem to be the name of this company. We don't seem to have an item for https://kvant-telecom.ru/ so it probably should be added also. ArthurPSmith (talk) 13:48, 20 December 2022 (UTC)
Wikidata external URL redirector - support for Statistics Canada Geographic code (P3012)
Is it possible that you could add P3012 support in your URL redirector? I am trying to expand its functionality but am running into limitations in making this work. I have detailed how I would like to see linking from P3012 values implemented here. -- Denelson83 (talk) 00:28, 6 January 2023 (UTC)
- @Denelson83: I could set up something based on the id length, but I don't see how to include language code as that's not a property of the ID but of the user. I'm not sure what you have in mind here. ArthurPSmith (talk) 15:00, 6 January 2023 (UTC)
- At this point, we will have to just default to the English version. And the ID length should suffice as a differentiator in this case. In this table, the ID is bolded:
-- Denelson83 (talk) 20:18, 6 January 2023 (UTC)
- @Denelson83: Ok, this is implemented and I updated the formatter URL on P3012 to use this. ArthurPSmith (talk) 17:18, 9 January 2023 (UTC)
- Good stuff. Thank you. -- Denelson83 (talk) 17:22, 9 January 2023 (UTC)
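For readers following along, the idea of choosing a target URL from the ID length alone could be sketched roughly as below. This is only an illustration and not the redirector's actual code: the URL templates, the `example.org` host, and the specific lengths in the table are all assumptions, since the table from the discussion is not reproduced here.

```python
from typing import Optional
from urllib.parse import quote

# Hypothetical templates keyed by ID length. The real Statistics Canada
# URL patterns and the lengths that matter are assumptions for this sketch.
TEMPLATES_BY_LENGTH = {
    2: "https://example.org/province/{id}?lang=en",
    4: "https://example.org/division/{id}?lang=en",
    7: "https://example.org/subdivision/{id}?lang=en",
}


def redirect_target(identifier: str) -> Optional[str]:
    """Pick a URL template from the ID length; default language is English."""
    template = TEMPLATES_BY_LENGTH.get(len(identifier))
    if template is None:
        # Unknown length: the caller decides how to report this.
        return None
    return template.format(id=quote(identifier, safe=""))


print(redirect_target("35"))
print(redirect_target("123"))
```

The language is fixed in the template because, as noted above, the language is a property of the user rather than of the ID.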
Maud Olofsson "bad merge"
Hello, you reverted my merge of Maud Olofsson (Q253640) and Maud Olofsson (Q100348520). I spent quite some time trying to find any indications that A roadmap for the implementation of mHealth innovations for image-based diagnostic support in clinical and public-health settings: a focus on front-line health workers and health-system organizations. (Q38368576) was written by anyone other than the former Swedish deputy prime minister, Maud Olofsson (Q253640). The article is the outcome of a round table discussion that took place during an event organized at STIAS with people from Sweden and South Africa. STIAS have on their own websites documented the former politician's participation in workshops and roundtables during the 2010s. The booklet for the roundtable discussion that is described in the article lists Maud Olofsson as "Wallenberg Foundation Ambassador" with an e-mail address belonging to a company (Romo Norr) where the former politician is a board member. An article published in one of Sweden's largest newspapers three days before the event took place mentions both that the company in question is the former politician's own and that she and her husband will land in South Africa the same day to participate in a round table discussion in Stellenbosch. I find it highly unlikely that a namesake of the politician would be listed as an author of an article that is the outcome of the very same roundtable the politician was part of.
So I would say there are enough sources to disprove that my merge was bad in any way. (One might also argue that stating occupation (P106) as researcher (Q1650915) might be a bit of a stretch as well.)
- [5] Former Swedish Deputy Prime Minister Maud Olofsson visits STIAS
- [6] Investment and technology choices in the SA power sector: economic, social and environmental trade-offs
Moonhouse (talk) 17:57, 23 January 2023 (UTC)
- @Moonhouse: Ok, thanks for looking into this - but the paper does list her affiliation as Stellenbosch (I added that as a reference after unmerging). If you can add your sources appropriately on the item or perhaps on the talk page for Maud Olofsson (Q253640) that would probably be helpful. But if you are sure yes go ahead and re-merge them then. ArthurPSmith (talk) 21:26, 23 January 2023 (UTC)
- I don't see how the given affiliation directly contradicts that the two objects should be merged since the identity and the affiliation are two different questions. Whether or not she was affiliated directly with Stellenbosch (as per the article) or in the capacity of an ambassador to/representative of Marianne and Marcus Wallenberg Foundation (as the conference booklet as well as presentation of core organising committee state) I would say it is safe to assume that the two objects describe the same physical person. The aforementioned booklet and presentation of the core organising committee all include photos of Maud Olofsson (Q253640) and all the other authors of the article.
- When it comes to the affiliation I can't say it is as clear-cut. She is either described as affiliated with both Stellenbosch and Wallenberg Foundation, as the article indicates, or just tasked by the latter to assist the former (as described in e.g. "The STIAS concept of a Roundtable").
- So I would suggest that I merge the two objects and then add affiliation (P1416) (since I can find no proof in the article or elsewhere that she was in fact employed by the university) set to Stellenbosch Institute for Advanced Study (Q55829926) (with, at least, the article as a reference) and also add Marianne and Marcus Wallenberg Foundation (Q10577257) as an affiliation (supported by other documents as reference). -- Moonhouse (talk) 23:16, 23 January 2023 (UTC)
- @Moonhouse: Ok, yes, that sounds great, please go ahead with that then. Thanks. ArthurPSmith (talk) 14:14, 24 January 2023 (UTC)
Property proposal
Hi there,
couldn't we restart Wikidata:Property proposal/Geneanet genealogist ID? Taking care of adding more examples, and pinging related projects. Thanks, 2A01:CB1D:8CC3:6500:2D8E:560:2681:4BAD 06:47, 2 June 2023 (UTC)
- Best to create a new proposal from scratch rather than reopen a very old one. That way it shows up at the top of lists for people who track those things. ArthurPSmith (talk) 19:05, 2 June 2023 (UTC)
P11439
Thank you for your (encouraging) comment! But what am I to do exactly? May I change the property formatter URL by myself...?
Best regards, 2A01:CB1D:8CC3:6500:2D8E:560:2681:4BAD 19:56, 2 June 2023 (UTC)
- Just make an exact copy of the old proposal template, add a "2" to the page title, and make a note of whatever other things you have changed such as more examples. ArthurPSmith (talk) 20:34, 2 June 2023 (UTC)
- But is it really necessary to make a new proposal? It would be the same project, with just a more accurate formatter URL. (And moreover, everyone already seems to agree...) 92.184.112.49 05:12, 3 June 2023 (UTC)
- ? 92.184.112.216 10:53, 5 June 2023 (UTC)
- It's not necessary to create a new one, but it helps make it more noticeable to people who check property proposal lists. Up to you what to do. ArthurPSmith (talk) 13:25, 5 June 2023 (UTC)
- Sorry for my confusion, I had somehow thought you were the same anonymous user as the one just previous and had merged the two conversations. Yes of course you can just change the formatter URL. ArthurPSmith (talk) 14:14, 6 June 2023 (UTC)
Hi there. Am I wrong, or does it still not work? 92.184.97.120 14:47, 14 June 2023 (UTC)
- Sorry I've been dealing with other things recently. You have to actually test the URL before you put it in there! I had assumed that was done - I've replaced the formatter URL with one that at least has the id set and the url prefix url-encoded but again you'll need to wait a bit for it to take effect. It's always best to test it out beforehand by copying the full URL you are proposing and replacing the $1 with a url-encoded string of the id you want to use. ArthurPSmith (talk) 23:39, 17 June 2023 (UTC)
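As a rough illustration of the check described above, the following sketch substitutes a URL-encoded ID for the `$1` placeholder so the resulting address can be pasted into a browser before the formatter URL is saved. The formatter URL and the ID shown are made-up placeholders, not the real values for P11439.

```python
from urllib.parse import quote


def expand_formatter_url(formatter_url: str, identifier: str) -> str:
    """Replace the $1 placeholder with the URL-encoded identifier."""
    return formatter_url.replace("$1", quote(identifier, safe=""))


# Hypothetical formatter URL and ID, for illustration only.
test_url = expand_formatter_url("https://example.org/view?id=$1", "a b/c")
print(test_url)
```

Passing `safe=""` makes `quote` encode slashes as well, which matters when the ID itself contains characters that are meaningful in a URL.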
Hi Arthur,
I just wanted to begin using your great tool. Since the main option is overloaded I had to use OAuth. After completing the permission process at mediawiki.org I was brought back. There, an error always arises:
Error retrieving token: mwoauthdatastore-request-token-not-found U. M. Owen (talk) 18:36, 13 June 2023 (UTC)
- @U. M. Owen: Can you use other tools that require mediawiki OAuth (for example Quickstatements)? If you start from here what do you see right now? ArthurPSmith (talk) 19:01, 13 June 2023 (UTC)
- Now it works perfectly well, albeit slow. A little bit of patience would have saved me from bothering you.
- Is the tool only finding candidates which have P106:Q1650915 ? After creation of Polly Carmichael (Q119457161) it still wasn't found as Potential author item (not a real issue for me).
- A first name-to-gender guesser would be great, maybe with an extra checkbox that users are responsible for correctness.--U. M. Owen (talk) 20:03, 13 June 2023 (UTC)
- The only restriction on candidates for matches, other than name matching, is P31:Q5 (or subclass of organization for corporate authors). However there may be some delay in matching new items since it relies on the query service and it can take several minutes for new things to show up there. On gender guessing - that's a bit out of scope for this; there are tools for that online which I think work best if you can also provide nationality or related info. ArthurPSmith (talk) 20:36, 13 June 2023 (UTC)
2nd question
When searching for "Zhi-Ping Zhong" the results are getting overwhelmed by Zhi Zhong. How may I explicitly search for the former?--U. M. Owen (talk) 16:44, 15 June 2023 (UTC)
- @U. M. Owen: Check the box that says "Specify name strings", then remove "Zhi Zhong" and any others you don't want from the list. Another option is to use the "additional SPARQL filter" where you can specify exactly the string you want for the name by entering:
wdt:P2093 "Zhi-Ping Zhong"
- hope that helps! ArthurPSmith (talk) 23:44, 17 June 2023 (UTC)
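To show how such an exact-match pattern behaves outside the tool, here is a small sketch that builds a standalone SPARQL query around the same `wdt:P2093` triple. It is not the query the author disambiguator itself generates; the function name, the `?work` variable, and the `LIMIT` are assumptions made for illustration.

```python
def exact_name_query(name: str, limit: int = 50) -> str:
    """Build a SPARQL query matching one exact author name string (P2093)."""
    # Escape backslashes and double quotes so the name is a valid literal.
    escaped = name.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "SELECT ?work WHERE {\n"
        f'  ?work wdt:P2093 "{escaped}" .\n'
        "}\n"
        f"LIMIT {limit}"
    )


print(exact_name_query("Zhi-Ping Zhong"))
```

Because the literal is compared exactly, "Zhi Zhong" would not match this pattern, which is the point of using the filter.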
Patents
Would it be possible to extend the author disambiguator to patents like Synthesis of speech from text in a voice of a target speaker using neural networks (Q118923495)?-- U. M. Owen (talk) 14:38, 16 June 2023 (UTC)
- It should work fine for those - the problem there is that the "author name strings" include affiliation data as well, so they're not just author names, and aren't matching. But author disambiguation can work with anything that has a P2093 property you want to replace with P50. ArthurPSmith (talk) 23:45, 17 June 2023 (UTC)
Property proposals
Hey! Is there anything that blocks a real creation of properties for Wikidata:Property proposal/Lithuanian protected object ID, Wikidata:Property proposal/Saugoma.lt area ID and Wikidata:Property proposal/Saugoma.lt object ID proposals? Looks like they got stuck in the "ready" status. --putnik 19:38, 26 June 2023 (UTC)
- @putnik: Lack of time from property creators. If I get a chance this week I'll take a look. ArthurPSmith (talk) 14:13, 27 June 2023 (UTC)
authordisambiguator
Hi,
I have had some problems using authordisambiguator for a couple of days. Am I the only one? Simon Villeneuve (talk) 13:26, 13 September 2023 (UTC)
- @Simon Villeneuve: Huh, I noticed it timing out a bit the last few days but I haven't been doing much with it. I'll take a look. ArthurPSmith (talk) 18:50, 14 September 2023 (UTC)
- Seems ok now. Thank you.
Another problem: the "undo entire group" link here doesn't work. Simon Villeneuve (talk) 14:53, 19 September 2023 (UTC)
- I also have difficulty stopping batches. The "stop batch" link sometimes works, sometimes doesn't. Simon Villeneuve (talk) 15:08, 19 September 2023 (UTC)
- @Simon Villeneuve: Edit Groups should work, I've definitely used it in the past, but that's not something I maintain so I don't know what's going on there. And yes "stop batch" has issues now - it's because there are 4 separate containers where these run and it only works if the stop command goes to the same container that the job is actually running on. I should probably work on fixing that... ArthurPSmith (talk) 15:26, 19 September 2023 (UTC)
- Ok. Let me know if it comes back on track.
I have another problem: the Canadian Encyclopedia ID only points to the English version of the site. The French ID doesn't redirect to the French page anymore. I've seen that you developed the https://wikidata-externalid-url.toolforge.org/ tool. Can you add to it a parameter that automatically selects the P407 qualifier of an item ID? I asked about this here: Property_talk:P1630#How_to_add_the_formatter_URL_if_two_different_language_versions_have_different_url?. Simon Villeneuve (talk) 13:07, 24 September 2023 (UTC)
Could you do me a favor? Please participate in this discussion. ChoKukSuhoTemp (talk) 13:12, 1 October 2023 (UTC)
OpenRefine question
Hi Arthur, I just started using OpenRefine after watching your amazing demo from 2017. I have two quick questions which I hope will not require a too involved answer, i.e. waste too much of your time:
- Can I add rows of data to my dataset after importing it from a file? Or do I have to scrap my project and begin a new one?
- How is progress saved? I run this program from my laptop, and I hope the matches I have made will not be gone when I log off sometime tonight.
Best, Jonathan Groß (talk) 19:38, 7 October 2023 (UTC)
- @Jonathan Groß: I'm not sure if there's a way to add rows after importing - I am guessing there is but it's not something I've done. I just start lots of separate projects generally, each time I have a file I want to work on. You can save the changes you've made to the underlying data by using the "Extract" function on the Undo/Redo tab (on the left) and then load that into the next project if it's very similar data and you want to do the same things; I also typically export the matches I've made as I work on it in case I lose my progress (though this rarely happens). Do that from the export button on the top right. OpenRefine does save what you've done in the project so that when you stop it and restart it you will be able to continue from where you were; however it does not save the state of your facet/filter operations, so you would have to re-do those. Hope this makes sense! ArthurPSmith (talk) 02:41, 8 October 2023 (UTC)
Data modelling days 2023
Hi. Saw you had some cool ideas for talks. Will there be videos of the talks or only shown live?
Regarding references, it could be useful to mention that if two references contain exactly the same content, the same reference node will be reused. I suppose it was made this way to save space, but it only works if they are identical. It would have been nice if the Wikibase UI would allow to edit these linked, so that the reference is updated everywhere it is used, currently it doesn't do this.
It should also be possible to use "stated in" with a Q-item as a reference, but this is only really suitable for primary references like research papers and books; it doesn't work so well for newspaper articles. Thanks to Mike Peel's work on CiteQ and importing tons of research paper items, it's also easy to reference these on Wikipedia now. Infrastruktur (talk) 15:29, 11 October 2023 (UTC)
- @Infrastruktur: When you say "the same reference node will be reused" do you mean in the RDF export (and query service)? That would be good. However it seems to me this is not true in the wikidata item stored in the database - for example Searches for Gravitational Waves from Known Pulsars at Two Harmonics in the Second and Third LIGO-Virgo Observing Runs (Q113625041) contains the same reference item "stated in Astrophysics Data System ..." over a thousand times; I've removed some of those references (for example this edit) and that reduced the db size considerably - each reference entry was about 1000 bytes of storage. ArthurPSmith (talk) 19:23, 11 October 2023 (UTC)
- Interesting. Yeah, unfortunately it seems the identical reference gets repeated when stored in the JSON blob in Wikibase. But if you pick out the reference's hash value from the blob and query it you can plainly see there is only one object for the reference in the triplestore.
- Try it!
describe wdref:8776d8ae80dab65db64c0459f0f70ee2169035f6 { hint:Query hint:describeMode "CBD" }
- Value nodes seem to work the same way.
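A quick way to see how often an identical reference is repeated inside an item's JSON is to count the reference hashes, roughly as in the sketch below. It assumes the standard Wikibase entity JSON layout (`claims` keyed by property, each statement carrying a `references` list whose entries have a `hash`); the sample data and the hash values in it are invented for illustration.

```python
from collections import Counter


def count_reference_hashes(entity: dict) -> Counter:
    """Count how many times each reference hash appears in an entity's claims."""
    counts: Counter = Counter()
    for statements in entity.get("claims", {}).values():
        for statement in statements:
            # Statements without references simply contribute nothing.
            for reference in statement.get("references", []):
                counts[reference["hash"]] += 1
    return counts


# Invented sample: two statements share one reference, a third has its own.
sample_entity = {
    "claims": {
        "P2093": [
            {"references": [{"hash": "abc"}]},
            {"references": [{"hash": "abc"}]},
        ],
        "P50": [
            {"references": [{"hash": "def"}]},
            {},
        ],
    }
}

hash_counts = count_reference_hashes(sample_entity)
print(hash_counts.most_common())
```

A hash with a high count here is stored once per use in the JSON blob, while in the triplestore it would correspond to a single reference node, which is the difference discussed above.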
wikidata-externalid-url
Hi, the tool seems to be down: https://wikidata-externalid-url.toolforge.org/?p=345&url_prefix=https://www.imdb.com/&id=tt0030951. Could you please have a look? Thanks. Ayack (talk) 07:29, 26 October 2023 (UTC)
- @Ayack: Huh, not sure what happened, I just restarted it, seems ok now. ArthurPSmith (talk) 18:45, 26 October 2023 (UTC)
- Yes, it's working now. Thanks! Ayack (talk) 10:59, 27 October 2023 (UTC)
Wikidata:Property proposal/Cento Pittori via Margutta ID
Hi, I saw that you had created an ID on the Enciclopedia d'Arte Italiana. I would like to ask whether you could help in creating an ID for this very well-known art exhibition organisation, founded in Rome, Italy, in 1953. The artists are always Cento (100), and some are also mentioned in the Getty Vocabulary, where they are related to the association. proposal/Cento Pittori via Margutta (Q98832027). Thanks. Maxxinelli01 (talk) 07:38, 31 October 2023 (UTC)
- @Maxxinelli01: Hi - you would need to create a property proposal - see Wikidata:Property proposal. From the title here it looked like you had done so but I don't see a proposal with this name, please link it if you did already. ArthurPSmith (talk) 15:28, 31 October 2023 (UTC)
authordisambiguator
Hi,
Is it possible that the "reset error" option of the tool isn't working? I tried it a couple of times without success. Simon Villeneuve (talk) 11:13, 19 January 2024 (UTC)
- @Simon Villeneuve: It seems to work for me - you click on the "Reset errors?" link for a batch and the batch should now show as in "Ready" state, then you can click the "Restart batch?" link to actually run it again. However of course if the error condition is still there it will return to the error state again when it runs... ArthurPSmith (talk) 15:17, 19 January 2024 (UTC)
- It happened again this morning. The error code is "failed-save". It happens when I launch the tool for a particle researcher (example e660dcb6) and, while waiting for it to end, I launch another batch about an astronomer (example e8d769b2) with more than 50 articles. The astronomer articles go fast (they are not as big as those of the particle researchers) and I think that it exceeds the number of edits per second I can make with my account. The failed-save occurs only past the first ~50 articles of the astronomer.
After that, when I tried to reset the failed save, it didn't work. Simon Villeneuve (talk) 12:35, 22 January 2024 (UTC)
- @Simon Villeneuve: Can you give a little more detail on "didn't work"? I.e. did you get an error message when you hit the "Reset errors?" link? If that worked and "Restart batch?" also worked and you ended up with "failed save" again that's probably a problem with the article - there are some "failed save" cases that will recur repeatedly due to some issue with the wikidata entry for the article. For example if an author has 2 "series ordinal" values. ArthurPSmith (talk) 16:02, 22 January 2024 (UTC)
- It just stays stuck at "ready", reloading every x seconds and staying at "ready". It did that for 2acdd162, and after that I launched 3ecd5e01, which has saved all its items, with 2acdd162 still being stuck. Simon Villeneuve (talk) 02:30, 23 January 2024 (UTC)
- @Simon Villeneuve: When you go to https://author-disambiguator.toolforge.org/batches_oauth.php do you see the "Restart batch?" link for the batch? ArthurPSmith (talk) 17:36, 23 January 2024 (UTC)
- Yes, and it works from there! Can you copy this link to the batch concerned? Simon Villeneuve (talk) 11:42, 25 January 2024 (UTC)
Another thing: can the tool convert non-romanized letters? As an example, for Aleš Bezděk (Q112392511), I get no results if I use "Ales Bezdek", and plenty of results if I use "Aleš Bezděk". Simon Villeneuve (talk) 12:04, 25 January 2024 (UTC)
P.S.: Same thing for Olivier Bienaymé (Q102715089) (Bienayme). Simon Villeneuve (talk) 02:37, 28 January 2024 (UTC)
- @Simon Villeneuve: On copying the link - great idea but there was some reason I thought it wouldn't work, I'll have to dig into it. Can't remember what the problem was...
- Regarding non-romanized letters: the trouble is the code finds works for authors through a SPARQL search and that depends on an exact string match. Going from a non-roman letter to a roman letter generally works, so if you input "Aleš Bezděk" it should match both "Aleš Bezděk" and "Ales Bezdek" because the code is translating the non-roman to roman ones. But it would be infeasible to do that in the other direction - there are too many possible name variants changing every possible letter that could be accented. ArthurPSmith (talk) 18:09, 29 January 2024 (UTC)
- Here's an example: Dzifcakova E. gives 30 results, but Dzifčáková E. gives 0. Simon Villeneuve (talk) 00:25, 25 April 2024 (UTC)
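The one-way conversion described above (accented input also matched in its unaccented form) can be sketched as follows. This is only an illustration using Python's standard `unicodedata` module, not the tool's actual code; the function names `fold_to_ascii` and `search_variants` are hypothetical.

```python
import unicodedata


def fold_to_ascii(name: str) -> str:
    """Drop combining marks so that, for example, 'š' becomes 's'.

    NFKD splits an accented letter into a base letter plus combining
    marks; the combining marks are then filtered out.
    """
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def search_variants(name: str) -> list[str]:
    """Return the name as typed plus its folded form (hypothetical helper).

    Only the accented -> unaccented direction is generated; the reverse
    would require guessing every letter that might carry an accent.
    """
    folded = fold_to_ascii(name)
    return [name] if folded == name else [name, folded]


if __name__ == "__main__":
    for typed in ("Aleš Bezděk", "Ales Bezdek", "Dzifčáková E."):
        print(typed, "->", search_variants(typed))
```

Letters that have no Unicode decomposition (such as "ø" or "ł") pass through this sketch unchanged, so a real implementation would need an extra mapping for them.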
External links to IMDb don't work
Hi, I don't know if something happened to the tool, but whenever I try to follow a link for IMDb ID (P345) I get a page saying that the tool is taking too long to respond. Agabi10 (talk) 09:59, 5 March 2024 (UTC)
- @Agabi10: Thanks for the ping, I've restarted it, it seems ok now. Sorry about that! ArthurPSmith (talk) 14:28, 5 March 2024 (UTC)
Invitation to participate in the WQT UI requirements elicitation online workshop
Dear ArthurPSmith,
I hope you are doing well,
We are a group of researchers from King’s College London working on developing WQT (Wikidata Quality Toolkit), which will support a diverse set of editors in curating and validating Wikidata content.
We are inviting you to participate in an online workshop aimed at understanding the requirements for designing effective and easy-to-use user interfaces (UI) for three tools within WQT that can support the daily activities of Wikidata editors: recommending items to edit based on their personal preferences, finding items that need better references, and generating entity schemas automatically for better item quality.
The main activity during this workshop will be UI mockup sketching. To facilitate this, we encourage you to attend the workshop using a tablet or laptop with PowerPoint installed or any other drawing tools you prefer. This will allow for a more interactive and productive session as we delve into the UI mockup sketching activities.
Participation is completely voluntary. You should only take part if you want to and choosing not to take part will not disadvantage you in any way. However, your cooperation will be valuable for the WQT design. Please note that all data and responses collected during the workshop will be used solely for the purpose of improving the WQT and understanding editor requirements. We will analyze the results in an anonymized form, ensuring your privacy is protected. Personal information will be kept confidential and will be deleted once it has served its purpose in this research.
The online workshop, which will be held on April 5th, should take no more than 3 hours.
If you agree to participate in this workshop, please either contact me at kholoud.alghamdi@kcl.ac.uk or use this form to register your interest https://forms.office.com/e/9mrE8rXZVg Then, I will contact you with all the instructions for the workshop.
For more information about my project, please read this page: https://king-s-knowledge-graph-lab.github.io/WikidataQualityToolkit/
If you have further questions or require more information, don't hesitate to contact me at the email address mentioned above.
Thank you for considering taking part in this project.
Regards Kholoudsaa (talk) 03:29, 19 March 2024 (UTC)
Property P4280 for deletion
Hi! I have just proposed for deletion the property P4280 (P4280) that you created. Please check Wikidata:Properties for deletion/P4280. Horcrux (talk) 18:08, 14 April 2024 (UTC)
A . M. Zhang and author_strings gadget madness
Hi @ArthurPSmith, thanks for reverting the name change for A . M. Zhang. I had no idea I'd done this! I briefly installed the author_strings gadget, but it seemed horribly glitchy, made page loading slow, and sometimes generated a big list of seemingly unrelated authors which would suddenly appear on my page while I was editing. I must have accidentally clicked something which generated a new item for "A . M. Zhang" and then matched that to a number of author strings. Bizarre! Anyway, shortly after installing it I switched it off as it made the editor barely usable. I've looked at my contributions and found the rest of the "A . M. Zhang" changes and reverted them. Sorry for the spurious edits; it's a bit alarming that a gadget can create a bunch of edits seemingly without me knowing. Rdmpage (talk) 12:27, 23 April 2024 (UTC)
- @Rdmpage: Thanks for fixing! That is definitely alarming... ArthurPSmith (talk) 13:27, 23 April 2024 (UTC)
Universities
Hello User:ArthurPSmith, thank you for your interest. I am doing some cleanup of the items about organizations, which are indeed sometimes confused with their sites. Wikidata is a structured database where concepts must be well defined: a university is not a campus, just as a company is not a factory, and vice versa. The Campus adventiste du Salève is a somewhat confusing case because the organization itself bears the name "campus", but that does not change its nature. Regards, Arpyia (talk) 17:23, 3 May 2024 (UTC)
- Bonjour @Arpyia:! I hope you don't mind me staying in English. I agree that a company is not a factory, a university is not a campus, etc. So is your point that Adventist University of France – Collonges (Q2935621) is a campus, not a university? Because right now it says instance of (P31) university (Q3918). While your new Q125753974 says instance of (P31) organization (Q43229). university (Q3918) is a subclass of organization (Q43229) so to me they seem the same. Something here needs to be adjusted. ArthurPSmith (talk) 17:30, 3 May 2024 (UTC)
- You are right. Usually we have items about organisations which get mixed up with properties about campuses, but this one looks to have been intended for the place; that's why I created the item about the organisation. A whole other question would be: what qualifies as a university? I don't think anyone in France would call Campus adventiste du Salève a university. But I won't get into that! Arpyia (talk) 17:40, 3 May 2024 (UTC)
- While you're here, I am trying to include the registration number for all dangerous or polluting facilities in France. This could include some research facilities too. Could you help me here: Wikidata:Property proposal/numéro d'établissement d'une ICPE? Thank you! Arpyia (talk) 08:42, 4 May 2024 (UTC)
- Thank you a lot for your help with that! Arpyia (talk) 13:07, 22 June 2024 (UTC)
- Hi - I'm not clear what help you need? The property has been created - ICPE establishment ID (P12719) and can be used right away! ArthurPSmith (talk) 16:53, 22 June 2024 (UTC)
Dowry in Islam
Hi ArthurPSmith. I discovered this morning that all my additions in the mahr (Q902443) page (Dowry in Islam) have been canceled. In particular, these authority links have been targeted:
- LCAuth: https://id.loc.gov/authorities/sh2022006551
- LCAuth: https://id.loc.gov/authorities/sh85039242
- LCClass: https://id.loc.gov/authorities/classification/BP190.5.D69
- FAST: https://id.worldcat.org/fast/2060623
- FAST: https://id.worldcat.org/fast/897273
- IdRef: https://www.idref.fr/17523244X
- BnF: https://catalogue.bnf.fr/ark:/12148/cb16729423j
- BNE: https://datos.bne.es/resource/XX547746
All the beautiful and patient work that I have brought to this page has been canceled. I am afraid that all the other pages that I have improved and enriched will suffer the same fate. SOS! Soufiyouns (talk) 06:24, 4 July 2024 (UTC)
- Sorry about that - have you talked to the person who reverted your edits? The account appears to be كريم رائد - so you should go to their talk page and ask them why they made the change, or if possible to reverse their removal of your edits. If that doesn't get anywhere then you should bring this up at the Wikidata:Administrators' noticeboard. I have no authority on this myself. ArthurPSmith (talk) 17:30, 8 July 2024 (UTC)
- @ArthurPSmith: Thank you very much for your kind and compassionate response, you can check this user's response in Mike Peel's talk page. Regards. Soufiyouns (talk) 17:36, 8 July 2024 (UTC)
instance and subclass of the same class
QLever can run the general query for this. Could it replace the several queries and pages you have set up? I'm willing to make the changes if you can tell me how to generate the reports.
Here are the top few entries:
?metaclass ?metaclassLabel ?count
Q7187 gene 975340
Q8054 protein 761241
Q4164871 position 99687
Q277338 pseudogene 49392
Q427087 non-coding RNA 45202
Q2996394 biological process 28254
Q14860489 molecular function 11243
Q294414 public office 9909
Q34770 language 7074
Q5058355 cellular component 4198
Q898273 protein domain 2643
Q12136 disease 2270
Q282 wine 2145
Q618779 award 1947
Q929833 rare disease 1919
Q8187769 economic activity 1790
Q55788864 developmental defect during embryogenesis 1720
Q201448 transfer RNA 1153
Q112965645 symptom or sign 1124
Peter F. Patel-Schneider (talk) 12:50, 5 October 2024 (UTC)
- @Peter F. Patel-Schneider: The reports are generated by a Magnus Manske tool called "Listeria" - see Wikidata:Listeria. It is called through the Template:Wikidata list template. I think it would be a great idea to either add qlever support to this tool or to fork it to use qlever instead of WDQS. But you'll have to talk to Magnus or work on the code yourself I think for that to happen. ArthurPSmith (talk) 20:14, 14 October 2024 (UTC)
- Please share the code that led to the above data. International Press Center (talk) 16:12, 23 November 2024 (UTC)
- I think this is the query I used:
- PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
- PREFIX wdt: <http://www.wikidata.org/prop/direct/>
- SELECT DISTINCT ?metaclass ?metaclassLabel (COUNT(DISTINCT ?class) as ?count) WHERE {
- ?class wdt:P31 ?metaclass ;
- wdt:P279+ ?metaclass .
- OPTIONAL { ?metaclass rdfs:label ?metaclassLabel . FILTER ( lang(?metaclassLabel)='en' ) }
- } GROUP BY ?metaclass ?metaclassLabel ORDER BY DESC(?count) Peter F. Patel-Schneider (talk) 17:43, 23 November 2024 (UTC)
- Thank you, that is https://qlever.cs.uni-freiburg.de/wikidata/AVcvMG - it says 2ms for resolving and sending; in my browser it took longer to see a result, maybe longer than 2s. International Press Center (talk) 17:54, 23 November 2024 (UTC)
- With the "+" removed (https://qlever.cs.uni-freiburg.de/wikidata/zQM4BJ):
- ?metaclass ?metaclassLabel ?count
- 1 Q8054 protein 751,644
- 2 Q7187 gene 430,982
- 3 Q277338 pseudogene 49,392
- 4 Q427087 non-coding RNA 44,857
- 5 Q201448 transfer RNA 1,153
- 6 Q284416 small nucleolar RNA 560
- 7 Q898273 protein domain 473
- 8 Q502048 gasoline engine 162
- 9 Q6979593 national association football team 107
- 10 Q618779 award 106
- 11 Q163727 bachelor's degree 95
- Suggest to fix these first. International Press Center (talk) 18:00, 23 November 2024 (UTC)
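The difference between the two result lists above comes down to `wdt:P279+` (one or more subclass steps) versus `wdt:P279` (exactly one step). A minimal sketch on a toy graph, assuming plain Python dictionaries in place of Wikidata; the data and the names `superclasses` and `count_metaclasses` are hypothetical and only illustrate the counting logic:

```python
from collections import Counter


def superclasses(cls, subclass_of, transitive=True):
    """Classes reachable from cls via 'subclass of' (P279).

    transitive=True mimics wdt:P279+ (one or more steps);
    transitive=False mimics a single wdt:P279 step.
    """
    direct = set(subclass_of.get(cls, ()))
    if not transitive:
        return direct
    seen, stack = set(), list(direct)
    while stack:
        current = stack.pop()
        if current not in seen:  # guards against subclass cycles
            seen.add(current)
            stack.extend(subclass_of.get(current, ()))
    return seen


def count_metaclasses(instance_of, subclass_of, transitive=True):
    """Count, per metaclass, the classes that are both an instance of it
    (P31) and a subclass of it (P279 or P279+)."""
    counts = Counter()
    for cls, types in instance_of.items():
        for meta in set(types) & superclasses(cls, subclass_of, transitive):
            counts[meta] += 1
    return counts


# Hypothetical toy data, not taken from Wikidata.
instance_of = {"geneA": ["gene"], "geneB": ["gene"], "proteinX": ["protein"]}
subclass_of = {
    "geneA": ["gene"],
    "geneB": ["protein-coding gene"],
    "protein-coding gene": ["gene"],
    "proteinX": ["protein"],
}

if __name__ == "__main__":
    print(count_metaclasses(instance_of, subclass_of, transitive=True))
    print(count_metaclasses(instance_of, subclass_of, transitive=False))
```

In this toy data "geneB" is only counted in the transitive case, which is the same effect that makes the counts with "+" higher than those without it.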
A.D.
Hi,
Again about AuthorDisambiguator.
Do you know why I can't process the results for R J Laureijs? Simon Villeneuve (talk) 13:52, 3 December 2024 (UTC)
- @Simon Villeneuve: That's weird - it means somehow the name is matching on a search but not when it tries to match on the author list that's returned. I'll have to dig a little deeper to get to the bottom of it. Have you seen any other examples like this? ArthurPSmith (talk) 18:21, 3 December 2024 (UTC)
- Ah, I think I see what it is. You have a hidden unicode character after the s in the name. Cut and paste the name from one of the Wikidata author name string values and it should work. ArthurPSmith (talk) 18:24, 3 December 2024 (UTC)
- I remember another example like this, but I can't find it. Strange, this hidden unicode. Thank you; now I'll know that this case exists. Simon Villeneuve (talk) 20:30, 3 December 2024 (UTC)
- It would probably be helpful if there was some clear way to display such characters to warn people... I'll have to look into it. ArthurPSmith (talk) 21:29, 3 December 2024 (UTC)
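One possible way to display such characters, sketched with Python's standard `unicodedata` module. This is not the tool's code; the function names `reveal_hidden` and `has_hidden` are hypothetical, and the zero-width space in the example is an assumption, since the thread does not say which character was actually present.

```python
import unicodedata


def reveal_hidden(text: str) -> str:
    """Replace invisible or unusual characters with a visible <U+XXXX> tag.

    Flags format characters (category Cf, e.g. zero-width space),
    control characters (Cc), and space separators other than the
    ordinary space (Zs, e.g. no-break space).
    """
    pieces = []
    for ch in text:
        category = unicodedata.category(ch)
        if category in ("Cf", "Cc") or (category == "Zs" and ch != " "):
            pieces.append(f"<U+{ord(ch):04X}>")
        else:
            pieces.append(ch)
    return "".join(pieces)


def has_hidden(text: str) -> bool:
    """True if reveal_hidden would change the text."""
    return reveal_hidden(text) != text


if __name__ == "__main__":
    # Assumed example: a zero-width space pasted after the final "s".
    pasted = "R J Laureijs\u200b"
    print(has_hidden(pasted), reveal_hidden(pasted))
```

A warning built on a check like this would let a user see that a pasted name differs from the string stored in the author name string values, even though both look identical on screen.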