Welcome to Wikidata, Jsamwrites!

Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!

Need some help getting started? Here are some pages you can familiarize yourself with:

  • Introduction – An introduction to the project.
  • Wikidata tours – Interactive tutorials to show you how Wikidata works.
  • Community portal – The portal for community members.
  • User options – including the 'Babel' extension, to set your language preferences.
  • Contents – The main help page for editing and using the site.
  • Project chat – Discussions about the project.
  • Tools – A collection of user-developed tools to allow for easier completion of some tasks.

Please remember to sign your messages on talk pages by typing four tildes (~~~~); this will automatically insert your username and the date.

If you have any questions, please ask me on my talk page. If you want to try out editing, you can use the sandbox. Once again, welcome, and I hope you quickly feel comfortable here and become an active editor for Wikidata.

Best regards!

Previous discussion was archived at User talk:Jsamwrites/Archive 1 on 2017-04-17.

31.200.13.53 (talkcontribs)

Hello.

Can you merge the Wikidata page "Category:Cities in Azerbaijan" (Q94287940) with the Wikidata page "Category:Cities and towns in Azerbaijan" (Q7287633)? The former has only a Wiktionary page which is filled and should be merged with the latter's Wiktionary page.

Yours sincerely, 31.200.13.53 13:15, 27 May 2022 (UTC)

Jsamwrites (talkcontribs)

 Done

31.200.13.53 (talkcontribs)
Reply to "Merge request"
So9q (talkcontribs)
Jsamwrites (talkcontribs)

@So9q Thanks. I will test it.

So9q (talkcontribs)
So9q (talkcontribs)
So9q (talkcontribs)
Jsamwrites (talkcontribs)
So9q (talkcontribs)
Jsamwrites (talkcontribs)

@So9q Thanks. I will use this pre-release version

So9q (talkcontribs)

New pre-release out with new features :) WDYT?

Jsamwrites (talkcontribs)

@So9q I see a lot of interesting features in this version. Great job!! But there are a couple of issues.

  1. I think the DEBUG mode is switched on by default. I see a lot of messages on the screen.
  2. I want to try the JOBS on Toolforge, but it's not very clear how I can run prepared jobs on Toolforge/PAWS.
  3. I tried the -l option. It doesn't work anymore. I think this option has been replaced by the -a option. But with the -a option, I have to press Enter for every new scholarly article. I am not sure whether you are introducing it as a feature replacing the previous batch option.
Jsamwrites (talkcontribs)

@So9q Updated the above reply. I think I checked out the wrong branch. Things are working fine. I struck out some previous comments.

So9q (talkcontribs)
Jsamwrites (talkcontribs)

@So9q Thanks for this tutorial.

After connecting via ssh, I ran the following command (assuming that itemsubjector already exists) and got the following error:


$ become itemsubjector

You are not a member of the group tools.itemsubjector.

Any existing member of the tool's group can add you to that.

So9q (talkcontribs)

Yeah, you have to create your own tool in the web interface, name it whatever you like, e.g. "itemsubjector-jsam", and use that.

I updated the guide with links.

Jsamwrites (talkcontribs)

@So9q Thanks for the updated guide. I am now able to ssh and run.

So9q (talkcontribs)

Great! I see you are editing a lot. Counting all your edits for September and October until now, you have 1M edits! If all of them are main subject, then you have single-handedly taken Wikidata from 14M to 15M total main subjects on the 37M articles. Wow!

See https://qlever.cs.uni-freiburg.de/wikidata/zZAhrs which times out in Blazegraph.

Jsamwrites (talkcontribs)
So9q (talkcontribs)

According to this query we had 27M articles without any main subject before. I'm curious to see how many it is after our effort. At best, it is 1M less, so still 26M to go! :D

So9q (talkcontribs)

We are now almost down to 26M lacking P921! Nice work. 25M is our next target. How many weeks do you think it will take? 3?

Jsamwrites (talkcontribs)
So9q (talkcontribs)

That is a different measurement 😃 My search lists all articles without any P921.
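For readers who want to reproduce this kind of measurement, here is a minimal sketch of a query that lists scholarly articles without any P921. It is not the query behind the link above; the function name and the LIMIT are assumptions made for illustration, and counting the full set may time out on Blazegraph, as noted earlier in this thread.

```python
SCHOLARLY_ARTICLE = "Q13442814"  # Wikidata item "scholarly article"
MAIN_SUBJECT = "P921"            # Wikidata property "main subject"


def articles_without_main_subject(limit: int = 100) -> str:
    """Build a SPARQL query listing scholarly articles with no P921 statement.

    Hypothetical helper for illustration; it only builds the query text
    and does not contact any endpoint.
    """
    if limit < 1:
        raise ValueError("limit must be positive")
    return (
        "PREFIX wd: <http://www.wikidata.org/entity/>\n"
        "PREFIX wdt: <http://www.wikidata.org/prop/direct/>\n"
        "SELECT ?article WHERE {\n"
        f"  ?article wdt:P31 wd:{SCHOLARLY_ARTICLE} .\n"
        f"  FILTER NOT EXISTS {{ ?article wdt:{MAIN_SUBJECT} ?subject }}\n"
        "}\n"
        f"LIMIT {limit}"
    )


if __name__ == "__main__":
    print(articles_without_main_subject(10))
```

The printed text can be pasted into a SPARQL endpoint of your choice.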

So9q (talkcontribs)

How many reverts have you got per 100,000 edits? I got a few when I matched Canada, so I stopped matching countries.

Jsamwrites (talkcontribs)

@So9q I recall a couple of them some weeks ago. The first one was 'Systemic therapy', which was ambiguous since it occurs in two different fields: psychology and cancer therapy.

  1. Q108744083 (Newly created one)
  2. Q1929812

One possible suggestion for improvement in ItemSubjector:

Show a warning when a candidate item uses the property P1889 (different from), i.e., when there are two items with the same label.
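The suggestion above could look roughly like the sketch below. This is not ItemSubjector's code: the function name and the simplified item record are hypothetical, and the sample data only assumes that the two items listed above are linked via P1889.

```python
from typing import Dict, List

DIFFERENT_FROM = "P1889"  # Wikidata property "different from"


def different_from_warnings(candidate: Dict) -> List[str]:
    """Return warnings for a candidate main-subject item.

    `candidate` is a hypothetical, simplified record of the form
    {"id": "Q...", "label": "...", "claims": {"P1889": ["Q...", ...]}}.
    """
    others = candidate.get("claims", {}).get(DIFFERENT_FROM, [])
    if not others:
        return []
    return [
        f"{candidate['id']} ({candidate['label']}) is marked 'different from' "
        f"{', '.join(others)}; check which item is meant before matching."
    ]


if __name__ == "__main__":
    # Made-up sample data; whether these items actually carry P1889 is an assumption.
    sample = {
        "id": "Q1929812",
        "label": "systemic therapy",
        "claims": {"P1889": ["Q108744083"]},
    }
    for warning in different_from_warnings(sample):
        print(warning)
```

A check like this would only prompt the user to look twice; it cannot decide which of the two items is the right main subject.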


The second one was alcoholism vs. alcoholism treatment. I think that I added the former to an item which already had the latter. So, I am now careful with single-word labels.

So9q (talkcontribs)
So9q (talkcontribs)

Fixed and working on master :)

Jsamwrites (talkcontribs)
So9q (talkcontribs)

Thanks for creating that and for inviting me. I feel honored. I signed up for WikidataCon and I will do my very best to attend. Would you like me to prepare something?


I just created a new query that might interest you/the participants: Wikidata:SPARQL query service/queries/examples#Galaxies ordered by the ones that are most linked from scientific articles

Also this query now times out on Blazegraph. A similar query works on QLever on older data from a few months back. I'm planning to set up a QLever instance in Toolforge in the near future.


Also: Did you try the new feature I pushed on master yesterday? :D

Jsamwrites (talkcontribs)

@So9q I tested the newest feature. It will be quite useful for certain subjects. I may need to test it more.

I did not yet test the Issue 26 fix.


Thanks for your reply concerning WikidataCon. I feel that the participants may have questions about ItemSubjector. If possible, you could present some of its key features.

So9q (talkcontribs)

Alright, I'll try to prepare a little demo video of the tool in action so they get a feeling for the interface.


People might want a QS-like front end, which could be a fun project.


I personally would like to make multiple batches and not have to wait for one to end before the next can be started.

So9q (talkcontribs)

I pushed QuickStatements export support to master (first step towards a WebUI) :) I did not find time to prepare a video, unfortunately.

If I don't make it into the meeting, please ask the participants whether they would like a Web UI for the tool that integrates with QS. See https://github.com/dpriskorn/ItemSubjector/issues/29

There has been very little traffic in the Git repo so far, so I wonder if a WebUI is worth the effort.

Jsamwrites (talkcontribs)
Jsamwrites (talkcontribs)
Jsamwrites (talkcontribs)

@So9q I tested the QS export option. It's working well. Great option. Thanks.

One observation: the generated CSV file is missing the 'inferred from' (maybe that is intended).

So9q (talkcontribs)

Oh, that's a bug. I'll open an issue

So9q (talkcontribs)

We are almost down to 25M articles missing P921 now. :) I'm working on medicine and there are thousands of subjects to cover still...

Jsamwrites (talkcontribs)

@So9q Yes, 25.16 M :)

Jsamwrites (talkcontribs)

@So9q < 25M articles now :)

So9q (talkcontribs)
Jsamwrites (talkcontribs)

@So9q Thanks for the update. I will take a look. Scholia statistics show that 20 M "main subject" values are now available.

So9q (talkcontribs)

Do you know how many P921 statements there were before I made the tool?

So9q (talkcontribs)

I have been working on Q35456 lately and will work on all of the medicines for the foreseeable future. Using --sparql with --limit in the newest version makes the tool way better IMO, because it first fetches (while I walk the dogs) and then I can approve/disapprove in one go at the end, start the job in k8s, and leave it running until it finishes.

My biggest k8s job has been around 30k items so far, running for 8 hours or more. Now I'm thinking 100k jobs and a runtime of 24h might be easy to achieve.

Jsamwrites (talkcontribs)

@So9q Based on what I remember, I think the number was somewhere between 15M and 16M (Scholia Statistics). I may have to dig through my tweets. I had shared some screen captures in the beginning. Check this. It shows the difference: around 15M to 17M (October 2021). I think that a majority of them come from ItemSubjector.


Documenting essential medicines (Q35456) will be useful. Wow, 100k jobs will be great.

So9q (talkcontribs)

I just started an 88k batch. Easy, took a few minutes to run all the queries and a few minutes to review. 😀

Jsamwrites (talkcontribs)
Reply to "New tool"

French lexemes for professions

Envlh (talkcontribs)

Hello,

You recently created several dozen French lexemes for professions. These are common nouns, which should not start with a capital letter. Could you please correct these lexemes (lemmas and forms)?

Have a good day.

Jsamwrites (talkcontribs)

@Envlh Thank you for pointing out the errors. I have just corrected all of these lexemes and their forms.

Envlh (talkcontribs)

Call for participation in a task-based online experiment

Kholoudsaa (talkcontribs)

Dear Jsamwrites,

I hope you are doing well.

I am Kholoud, a researcher at King's College London, and I work on a project as part of my PhD research, in which I have developed a personalised recommender system that suggests Wikidata items for the editors based on their past edits. I am collaborating on this project with Elena Simperl and Miaojing Shi.

I am inviting you to a task-based study that will ask you to provide your judgments about the relevance of the items suggested by our system based on your previous edits.

Participation is completely voluntary, and your cooperation will enable us to evaluate the accuracy of the recommender system in suggesting relevant items to you. We will analyse the results in anonymised form, and they will be published at a research venue.

The study will start in late January 2022 or early February 2022, and it should take no more than 30 minutes.

If you agree to participate in this study, please either contact me at [] or use this form https://docs.google.com/forms/d/e/1FAIpQLSees9WzFXR0Vl3mHLkZCaByeFHRrBy51kBca53euq9nt3XWog/viewform?usp=sf_link

I will contact you with the link to start the study.

For more information about the study, please read this post: https://www.wikidata.org/wiki/User:Kholoudsaa

In case you have further questions or require more information, don't hesitate to contact me at the email address mentioned above.

Thank you for considering taking part in this research.

Regards

2001:E68:5425:6418:D047:D196:C3DB:63A8 (talkcontribs)

Can you block this user, User:AcMY01, for disruptive editing? -~~~~

Reply to "Block this user"
SCIdude (talkcontribs)
Jsamwrites (talkcontribs)
Reply to "subject bot"
Trilotat (talkcontribs)

I would suggest that this is not the main subject of the articles with the phrase “geologic map” in the title. Rather, I would suggest that those articles are more like instances of “geologic map.”

Jsamwrites (talkcontribs)

"geologic map" as a value of 'instance of (P31)' could be another interesting possibility. The following are the reasons behind the use of "P921 (subject)":

  1. I considered "geologic map" as a visualization tool.
  2. Keeping aside certain items, a vast majority of "scholarly article" items are single-valued for P31.
Trilotat (talkcontribs)

I understand your logic; it makes good sense to me. For what it’s worth, I long ago added geologic map as P31, so we’re covered. I thought it might be redundant, but that’s probably not a significant issue. Take care.

Reply to "Geologic Map as subject"
Vis M (talkcontribs)

Hi, please note these changes I have made to your input.

  • The ~ങ്കൽ form is archaic and not used now
  • The ~ു് form is rarely used nowadays and only for some place names. For common nouns, it is never used now.
Jsamwrites (talkcontribs)

Thanks @Vis M for your feedback. I will also be adding the suggested forms in my subsequent edits. Personally, I feel that it would be useful to document these archaic forms since they are part of the linguistic heritage. Take for example Lexeme:L18745 (thou) in English.

Vis M (talkcontribs)

Ok. Thanks

Reply to "Errors in ml lexeme forms"
VIGNERON (talkcontribs)

Hi,

I see that you have started creating lexemes for the Climate week! Thank you very much <3

I will let you do the creations and, for your information, I am following behind to add information (grammatical gender and plural forms, for example).

You could perhaps use the Lexeme forms tool to make the work easier ;)

Jsamwrites (talkcontribs)
Reply to "Climate lexeme week"
Infovarius (talkcontribs)

Hello, I suppose you are adding items to the wrong senses...

Reply to "Water sense"