Wikidata:Requests for permissions/Bot/Uzielbot
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved --Lymantria (talk) 06:03, 17 October 2019 (UTC)[reply]
Uzielbot edit
Uzielbot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Uziel302 (talk • contribs • logs)
Task/s: upload 500K forms of Hebrew words, based on Hspell open source Hebrew morphological analyzer. The tool itself is under AGPL but grammatical information on its own isn't under copyright according to legal conclusions here.
Code: I use LexData framework and run python script in the following format, I already uploaded about 100 forms using my main account as test run:
import LexData
from LexData.languages import he
repo = LexData.WikidataSession("Uziel302", "<REDACTED>")
L2 = LexData.get_or_create_lexeme(repo, "אבד", he, "Q24905")
L2.createForm("אבדתי", ['Q24905','Q21714344','Q110786','Q1994301'], claims={"P1343": ["Q6936841"]})
Function details: --Uziel302 (talk) 12:34, 5 October 2019 (UTC)[reply]
@Uziel302: would it be possible to create a full lexeme before uploading it? It seems that the LexData does "incremental" lexeme creation - you first create just the core, and then add individual statements/forms/etc. The bot I have been using first creates a complete representation first, and uploads it as a single command. --Yurik (talk) 19:36, 6 October 2019 (UTC)[reply]
- @Yurik: That's how I understood LexData, User:MichaelSchoenitzer may be able to add option to post multiple forms in one edit. I can also use your bot if you can adjust it to the format of my data: base word and multiple forms of it with list of grammatical info. Uziel302 (talk) 20:26, 6 October 2019 (UTC)[reply]
- @Yurik: if I get the data in JSON format, each lexeme with its forms as sons, will you be able to upload it with your bot? Uziel302 (talk) 09:46, 7 October 2019 (UTC)[reply]
- @Uziel302: I will try to hack an example for you on how you can use my bot for your uploads. Hopefully in the next few days. --Yurik (talk) 15:07, 7 October 2019 (UTC)[reply]
- Adding multiple things in one edit does unfortunately require to use completely different API-calls that are way more complex to handle. Therefore I would need to rewrite a lot – what I can't do in the near future. -- MichaelSchoenitzer (talk) 11:38, 11 October 2019 (UTC)[reply]
- I am ready to approve this request in a couple of days, provided that no objections will be raised. Lymantria (talk) 06:20, 14 October 2019 (UTC)[reply]
- @Lymantria: I am a bit concerned about tens of edits rather than a single "create everything and upload it at once". It is ok to do that by hand - humans are not robots, but when we upload a large number of items, multiplied by 10-30 edits -- that's a big stress on the servers, and all those edits will be forever stored and use up significant data storage. --Yurik (talk) 16:34, 14 October 2019 (UTC)[reply]
- @Yurik: And as you have seen above, an alternative is not easily at hand. I don't think this issue should block approval. Lymantria (talk) 06:36, 16 October 2019 (UTC)[reply]
- @Lymantria: I am a bit concerned about tens of edits rather than a single "create everything and upload it at once". It is ok to do that by hand - humans are not robots, but when we upload a large number of items, multiplied by 10-30 edits -- that's a big stress on the servers, and all those edits will be forever stored and use up significant data storage. --Yurik (talk) 16:34, 14 October 2019 (UTC)[reply]