Wikidata:Requests for permissions/Bot/ItalianSchoolsBot
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved --Ymblanter (talk) 19:53, 5 May 2018 (UTC)
ItalianSchoolsBot
Operator: Floatingpurr
Task/s: Loading datasets of information about Italian schools (e.g., name, type, location).
Code: https://github.com/floatingpurr/wikidata-utils/tree/master/bots/ItalianSchoolsBot The code is available and ready for review and some testing. Floatingpurr (talk) 11:27, 27 April 2018 (UTC)
Function details: Well, this is my first time in Wikidata's bot section. As you can see here, I wrote some Python code to clean a dataset of 65k+ rows and generate statements for QuickStatements but, unfortunately, the resulting batch of statements is too large for QuickStatements. That's why I'm here :). Pardon me if this request looks a bit unconventional, but I need feedback on whether a bot is the right way to load huge datasets, and some pointers to best practices for building a compliant Python bot. Dataset info is in the Data Hub. Some relevant points are listed below:
- 65k+ rows (complete dataset available here)
- some items already exist in Wikidata (Wikidata pointers are available in the spreadsheet)
- I probably need to create a brand-new property for the unique ID of Italian schools (i.e., CODICESCUOLA)
- I probably need a second run, since CODICESCUOLA (i.e., parent organization (P749) or part of (P361)) refers back to items created in the first run.
- item example: D. Buzzati Limana (Q52083858)
- the email field was not loaded in the test run due to policy restrictions.
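A minimal sketch of the two-run idea from the points above, offered only as an illustration and not as the bot's actual code. The column names (`code`, `name`, `parent_code`, `qid`), the placeholder identifiers, and the `fake_create_item` stand-in are all assumptions; no Wikidata API is called. The first run reuses an existing item when the spreadsheet already holds a pointer and creates one otherwise; the second run resolves parent references once every school code maps to an item.

```python
from typing import Callable, Dict, List, Optional, Tuple

Row = Dict[str, Optional[str]]

# Hypothetical rows: column names and values are assumptions, not the real schema.
ROWS: List[Row] = [
    {"code": "CODE_A", "name": "Istituto A", "parent_code": None, "qid": "Q_EXISTING"},
    {"code": "CODE_B", "name": "Scuola B", "parent_code": "CODE_A", "qid": None},
    {"code": "CODE_C", "name": "Scuola C", "parent_code": "CODE_MISSING", "qid": None},
]


def first_run(rows: List[Row], create_item: Callable[[Row], str]) -> Dict[str, str]:
    """Reuse the existing item if the row has a pointer, else create one.

    Returns a mapping from school code to item identifier.
    """
    code_to_qid: Dict[str, str] = {}
    for row in rows:
        qid = row["qid"] or create_item(row)
        code_to_qid[row["code"]] = qid
    return code_to_qid


def second_run(rows: List[Row], code_to_qid: Dict[str, str]) -> List[Tuple[str, str, str]]:
    """Build (child, "P749", parent) triples for parents that can be resolved."""
    links: List[Tuple[str, str, str]] = []
    for row in rows:
        parent = row["parent_code"]
        if parent and parent in code_to_qid:
            links.append((code_to_qid[row["code"]], "P749", code_to_qid[parent]))
    return links


def fake_create_item(row: Row) -> str:
    """Stand-in for a real item-creation call; returns a placeholder identifier."""
    return "Q_NEW_" + str(row["code"])


mapping = first_run(ROWS, fake_create_item)
links = second_run(ROWS, mapping)
```

Rows whose parent code is not in the mapping are skipped in the second run instead of raising, so they can be reported and handled by hand.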
Thanks, Floatingpurr (talk) 17:11, 22 April 2018 (UTC)
- To create a new property, see Wikidata:Property proposal/Generic. Someone will also ask for a test run before approval. --Valerio Bozzolan (talk) 23:42, 23 April 2018 (UTC)
- Ok, thank you Valerio for your help. I'm going to request the new property (see Wikidata:Property_proposal/Generic#Italian_School_ID). In the meantime, I'll try to figure out how to operate a Python bot effectively. I hope I'll be able to share the complete code soon. :) Floatingpurr (talk) 11:09, 24 April 2018 (UTC)
- Property Italian School ID (P5114) is now available! Floatingpurr (talk) 10:11, 3 May 2018 (UTC)
Testing:
Test#1 Loaded new items for testing purposes, but the bot is now subject to the write limits for unapproved bots. Looking forward to receiving community comments. Floatingpurr (talk) 23:43, 1 May 2018 (UTC)
- Support. Looks to be working well to me, although I'd suggest running more entries (up to ~50 items). --Reosarevok (talk) 11:40, 3 May 2018 (UTC)
- Test#2 ~50 items just loaded! Floatingpurr (talk) 15:49, 3 May 2018 (UTC)
- Test#3 Updated school items already present in Wikidata. E.g., T.Tasso (Q13392468). Floatingpurr (talk) 12:37, 4 May 2018 (UTC)