Wikidata talk:Projects/UG Digital Collection Wikidata Batch Upload Project Tracking

Latest comment: 3 years ago by Celestinesucess in topic Policies Submission Excel Sheet

This page is for documenting and tracking the data cleaning process of UG Digital Collections on Wikidata using Open Refine. This is part of the ongoing project Wikidata GLAMs Campaign Ghana.

Query edit

@Masssly: Please I moving conversations from here to this page. I have finally created the list page as instructed.Celestinesucess (talk) 19:29, 2 December 2020 (UTC)Reply

Great job. Thanks. One more useful column that'd be nice to have is the basic statement of the item, i.e. instance of academic chapter, scholarly article, etc. After that, let's prepare the Quickstatements batches, one spreadsheet at a time, test one or two in each spreadsheet, if is works we move forwards and add the rest to Wikidata. You can add link to the spreadsheet here, so we can easily ask for help we run into issues. -—M@sssly 01:12, 3 December 2020 (UTC)Reply
Okay I have included the instance of column.Celestinesucess (talk) 16:02, 3 December 2020 (UTC)Reply

Policies Submission Excel Sheet edit

@Masssly: The sheet I picked for us to try first is the [Policies Submission]. I have split the dc.publisher column to include the volume and issue columns. I need to create the University of Ghana Digital Collections (UGSpace) ID column and the full work available at URL column. I can create the ID column but I am not sure how to go about creating and populating the column for the full work available at URL column.

@Celestinesucess:You could ignore the P953. Since the scraper program only picked characters that were immediately available on the main ?show=full page, the "full work url" can only be added manually according to the data we have in the spreadsheet. I do see a pattern in the full work URLs where it consists of exact match+title_item+.pdf?sequence=1&isAllowed=y. I haven't tried it, but it may work if you use this to pull them together. Again, feel free to ignore the P953 if you also agree that its extraneous.

Also, dc.type column value for all items is other instead of Book Chapter or Scholarly article. Please how can we go about reconciling this column?

The Items in Policies Submission do not appear to me as Book Chapters or Scholarly articles but as scholarly publications (Q591041). What do you think?

Finally, the main subject column is absent from this sheet and there is no way we can create and populate this column so the items which will be created on Wikidata will not have the main subject statementCelestinesucess (talk) 16:02, 3 December 2020 (UTC)Reply

It's safe to apply P921-->Q1156854 or P921-->Q546113 for all the items in this sheet. The Policies Submission is relatively short, so if you have extra time you could go through each Item's full record and add other "subjects" that the Item may also belong to. E.g. Finance could apply to Q102111640 as well. —M@sssly 16:42, 29 December 2020 (UTC)Reply
@Masssly: I have uploaded the entries but I ommitted the publisher, volume and issue columns because, there the publisher,'University of Ghana Special Reporter' does not have an item on Wikidata and I did not create it because I wasn't sure what statements to add to it.Celestinesucess (talk) 11:11, 22 February 2021 (UTC)Reply
Return to the project page "Projects/UG Digital Collection Wikidata Batch Upload Project Tracking".