Wikidata:Dataset Imports/A Bibliography for Aegean Glyptic in the Bronze Age (1991), compiled by John G. Younger

You may find these related resources helpful:


Guidelines for using this page edit

Documenting the import edit

  • Guidelines on how to import a dataset into Wikidata are available at Wikidata:Data Import Guide.
  • Please include notes on all steps of the process.
  • Once a dataset has been imported into Wikidata please edit the page to change the progress status from in progress to complete.
  • It is strongly recommended to use Visual Editor when making changes to this page, particularly for editing any of the tables.

Creating a Wikidata item for the dataset edit

  • Please create a Wikidata item for the dataset, this will allow us to improve the coverage of datasets on Wikidata and understand what datasets are available on that topic and which of them have been added to Wikidata.
  • If you are working with very large dataset you can break it into smaller Mix n' Match catalogues, but only create one Wikidata item.
  • Link the dataset Wikidata item to this page using Wikidata Dataset Imports URL (P5195)

Getting help edit

  • If your dataset import runs into issues please edit the page to change the progress status from in progress to help needed.
  • You can ask for help on Wikidata:Project chat.

Overview edit

Dataset name edit

A Bibliography for Aegean Glyptic in the Bronze Age (1991), compiled by John G. Younger

 
A side of a four-sided green jasper seal. (CMS II,2 316d)

Source edit

The original bibliography list was published in the CMS series by the Corpus der Minoischen und Mykenischen Siegel (http://cmsheidelberg.uni-hd.de/) in 1991 as the supplementary volume 4 (Beiheft 4). In 2009 the author published this same annotated bibliography in HTML format on http://people.ku.edu/~jyounger/Sphragis/sealbib1.html

Link edit

The HTML version of the bibliography is available under: http://people.ku.edu/~jyounger/Sphragis/sealbib1.html

A digitised version of the printed bibliography is available under http://dx.doi.org/10.11588/propylaeum.367.518 in PDF format.

A biblatex version of the bibliography is being prepared and enriched and available under https://github.com/bellerophons-pegasus/lobib/tree/master/source-bibtex

Dataset description edit

The dataset consists of about 1200 bibliographic references to scholarly publications about Aegean seals and sealings from the Bronze Age published before 1991. Each reference is annotated with related topics, including places and discussed objects.

The bibliography list uses the citation format and the abbreviations established by the American Journal of Archaeology (AJA) 90 (1986) 381-394.

Additional information edit

The list is first organised by author and then followed by a thematic index, where after each keyword a list of author year references follows. Import of the dataset was initiated during a fellowship in the programme "Open Science Fellows Program" funded by Wikimedia Deutschland, the Stifterverband, and the Volkswagen Foundation. A detailed project description is available on Wikiversity. Additionally a dedicated web application is under construction in order to fetch the references from wikidata and display them in a custom way with a search function and filters. Further functionalities like sorting and also exporting are planned. The repository is available on GitHub.

Progress of import edit

The table below is used to track the progress of importing this dataset. The suggested column headings are most applicable to data being imported from a spreadsheet - you can change some column headings or add new columns as required to best describe the progress of this import.

Wikidata item for the datasetTransform data from HTML into bibtexClean and enrich dataFormat data into spreadsheet to import the dataStructure of data within WikidataMatch the dataset to WikidataImporting data into WikidataVisualisationsMaintainance queries and expected results
A Bibliography for Aegean Glyptic in the Bronze Age (Q61761384)In progress, Authors a to f ready. Final bibtex files are stored on GitHub.In progress. Correcting typos and in rare cases wrong references. Adding links to full texts when available, as well as DOIs and the like.Not done yet; depends on structure of data within WikidataIn progress, first manual entries already present.In progress. Information to be matched: Keywords, Authors, Editors, Publisers, and Journals.Not done yetIn progress; see repository on GitHub.Not done yet

Edit history edit

Use the table below to list batches of edits that have been completed for this dataset. Ideally each entry should have all applicable columns filled out, but at a minimum please make to add a date and description to give an idea of what was added to Wikidata and when.

DateDescriptionMethodPropertiesQualifiersReferencesStatements addedStatements removedLink to import sheet
Nov. 18 - Feb. 19Exploring how references can be modelled in WikidataManual and for two or three test with Quickstatemens---10 new items; 12 items expanded0
30. May 19Updating persons and importing new person information related to references a-bReconciliation with OpenRefine, exporting from there via Quickstatements in multiple iterations; manual editing for corrections and individual additionsfamily name; given name; occupation; sex or gender; Viaf ID; series ordinals -26 new family names; 35 updated items; 36 new persons0
Date 3Description 3Method 3Properties 3Qualifiers 3References 3Added Count 3Removed Count 3Link 3

Discussion of import edit

These headings are generally useful, please change this section to suit your needs.

Wikidata item for dataset edit

The bibliographic information of the printed bibliography was entered into Wikidata and also marked as a dataset: A Bibliography for Aegean Glyptic in the Bronze Age (Q61761384)

Transform data from HTML into bibtex edit

Transforming of data was done in seven batches by author last names: a-b, c-f, g-j, k-m, n-q, r-t, and v-z. First the references where manually copied from the website and pasted into a txt file. This was processed with a Python script in order to get a text file where each line contains a reference with the author name at the beginning of the line. The resulting file could then be parsed with AnyStyle. The GUI of AnyStyle offers a comfortable way of viewing the parsed results and correcting the assigned labels. The first two batches were also used to train the model, which resulted in better parsing results in the remaining batches. The corrected references could then be exported from AnyStyle into the BibTeX format, which was then further edited with JabRef.

Screenshots of the individual transformation steps:

Clean and enrich data edit

After transformation into the BibTeX format the references where edited using JabRef. In order to improve the quality of the dataset each reference was manually looked up online in order to check and enrich (e. g. add links to full texts) it.

In the course of this process typos were detected and errors corrected. Whenever a digital version of the reference was found, the respective link was added. If DOIs were assigned, they were also added. Links to full texts include links to JSTOR, journal web pages, and in some cases to Academia.edu or Researchgate.net

The final BibTeX files are available on GitHub. Authors a-f are ready, the remaining still in progress. All references to reviews are going to be processed later, because they all refer to publications mentioned in the list.

Format data into spreadsheet to import the data edit

Structure of data within Wikidata edit

First manual entries are already present. They were created in order to explore how they can be modelled in Wikidata and if any new properties have to be created. For formatting reference information work done in the WikiCite project is considered. An overview on how the individual reference types are going to be represented will follow here.

Field nameWikidata propertyNotes
Name1Property1Notes1
Name2Property2Notes2
Name3Property3Notes3

Match the dataset to Wikidata edit

Information to be matched: Keywords, Authors, Editors, Publisers, and Journals. More information will follow.

Importing data into Wikidata edit

Import completion notes edit

Visualisations edit

With Scholia a nice visualisation of the dataset is already provided out of the box: https://tools.wmflabs.org/scholia/topic/Q58681669

Still a bespoke web application is being developed, in order to provide more information and more reference management related functionality. The development repository is available on GitHub. The current version of the application is published with GitHub pages

Maintenance edit

Queries and expected results edit

Query linkDescriptionExpected results
Link1Property1Notes1
Link2Property2Notes2
Link3Property3Notes3

Schedule of new data released edit