Wikidata:Dataset Imports/A Bibliography for Aegean Glyptic in the Bronze Age (1991), compiled by John G. Younger
Guidelines for using this page
Documenting the import
- Guidelines on how to import a dataset into Wikidata are available at Wikidata:Data Import Guide.
- Please include notes on all steps of the process.
- Once a dataset has been imported into Wikidata please edit the page to change the progress status from in progress to complete.
- It is strongly recommended to use Visual Editor when making changes to this page, particularly for editing any of the tables.
Creating a Wikidata item for the dataset
- Please create a Wikidata item for the dataset. This allows us to improve the coverage of datasets on Wikidata and to understand which datasets are available on a topic and which of them have been added to Wikidata.
- If you are working with a very large dataset you can break it into smaller Mix'n'match catalogues, but only create one Wikidata item.
- Link the dataset Wikidata item to this page using Wikidata Dataset Imports URL (P5195).
Getting help
- If your dataset import runs into issues please edit the page to change the progress status from in progress to help needed.
- You can ask for help on Wikidata:Project chat.
Overview
Dataset name
A Bibliography for Aegean Glyptic in the Bronze Age (1991), compiled by John G. Younger
Source
The original bibliography was published in 1991 as supplementary volume 4 (Beiheft 4) of the CMS series by the Corpus der Minoischen und Mykenischen Siegel (http://cmsheidelberg.uni-hd.de/). In 2009 the author published the same annotated bibliography in HTML format at http://people.ku.edu/~jyounger/Sphragis/sealbib1.html
Link
The HTML version of the bibliography is available at http://people.ku.edu/~jyounger/Sphragis/sealbib1.html
A digitised version of the printed bibliography is available in PDF format at http://dx.doi.org/10.11588/propylaeum.367.518
A biblatex version of the bibliography is being prepared and enriched; it is available at https://github.com/bellerophons-pegasus/lobib/tree/master/source-bibtex
Dataset description
The dataset consists of about 1200 bibliographic references to scholarly publications about Aegean seals and sealings from the Bronze Age published before 1991. Each reference is annotated with related topics, including places and discussed objects.
The bibliography list uses the citation format and the abbreviations established by the American Journal of Archaeology (AJA) 90 (1986) 381-394.
Additional information
The list is organised first by author, followed by a thematic index in which each keyword is followed by a list of author-year references.
The import of the dataset was initiated during a fellowship in the Open Science Fellows Program, funded by Wikimedia Deutschland, the Stifterverband, and the Volkswagen Foundation. A detailed project description is available on Wikiversity.
In addition, a dedicated web application is under construction that fetches the references from Wikidata and displays them in a custom way with a search function and filters. Further functionality such as sorting and exporting is planned. The repository is available on GitHub.
Progress of import
The table below is used to track the progress of importing this dataset. The suggested column headings are most applicable to data being imported from a spreadsheet - you can change some column headings or add new columns as required to best describe the progress of this import.
Wikidata item for the dataset | Transform data from HTML into bibtex | Clean and enrich data | Format data into spreadsheet to import the data | Structure of data within Wikidata | Match the dataset to Wikidata | Importing data into Wikidata | Visualisations | Maintenance queries and expected results |
---|---|---|---|---|---|---|---|---|
A Bibliography for Aegean Glyptic in the Bronze Age (Q61761384) | In progress; authors a to f ready. Final BibTeX files are stored on GitHub. | In progress. Correcting typos and, in rare cases, wrong references. Adding links to full texts when available, as well as DOIs and the like. | Not done yet; depends on structure of data within Wikidata | In progress; first manual entries already present. | In progress. Information to be matched: Keywords, Authors, Editors, Publishers, and Journals. | Not done yet | In progress; see repository on GitHub. | Not done yet |
Edit history
Use the table below to list batches of edits that have been completed for this dataset. Ideally each entry should have all applicable columns filled out, but at a minimum please make sure to add a date and a description to give an idea of what was added to Wikidata and when.
Date | Description | Method | Properties | Qualifiers | References | Statements added | Statements removed | Link to import sheet |
---|---|---|---|---|---|---|---|---|
Nov. 2018 - Feb. 2019 | Exploring how references can be modelled in Wikidata | Manual; two or three tests with QuickStatements | - | - | - | 10 new items; 12 items expanded | 0 | |
30 May 2019 | Updating persons and importing new person information related to references a-b | Reconciliation with OpenRefine, exporting from there via QuickStatements in multiple iterations; manual editing for corrections and individual additions | family name; given name; occupation; sex or gender; VIAF ID | series ordinals | - | 26 new family names; 35 updated items; 36 new persons | 0 | |
Date 3 | Description 3 | Method 3 | Properties 3 | Qualifiers 3 | References 3 | Added Count 3 | Removed Count 3 | Link 3 |
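
The QuickStatements export mentioned in the table above can be illustrated with a short sketch. It is not the workflow actually used for this import (that went through OpenRefine); it only shows, under stated assumptions, what QuickStatements V1 commands for a new person item might look like. The property IDs are the standard Wikidata ones for the properties listed in the table (P31 instance of, P734 family name, P735 given name, P214 VIAF ID, P1545 series ordinal). Attaching the series ordinal to the given name statements is an assumption, and the function name and all item IDs in the usage example are hypothetical placeholders.

```python
def person_to_quickstatements(label, family_name_qid, given_name_qids, viaf_id=None):
    """Build QuickStatements V1 commands (tab-separated) for one new person item.

    Illustrative only: the QIDs passed in must already exist on Wikidata,
    e.g. as the result of a reconciliation step.
    """
    rows = [
        ["CREATE"],                          # create a new item
        ["LAST", "Len", f'"{label}"'],       # English label
        ["LAST", "P31", "Q5"],               # instance of: human
        ["LAST", "P734", family_name_qid],   # family name
    ]
    # Assumption: given names carry a series ordinal qualifier (P1545),
    # which is a string value and therefore quoted.
    for ordinal, qid in enumerate(given_name_qids, start=1):
        rows.append(["LAST", "P735", qid, "P1545", f'"{ordinal}"'])
    if viaf_id:
        rows.append(["LAST", "P214", f'"{viaf_id}"'])  # VIAF ID (external identifier)
    return "\n".join("\t".join(row) for row in rows)


if __name__ == "__main__":
    # Placeholder values, not real items or identifiers.
    print(person_to_quickstatements("Jane Mary Doe", "Q_FAMILY", ["Q_GIVEN1", "Q_GIVEN2"], "0000"))
```

The output can be pasted into the QuickStatements V1 input box; it should be reviewed by hand first, since the sketch does not check for existing items.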
Discussion of import
These headings are generally useful; please change this section to suit your needs.
Wikidata item for dataset
The bibliographic information of the printed bibliography was entered into Wikidata and also marked as a dataset: A Bibliography for Aegean Glyptic in the Bronze Age (Q61761384)
Transform data from HTML into bibtex
The data was transformed in seven batches by author last name: a-b, c-f, g-j, k-m, n-q, r-t, and v-z. First the references were manually copied from the website and pasted into a plain-text (txt) file. This file was processed with a Python script to produce a text file in which each line contains one reference with the author name at the beginning of the line. The resulting file could then be parsed with AnyStyle. The AnyStyle GUI offers a convenient way of viewing the parsed results and correcting the assigned labels. The first two batches were also used to train the model, which improved the parsing results for the remaining batches. The corrected references were then exported from AnyStyle in BibTeX format and edited further with JabRef.
Screenshots of the individual transformation steps:
- A reference to an article by P. I. Agallopoulou on the web page created by John G. Younger
- A Python script to transform the list of references copied from the web page into a unified format for parsing
- The resulting unified text file for parsing with the reference to the article by P. I. Agallopoulou
- The same reference after parsing with anystyle.io. A manual correction of labels was already done.
- After export from AnyStyle in BibTeX format the reference list can be viewed, edited and enriched in e.g. JabRef
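
The Python step described above (one reference per line, author name first) could be sketched as follows. This is not the script used for the import, which is only shown in the screenshot; it is a minimal illustration under an assumed input layout: author blocks are separated by blank lines, the first line of a block is the author heading, each entry starts with a four-digit year, and any other line continues the current entry. The function name and the sample data in the usage example are hypothetical.

```python
import re

# An entry is assumed to start with a year such as "1970." or "1971a."
YEAR_START = re.compile(r"^\d{4}[a-z]?\b")


def unify_references(raw_text):
    """Return one string per reference, each prefixed with its author heading."""
    references = []
    author = None
    current = None
    expect_author = True
    for raw_line in raw_text.splitlines():
        line = " ".join(raw_line.split())  # trim and collapse whitespace
        if not line:
            # A blank line closes the current author block.
            if current is not None:
                references.append(current)
                current = None
            expect_author = True
        elif expect_author:
            author = line
            expect_author = False
        elif YEAR_START.match(line):
            # A new entry begins; store the previous one first.
            if current is not None:
                references.append(current)
            current = f"{author} {line}"
        elif current is not None:
            current += " " + line  # wrapped continuation of the current entry
        else:
            author = f"{author} {line}"  # heading wrapped over two lines
    if current is not None:
        references.append(current)
    return references


if __name__ == "__main__":
    sample = 'Doe, J.\n1970. "First title," Journal A 1,\n   10-20.\n1971a. Second title. City.\n'
    print("\n".join(unify_references(sample)))
```

A continuation line that happens to begin with a four-digit number would be misread as a new entry, so the output still needs a manual check before it is passed to AnyStyle.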
Clean and enrich data
After transformation into the BibTeX format the references were edited using JabRef. To improve the quality of the dataset, each reference was manually looked up online in order to check and enrich it (e.g. by adding links to full texts).
In the course of this process typos and errors were detected and corrected. Whenever a digital version of the reference was found, the respective link was added; DOIs were added where assigned. Links to full texts include links to JSTOR, journal web pages, and in some cases Academia.edu or Researchgate.net.
The final BibTeX files are available on GitHub. Authors a-f are ready; the remaining ones are still in progress. All references to reviews will be processed later, because they all refer to publications mentioned in the list.
Format data into spreadsheet to import the data
Structure of data within Wikidata
The first manual entries are already present. They were created to explore how the references can be modelled in Wikidata and whether any new properties have to be created. The formatting of reference information takes into account the work done in the WikiCite project. An overview of how the individual reference types are going to be represented will follow here.
Field name | Wikidata property | Notes |
---|---|---|
Name1 | Property1 | Notes1 |
Name2 | Property2 | Notes2 |
Name3 | Property3 | Notes3 |
Match the dataset to Wikidata
Information to be matched: Keywords, Authors, Editors, Publishers, and Journals. More information will follow.
Importing data into Wikidata
Import completion notes
Visualisations
Scholia already provides a visualisation of the dataset out of the box: https://tools.wmflabs.org/scholia/topic/Q58681669
In addition, a bespoke web application is being developed to provide more information and more functionality related to reference management. The development repository is available on GitHub. The current version of the application is published with GitHub Pages.
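
How such an application might fetch the references from Wikidata can be sketched with a SPARQL query against the public Wikidata Query Service. This is not the code of the application itself. It assumes that the imported works are linked to the topic item from the Scholia URL above (Q58681669) via main subject (P921), which is how Scholia topic pages collect works; the function names are hypothetical, and publication date (P577) is included only as an example of an optional field.

```python
import re
from urllib.parse import urlencode

ENDPOINT = "https://query.wikidata.org/sparql"


def build_reference_query(topic_qid, limit=100):
    """Return a SPARQL query listing works whose main subject is topic_qid."""
    if not re.fullmatch(r"Q\d+", topic_qid):
        raise ValueError(f"not a Wikidata item ID: {topic_qid!r}")
    return f"""
SELECT ?work ?workLabel ?date WHERE {{
  ?work wdt:P921 wd:{topic_qid} .          # main subject (assumed link to the topic)
  OPTIONAL {{ ?work wdt:P577 ?date . }}     # publication date, if present
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en" . }}
}}
ORDER BY ?date
LIMIT {int(limit)}
""".strip()


def build_request_url(topic_qid, limit=100):
    """Return a GET URL that asks the query service for JSON results."""
    params = {"query": build_reference_query(topic_qid, limit), "format": "json"}
    return ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_request_url("Q58681669", limit=10))
```

The sketch only builds the request; sending it and rendering the results with search and filters is left to the application.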
Maintenance
Queries and expected results
Query link | Description | Expected results |
---|---|---|
Link1 | Property1 | Notes1 |
Link2 | Property2 | Notes2 |
Link3 | Property3 | Notes3 |