Wikidata:WikiProject Manuscripts/Catenae


Catenae Catalogue

Project description

edit

The Catenae Catalogue (https://purl.org/itsee/catena-catalogue) is a database created by Georgi Parpulov in conjunction with his 2021 monograph Catena Manuscripts of the Greek New Testament (DOI 10.31826/9781463242619). It lists and describes Greek manuscripts of the New Testament which have catena-type commentaries.

The database has basic information on 721 manuscripts within the project scope. It is hosted by the Institute for Textual Scholarship and Electronic Editing (ITSEE) at the University of Birmingham. Its contents are available without subscription and published under Creative Commons BY 4.0.

Data description

edit

The database has basic information on all witnesses available in tabular data (https://itsee-wce.birmingham.ac.uk/catenacatalogue/) (Rahlfs number, Place, Library and Shelf Mark, Content, Date, Images available).

  1. "view", link to entries individual manuscripts (using a non-significant 4-digit identifier)
  2. GA = Gregory-Aland-Number, Gregory-Aland-Number (P1577) (only where available)
  3. Liste = link to the corresponding manuscript in the New Testament Virtual Manuscript Room (https://ntvmr.uni-muenster.de/)
  4. Diktyon = Diktyon number, Diktyon ID (P12042)
  5. Parpulov = group (or class) this manuscript belongs to according to its textual contents (as assigned by Parpulov)
  6. Location of the manuscript
  7. Library of the manuscript
  8. Shelfmark of the manuscript (fonds and shelf mark)
  9. Contents (coded: e = Gospels ...)
  10. CPG = Clavis Patrum Graecorum identifiers of the catenae in the manuscript (only where available), Clavis Patrum Graecorum ID (P7988)
  11. Century = date of the manuscript (century, sometimes more precise: 1st half, 2nd half, med., ex.)
  12. Layout = layout of the catena(e) (alternating catena, frame catena, marginal scholia)
  13. Biblical text = completeness of the main text (abridged, full)
  14. Further Images = links to repositories of digital copies or photographs of the manuscript pages, full work available at URL (P953)

Entries for individual manuscripts also have additional data which is not available in the tabular file:

  1. Year (of creation of the manuscript, often empty)
  2. Material (parchment, paper)
  3. Source names (stated within the catenae)
  4. Incipits

Tasks and progress

edit
  1. ✓ Done Receive and prepare data for importing
    1. ✓ Done (2023-10-11) Extract tabular data (721 rows) with colums 1–14 and create a TSV file for OpenRefine.
    2. ✓ Done (2023-10-21) Decide how to best import data into Wikidata. Created four separate lists:
      1. Gregory-Aland-Number = Diktyon Number (568 rows, including 5 GA-Numbers without Diktyon IDs)
      2. 1 Gregory-Aland-Number for 2 or more Diktyon Numbers (47 rows)
      3. 2 Gregory-Aland-Numbers for 1 Diktyon Number (1 row)
      4. no Gregory-Aland-Numbers (105 rows)
    3. ✓ Done (2023-10-22) Create properties for the Catenae Catalogue IDs and the Parpulov classes: Wikidata:Property proposal/Catenae Catalogue‎Catenae Catalogue ID (P12109), Parpulov group (P12110).
  2. ✓ Done (2023-10-22) Import data with OpenRefine: 1 item each for sub-files #1 and #4, 3 items each for sub-files #2 and #3
    1. ✓ Done (2023-10-23) Gregory-Aland-Number = Diktyon Number (568 rows, including 5 GA-Numbers without Diktyon IDs): Match (251 matched, 317 new) and create missing items.
    2. ✓ Done (2023-10-23) 1 Gregory-Aland-Number for 2 or more Diktyon Numbers (47 rows): Create missing GA items (47 new).
    3. ✓ Done (2023-10-22) 2 Gregory-Aland-Numbers for 1 Diktyon Number (1 row): Created 2 GA items and 1 Diktyon item
    4. ✓ Done (2023-10-22) no Gregory-Aland-Numbers (105 rows): Matched (2 matched, 103 new) and created items (2 already existed: Catena on St. Luke's Gospel (Q62052138), Catena in Matthaeum (Q62052131)). Quality of data import overall acceptable, with a few issues: full work available at URL (P953) has wrong value. Needs to be ignored in OR and imported with QS after item creation.
  3. Check consistency of imported data
    1. Manuscripts with multiple Catalogue IDs, usually for sections of the manuscript: Bibliothèque nationale de France Gr. 193 (Q123152571) (Bibliothèque nationale de France Gr. 193, ff. 1–143 (Q123152474), Bibliothèque nationale de France Gr. 193, ff. 144–172 (Q123152564)), Vaticanus Graecus 2275 (Q123152469)
    2. Manuscript sections that are treated as manuscripts by the Catalogue
    3. (Pseudo-)Duplicates (more than 1 catena in the same ms./section)
    4. Conflations: Wikidata:Database reports/Constraint violations/P12109#Single value