User:Jonathan Groß/tasks
Mix'n'match: matching external datasets with Wikidata items
Catalogues which I created or feel responsible for maintaining:
Biography and Prosopography
Heidelberg Academy for Sciences and Humanities
- mix'n'match catalogue: https://mix-n-match.toolforge.org/#/catalog/108
- Property: HAdW member ID (P2273)
- Status: Matching complete – created missing items
- To Do: Enhance items from catalogue – Keep up with matching
- OK Fetch new members (elected once per year) – courtesy of Andreas Dafferner
Catalogus Professorum Halensis
- https://tools.wmflabs.org/mix-n-match/?mode=catalog&catalog=77
- Property: Catalogus Professorum Halensis ID (P2005)
- Status: Matching complete – created missing items
- To Do: Enhance items from catalogue
Teuchos-Prosopographie
- https://tools.wmflabs.org/mix-n-match/?mode=catalog&catalog=87
- Property: Teuchos ID (P2018)
- Status: Matching complete as of 16:13, 15 October 2015 (UTC)
- To Do: enhance items from catalogue
AcademiaNet
- https://tools.wmflabs.org/mix-n-match/?mode=catalog&catalog=99
- Property: AcademiaNet ID (P2080)
- Status: Matching complete as of 19:02, 13 October 2015 – created missing items
- To Do: Enhance items from catalogue
- Keep up with matching (start of every month)
- 46 new entries, 1 on Wikidata 09:32, 7 November 2015 (UTC)
Sächsische Biografie
- https://tools.wmflabs.org/mix-n-match/?mode=catalog&catalog=103
- Property: Sächsische Biografie (GND) ID (P1710)
- Status: Matching complete as of 09:30, 19 October 2015 (UTC)
- To Do: Create missing items – enhance items from catalogue
Database of Classical Scholars
- https://tools.wmflabs.org/mix-n-match/?mode=catalog&catalog=105
- Property: Database of Classical Scholars ID (P1935)
- Status: Matching complete as of 19:39, 26 October 2015 (UTC)
- To Do: Create missing items – enhance items from catalogue
Still to check:
- Monroe E. Deutsch (Q21524619)
- Emory B. Lease (Q21524620)
- Monroe Nichols Wetmore (Q21524621)
- E. T. Owen (Q21524623)
- Moses Clement Gile (Q21524624)
- Ernest Abell Dale (Q21524626)
- Moses S. Slaughter (Q21524627)
- Ernest Brehaut (Q21524628)
- Natalie Gifford Wyatt (Q21524629)
- Ernest L. Hettich (Q21524631)
- Nathan Dane II (Q21524632)
- Ernest Pascal (Q21524633)
- Nathaniel Williams (Q21524634)
- Ernest Mondell Pease (Q21524636)
- Nelson Glenn McCrea (Q21524637)
- Ethel Hampson Brewster (Q21524638)
- Norman Johnston DeWitt (Q21524640)
- Eugene Stock McCartney (Q21524641)
- Oliver C. Phillips (Q21524642)
- Eugene Tavenner (Q21524644)
- Omera Floyd Long (Q21524645)
- Eunice Work (Q21524647)
- Orsamus Merrill Pearl (Q21524648)
- Eva Johnston (Q21524650)
- Otis Johnson Todd (Q21524651)
- Eva Matthews Sanford (Q21524653)
- Paul A. Clement (Q21524654)
- Evan T. Sage (Q21524657)
- Paul Grady Moorhead (Q21524658)
- Thomas Fitz-Hugh (Q21524660)
- Paul Nixon (Q21524662)
- Fordyce Mitchel (Q21524663)
- Paul R. Coleman-Norton (Q21524664)
- Francis Howard Fobes (Q21524667)
- Peter K. Marshall (Q21524668)
- Francis Marion Austin (Q21524671)
- Peter Wilson (Q21524672)
- Francis R. B. Godolphin (Q21524673)
- Philip Whaley Harsh (Q21524674)
- Frank Burr Marsh (Q21524677)
- Prescott Townsend (Q21524678)
- Frank Card Bourne (Q21524680)
- Preston H. Epps (Q21524682)
- Frank E. Anderson (Q21524683)
- Raymond H. Coon (Q21524684)
- Frank M. Snowden, Jr. (Q21524687)
- Reginald Isaac Wilfred Westgate (Q21524688)
- Franklin Hazen Potter (Q21524690)
- James J. Sheridan (Q21524692)
- William Bernard O’Toole (Q21524695)
- Frederick Carlos Eastman (Q21524697)
- Richard E. Doyle (Q21524699)
- F. Warren Wright (Q21524700)
- Richard Grier Peoples (Q21524702)
- Frederick William Shipley (Q21524704)
- Richard L. Trapp (Q21524705)
- George Converse Fiske (Q21524708)
- Richard Mansfield Haywood (Q21524709)
- George Dwight Kellogg (Q21524711)
- Richard Parsons (Q21524713)
- George Edward Dimock (Q21524715)
- Richard T. Scanlan (Q21524716)
- George Edwin Howes (Q21524719)
- Richard Treat Bruère (Q21524721)
- Robert Steele (Q21524725)
- George J. Ryan (Q21524726)
- R. B. Patton (Q21524727)
- George Meason Whicher (Q21524730)
- Robert Duff Murray (Q21524731)
- George Millet Chase (Q21524734)
- Glanville Downey (Q21524735)
- George P. Bristol (Q21524736)
- Robert Henning Webb (Q21524738)
- George R. Throop (Q21524740)
- Robert J. Rowland (Q21524741)
- Giorgos Thaniel (Q21524743)
- George Washington Paschal (Q21524746)
- Robert Somerville Radford (Q21524748)
- Robert Sawyer (Q21524751)
- G. M. Hirst (Q21524753)
- Rodney Potter Robinson (Q21524755)
- Gessner Harrison (Q21524757)
- Roger A. Hornsby (Q21524758)
- Gilbert Bagnani (Q21524761)
- Roger Miller Jones (Q21524762)
- Gilbert H. Taylor (Q21524763)
- Ross S. Kilpatrick (Q21524765)
- Gladys Martin (Q21524767)
- Roy C. Flickinger (Q21524768)
- Glanville Terrell (Q21524770)
- Roy J. Deferrari (Q21524771)
- Glenn M. Knudsvig (Q21524773)
- Roy Kenneth Hack (Q21524774)
- Glenn R. Morrow (Q21524776)
- Royal C. Nemiah (Q21524777)
- Gonzalez Lodge (Q21524779)
- Grace Lucile Beede (Q21524782)
- Russell M. Geer (Q21524783)
- Graham Vincent Sumner (Q21524786)
- Harry C. Rutledge (Q21524787)
- Grove Ettinger Barber (Q21524789)
- Gustav Hermansen (Q21524792)
- Herbert Musurillo (Q21524793)
- Gustave Adolphus Harrer (Q21524796)
- Hugh Peter O’Neill (Q21524797)
- Harold L. Axtell (Q21524798)
- William Anthony Grimaldi (Q21524800)
- Harold William Miller (Q21524802)
- Sister St. John (Q21524803)
- Harrison Boyd Ash (Q21524805)
- Sally MacEwen (Q21524806)
- Harry Edwin Burton (Q21524808)
- Samuel Willis (Q21524809)
- Harry J. Leon (Q21524811)
- Samuel Loomis Mohler (Q21524812)
- Harry Mortimer Hubbell (Q21524814)
- Sidney G. Ashmore (Q21524815)
- Sinclair MacLardy Adams (Q21524817)
- Harvey Bruce Densmore (Q21524818)
- Skuli Johnson (Q21524819)
- Helen H. Tanzer (Q21524820)
- H. Lamar Crosby (Q21524823)
- Stephen Powelson (Q21524824)
- Steven Lowenstam (Q21524827)
- Henry Lloyd Stow (Q21524828)
- Stewart Irvin Oost (Q21524829)
- Henry Nevill Sanders (Q21524830)
- Susan Dinsmore Tew (Q21524832)
- Henry Parks Wright (Q21524833)
- Susan P. Cobbs (Q21524836)
- Henry Phillips, Jr. (Q21524837)
- Sylvia W. Gerber (Q21524838)
- Herbert Hoffleit (Q21524840)
- Theodore C. Burgess (Q21524842)
- Herbert Cannon Lipscomb (Q21524843)
- Theodore Lyman Wright (Q21524845)
- Herbert Charles Elmer (Q21524846)
- Theodore T. Duke (Q21524848)
- Herbert C. Nutting (Q21524849)
- Thomas Bond Lindsay (Q21524850)
- Herbert Cushing Tolman (Q21524851)
- Thomas Fauss Gould (Q21524854)
- Herbert Jewett Barton (Q21524855)
- Thomas Cutt (Q21524857)
- Herbert Couch (Q21524858)
- Frank Louis Van Cleef (Q21524860)
- Herbert Pierrepont Houghton (Q21524861)
- Larue Van Hook (Q21524864)
- Herbert Wing, Jr. (Q21524865)
- Verne Brinson Schuman (Q21524867)
- Herman Tracy (Q21524868)
- Herman Louis Ebeling (Q21524870)
- Waldo Earle Sweet (Q21524871)
- Walter Blair (Q21524873)
- Walter Dennison (Q21524876)
- Walter Fifield Snyder (Q21524878)
- Walter Raymond Agard (Q21524882)
- Warren Everett Blake (Q21524884)
- Wilbert Lester Carr (Q21524886)
- Wilfred P. Mustard (Q21524888)
- William Alexander Lamberton (Q21524890)
- William Arthur Heidel (Q21524892)
- William Augustus Merrill (Q21524894)
- William B. Royall (Q21524895)
- William Berney Saffold (Q21524897)
- William Charles Korfmacher (Q21524900)
- William Clark Helmbold (Q21524901)
- William Coffman McDermott (Q21524904)
- William Davis Hooper (Q21524906)
- William Dodge Gray (Q21524908)
- William Elisha Peters (Q21524909)
- William Everett Waters (Q21524912)
- William Frank Wyatt (Q21524914)
- William George Fletcher (Q21524915)
- William George Williams (Q21524918)
- William Hamilton Kirk (Q21524921)
- William Harris Stahl (Q21524924)
- William H. Crogman (Q21524926)
- William Meredith Hugill (Q21524928)
- William Merritt Read (Q21524930)
- William Nickerson Bates (Q21524932)
- William Pitkin Wallace (Q21524934)
- William Richard Grey (Q21524935)
- William Robert Jones (Q21524938)
- William Sherwood Fox (Q21524940)
- William Stuart Messer (Q21524942)
- Winthrop Dudley Sheldon (Q21524945)
Magdeburger Biographisches Lexikon
- https://tools.wmflabs.org/mix-n-match/?mode=catalog_details&catalog=111
- Property: Magdeburger Biographisches Lexikon ID (P2277)
- Status: Ongoing
- To Do: Finish matching – create missing items – enhance items from catalogue
Greek myth and mythology
Hederichs gründliches mythologisches Lexicon
I proposed this property (Hederich encyclopedia article (P2272)) in October 2015 while I was working on my PhD thesis (link to discussion). At the time, this was the only comprehensive database of Greek mythical characters I knew of. Magnus Manske then created a mix'n'match catalogue using a webscraper (link). This catalogue has over 9000 entries to be matched with Wikidata items. Note that many of those are cross-references, which in my opinion are not useful to Wikidata (apart from providing alternative headings, maybe). I have classified these cross-references as "not relevant for Wikidata" (2811 so far).
To this day, this catalogue is only partially matched with Wikidata. About 3500 items (39%) are manually matched. The bulk of this work was apparently done by User:Melderick in 2018 with nearly 3200 matches; after that come User:Varina (687 matches), myself (300 matches) and User:Tusculum (211 matches).
There are still about 1400 automatically matched entries which need to be confirmed or corrected by a human, and 1267 entries not matched at all. So there is still work to be done. 13:57, 6 July 2023 (UTC) 15:09, 2 July 2023 (UTC)
EDIT, a few days later: I've found a few mistakes and realised that the manual matches will have to be reviewed as well. *sigh* Jonathan Groß (talk) 13:57, 6 July 2023 (UTC)
UPDATE: Single value and Unique value violations can be used to cross-check the matches. So far matching in the catalogue seems reliable. Jonathan Groß (talk) 10:40, 11 July 2023 (UTC)
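The cross-check described in this update can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the input is a plain list of (item, Hederich ID) pairs exported by some means not shown here, the function name `find_violations` is made up for this sketch, and the Hederich IDs in the usage example are placeholders, not real catalogue entries.

```python
from collections import defaultdict


def find_violations(matches):
    """Group (item_id, hederich_id) pairs and report constraint violations.

    Returns two dicts:
      single_value: items carrying more than one Hederich ID
      unique_value: Hederich IDs attached to more than one item
    """
    ids_per_item = defaultdict(set)
    items_per_id = defaultdict(set)
    for item_id, hederich_id in matches:
        ids_per_item[item_id].add(hederich_id)
        items_per_id[hederich_id].add(item_id)

    single_value = {i: sorted(v) for i, v in ids_per_item.items() if len(v) > 1}
    unique_value = {h: sorted(v) for h, v in items_per_id.items() if len(v) > 1}
    return single_value, unique_value


# Placeholder data: the item IDs appear on this page, the Hederich IDs are invented.
sample = [
    ("Q1784645", "placeholder-a"),
    ("Q1784645", "placeholder-b"),
    ("Q5742616", "placeholder-c"),
]
single, unique = find_violations(sample)
print(single)
print(unique)
```

An item that shows up in `single_value` is not necessarily wrong (one mythical figure may be covered by several Hederich articles), so each hit still needs a human decision.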
UPDATE: While checking the more than 2800 entries marked as "not applicable to Wikidata", I found that most of them were actually applicable. So I undid a lot of work done by User:Melderick and opened these entries up for matching again. Jonathan Groß (talk) 07:31, 12 July 2023 (UTC)
- Challenges and Solutions
- Hederich is very dated and although most learned and useful for its time, cannot be considered an authoritative source for Greek mythology any more. Using its data to create a Who's Who of Greek mythology is problematic.
- Solution: Let us consider Hederich, a standard work of its time, as a standard in and of itself. Even if its information is incomplete and skewed, it rests on solid study of Greek and Latin sources.
- Hederich uses Latin headings while Wikidata has sometimes Greek, sometimes Latin. As the lemmata use only capital letters, they do not differentiate between I and J or U and V. Hence, Zeno.org transliterated every I as I/i (no problem here) and every V as V/v (good heavens!).
- Solution: Be patient. Also when creating new items: Respect common practice ("To each their own"). Anglophone readers prefer Latin, Germans either Greek or Latin (or 'German' versions).
- Hederich takes care to differentiate between persons of the same name. However, Greek myth is notorious for having competing variants of personal names, places and events. This leads to ambiguity in attributing Hederich articles to Wikidata items.
- Solution: Carefully judge case by case. Document and discuss the problematic cases and use them as learning experiences for our ontology of myth.
- Lots remains to be done.
- Solution: Get to it! And make our progress visible.
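The I/J and U/V problem mentioned in the list above can be softened when comparing headings by folding both spellings to a common form first. The following is a minimal sketch, assuming that a crude comparison key is good enough for finding candidates; the function name `heading_key` and the example spellings are illustrative and not taken from the catalogue.

```python
def heading_key(name: str) -> str:
    """Return a comparison key that ignores case and the I/J and U/V distinction.

    The key is only meant for comparing headings, never for display:
    every j becomes i and every v becomes u.
    """
    folded = name.casefold()
    return folded.replace("j", "i").replace("v", "u")


# Illustrative spellings (assumed, not quoted from Zeno.org or Wikidata).
print(heading_key("IVPITER") == heading_key("Jupiter"))
print(heading_key("Ivno") == heading_key("Juno"))
```

Such a key only suggests candidates. It does nothing about Greek versus Latin name forms, and it cannot tell apart different persons of the same name.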
- Status
- Issues
- Hero (Q5742616) I matched to Hero 1. Hederich has the subject as "daughter of Priam", which given the context of the underlying source (Hyg. fab. 90, ToposText) seems correct: Although the chapter is titled "sons and daughters of Priam" and the individuals' gender is not specified, "Hero" appears under that name (with a nominative ending usually found in female names) between "Medusa" and "Creusa". However, the English Wikipedia article w:en:Hero (son of Priam) and its Catalan translation w:ca:Hero (filho de Príamo) both have Hero as "son of Priam". This should be corrected. Jonathan Groß (talk) 15:09, 2 July 2023 (UTC)
- Wikidata:Database reports/Constraint violations/P2272
- Coronus (Q1784645) has 4 Hederich articles matched to it. Jonathan Groß (talk) 08:49, 6 July 2023 (UTC)
- worshipped by (P1049) should be checked, it is sometimes used for characters not considered deities. Jonathan Groß (talk) 10:40, 11 July 2023 (UTC)
Greek and Latin literature
...
Paulys Realenzyklopädie der klassischen Altertumswissenschaft
I'm going to maintain the items relating to the over 17,000 articles from Paulys Realenzyklopädie der klassischen Altertumswissenschaft (RE) featured on the German Wikisource project.
After creating 15756 new items for these articles on May 27th and 28th, there's much to do (adding statements, descriptions and labels):
- English description: "article from Pauly-Wissowa’s RE, a comprehensive encyclopedia on classical antiquity";
- German description: "Artikel aus Paulys Realenzyklopädie der klassischen Altertumswissenschaft (Pauly-Wissowa)"
- instance of (P31) encyclopedia article (Q17329259)
- published in (P1433) Paulys Realenzyklopädie der klassischen Altertumswissenschaft (Q1138524)
- For cross references (not full articles), it would be better to add instance of (P31) cross-reference (Q1302249)
- main subject (P921) will be the most important statement, but it may be difficult to fill this automatically.
So far, this is only a ToDo list. Jonathan Groß (talk) 12:28, 28 May 2015 (UTC)
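The two basic statements from the list above could be prepared in bulk along the following lines. This is a sketch, not the procedure actually used: it assumes the QuickStatements V1 batch format (one command per line, fields separated by tabs), the function name `re_statements` is made up, and the item ID in the usage example is the Wikidata sandbox item standing in for a real RE article item.

```python
RE_WORK = "Q1138524"               # Paulys Realenzyklopädie der klassischen Altertumswissenschaft
ENCYCLOPEDIA_ARTICLE = "Q17329259"
CROSS_REFERENCE = "Q1302249"


def re_statements(item_id: str, is_cross_reference: bool = False) -> list[str]:
    """Build tab-separated commands for one RE lemma.

    Full articles get instance of (P31) encyclopedia article,
    cross-references get instance of (P31) cross-reference;
    both get published in (P1433) pointing at the RE.
    """
    kind = CROSS_REFERENCE if is_cross_reference else ENCYCLOPEDIA_ARTICLE
    return [
        f"{item_id}\tP31\t{kind}",
        f"{item_id}\tP1433\t{RE_WORK}",
    ]


# Q4115189 is the Wikidata sandbox item, used here only as a stand-in.
for line in re_statements("Q4115189"):
    print(line)
```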
Done so far:
- instance of (P31) encyclopedia article (Q17329259) for all articles. Replaced with instance of (P31) cross-reference (Q1302249) for cross-references.
- published in (P1433) Paulys Realenzyklopädie der klassischen Altertumswissenschaft (Q1138524) for all articles and cross-references.
Left to do:
- Adding labels.
- Adding author (P50) according to the authors’ categories.
- Figuring out how to read the RE template to give volume and column numbers.
- Adding volume (P478) and page(s) (P304) (or section, verse, paragraph, or clause (P958)?) to published in (P1433).
- main subject (P921) ... maybe with Wikipedia articles from the RE template?
Jonathan Groß (talk) 09:15, 30 May 2015 (UTC)
To keep up with article creation, I'll regularly keep on:
- [1] Creating new items for entries in cross-reference and article categories.
- [2] Adding published in (P1433) Paulys Realenzyklopädie der klassischen Altertumswissenschaft (Q1138524) to all lemmata (articles and cross-references)
- [3] Adding instance of (P31) cross-reference (Q1302249) to cross-references
- [4] Adding instance of (P31) encyclopedia article (Q17329259) to articles
Jonathan Groß (talk) 09:14, 9 June 2015 (UTC)
Members of the Hellenic Philological Society of Constantinople
I started adding Property:P463 (member of) with qualifiers "start time" and "as" (to qualify the type of membership, e.g. honorary members, corresponding members, ordinary members) to members of the Hellenic Philological Society of Constantinople (1861–1922). They are listed in the front matter of the society’s journal.
Most, but not all volumes of the journal are available online. Digitised volumes are listed on the Greek Wikisource page.
As most of the ordinary and corresponding members are not eligible for Wikipedia articles, I focus on the society’s honorary members, who are high-ranking Ottoman civil servants, Greek Orthodox patriarchs, foreign diplomats and scientists, sometimes bankers and physicians from Constantinople.
The members’ names are given only by surname and initial, but in combination with the stated profession and workplace, identification is possible. Slight errors in the year of membership are to be expected (especially if the volume was published a long time after the election of a member). The main problem with identifying the members is that the lists in the journal have a lot of spelling errors, mainly with foreign names written in Latin letters.
For example: Conrad Bursian is listed in vol. 6 (1871/72) as Boursion C., Καθηγ. τοῦ Πανεπιστ. Βιέννης (= University of Vienna, which is an error for Jena). In the next volume, the surname is "corrected" to Boursian and the university is given as Ἰένη. More examples: Wilhelm Henzen is listed in vol. 6 (1871/72) as Hengen W., which has never been corrected; Hermann Sauppe is listed as "Hermann Sauppre" (under letter H) in vol. 10 (1875/76). Friedrich August Eckstein is listed as "Eckstreïz" in vol. 14 (1878/79).
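Misspellings like these can be pre-sorted with approximate string matching before a human confirms the identification. The sketch below uses Python's standard `difflib` module; the function name `best_surname_match`, the cutoff of 0.6 and the short list of known surnames are assumptions made for this illustration, and a real list of candidates would have to come from elsewhere.

```python
import difflib


def best_surname_match(listed, known_surnames, cutoff=0.6):
    """Return the known surname most similar to the listed spelling, or None.

    Similarity is difflib's ratio (0.0 to 1.0); anything below the
    cutoff is treated as no match.
    """
    hits = difflib.get_close_matches(listed, known_surnames, n=1, cutoff=cutoff)
    return hits[0] if hits else None


# Surnames of the scholars named in the examples above.
known = ["Bursian", "Henzen", "Sauppe", "Eckstein"]

# Spellings as printed in the journal's member lists.
for listed in ["Boursion", "Hengen", "Sauppre", "Eckstreïz"]:
    print(listed, "->", best_surname_match(listed, known))
```

A surname match alone is weak evidence. The stated profession and workplace still have to agree, and those can be wrong too, as the Vienna/Jena example shows.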
Progress so far:
- OK 2015-01-30 checked volumes 6 and 10 for members.
- OK 2015-02-04 checked volumes 3, 7–9 and 11–12 for members.
- OK 2015-02-06 checked volume 14 for members.
- OK 2015-02-10 checked volumes 16–20 for members. Total number now: 330
Sodales Academiae Latinitati Fovendae
The members of the Academia Latinitati Fovendae are Latin scholars from all over the world. Many of them have articles on lawiki, some also on other Wikipedias. The ALF has a list of its members as of 2012-04-20. This list also has the names of the founding members (1967-04-18) along with the people who were declared "founding members" in 1983, i.e. members who were elected into the ALF until 1983. Jonathan Groß (talk) 13:44, 6 February 2015 (UTC)
The following people I didn't find on Wikidata as of today:
- Mauro Agosto (2007), Italy
- Marco Buonocore (2012), Italy
- Neil Coffee (2011), United States of America [5]
- Lucienne Deschamps (1990), France [6]
- Giorgio Di Maria (2013), Italy [7]
- Gérard Freyburger (1998), France [8]
- Dimitrios Koutroubas (1998), Greece VIAF
- Bruno Luiselli (1984), Italy VIAF
- José María Maestre Maestre (2004), Spain [9]
- Piergiorgio Parroni (1982), Italy [10], VIAF
- Françoise Licoppe-Deraedt (2013), Belgium, Honorary Member
- Giancarlo Rossi (2011), Italy, Honorary Member
- Jane O’Neil (2001), United States of America, Honorary Member
- Nicolae Barbu (1967), Romania VIAF
- Edoardo Coleiro (1967), Malta VIAF
- Giuseppe Del Ton (1967), Vatican City VIAF
- Walthère Derouau S.J. (1967), Belgium/Burundi
- Κωνσταντίνος Γρόλλιος (1967), Greece VIAF
- Alfons Isnenghi (1967), Austria, Dr. phil., teacher in Salzburg
- Jan Kábrt (1967), Czechoslovakia
- Stéphane Kresic (1967), Canada VIAF
- William Stuart Maguinness (1967), United Kingdom
- José Maria Mir (1967), Spain VIAF
- Ottorino Morra (1967), Italy
- Vandick Londres da Nóbrega (1967), Brazil VIAF
- Guerino Pacitti (1967), Italy
- Virgilio Paladini (1967), Italy
- Faruk Zeki Perek (1967), Turkey [11]
- Pierre Schmid (1967), Switzerland
- Vincenzo Ussani d’Escobar (1967), Italy
- Madeleine Bonjour (1982), France
- José Jimenez Delgado (1983), Spain
- Anton Daniel Leeman (prior to 1980), Netherlands
- Alain Michel (1973), France
- Isaj Mihajlovič Nahov (1980), USSR
- Boleslaw Povsic (1982), USA
- Michel Rambaud (1974), France
- José Ruysschaert (prior to 1983), Belgium VIAF
- Amleto Tondini (1969), Vatican City
- Gavin B. Townend (1982), UK
- Antonio Traglia (1976), Italy
- Rodolphe Verdiere (1973), Belgium
- Jan Wikariak (prior to 1983), Poland
Jonathan Groß (talk) 15:52, 6 February 2015 (UTC)