Wikidata:WikiProject PCC Wikidata Pilot/Harry Ransom Center

Background

edit

The Harry Ransom Center (Q5671855) is a humanities research center at University of Texas at Austin (Q49213). The Ransom Center collections provide unique insight into the creative process of some of our finest writers and artists, deepening the understanding and appreciation of literature, photography, film, art, and the performing arts. Access and discovery of the collections (https://www.hrc.utexas.edu/collections/) is provided through a series of tools including EAD findings aids, item level SQL databases and MARC bibliographic records. While the Center contributes to PCC NACO for the authority control of bibliographic entities, creators and subjects of archival and museum like collections are managed in a local name authority file. This authority file is currently an internal database, though it has a great potential as a discovery tool for patrons and researchers. During the past few years, the HRC Description and Access department has engaged in a number of projects aimed to enhance the HRCNAF including alignment with NACO policies and entity reconciliation with external authoritative sources such as LCNAF, VIAF and ULAN. During Summer 2020, Description and Access hosted a University of Texas at Austin School of Information (Q7896456) graduate student to conduct her capstone project. Alyssa Anderson's project served as a pilot for identifying workflows, training, and policy needs for a scalable and sustainable integration of wikidata in some of the Center's Description and Access workflows.


Aim and Scope

edit

The aim of the HRC Wikidata project is to explore the benefits and opportunities resulting from linking a local authority file to Real World Objects described in this open and multilingual knowledgebase. The Ransom Center project uses the property archives at (P485) to connect a local entry in the HRCNAF with a wikidata item. The current scope of the project is limited to creators of archival collections held at the Ransom Center, but we expect the initial results of the project to inform additional opportunities for contributions and new types of connexion.

UPDATE: from 04/07/2021 the Ransom Center project also uses the property personal library at (P9419) to connect creators of personal libraries held at the Ransom Center with their wikidata items.

UPDATE: from 09/10/2021 the Ransom Center project also uses the property has works in the collection (P6379) to connect creators of artworks and photographs held at the Ransom Center with their wikidata items.

This project has a multiphase approach:

Phase 1

  • Identify best practices for the implementation of the archives at (P485) property
  • Establish a profile for HRC Collections as a statement source
  • Establish a profile for the description of new Wikidata items and/or enhancement of existing ones
  • Establish Quality Control and maintenance workflows
  • Propose a new property "personal library at" and identify best practices for its implementation

Phase 2

Phase 3

  • Establish a strategy to integrate wikidata items editing/creation within HRC Description and Access' workflow

Contributors

edit

HRC best practices

edit

Describing HRC CPF entities

edit

Our framework for description of CPF entities in wikidata is the Wikidata and MARC21 authority mappings (consulted on 3/12/2021) and the Archives Linked Data Interest Group Wikiproject.


People

Property Value Data type Usage note
label Most common name that the person is known by String Enter name in direct order without any qualifying information such as dates. Might or might not be the same as the full name. Must be associated with a language and may repeat for multiple languages.
description Brief description about the person String The description is designed to disambiguate items with the same or similar labels. Think of it as the continuation of "This person is/was...". Should start with lower case unless is a proper noun.
also Known as Alternative forms of the name for the person String Use it to record variant forms of the name, such as differences in spelling, punctuation, abbreviations (e.g. initials), older forms of the name (e.g. maiden names).
instance of (P31) human (Q5) Item n/a
date of birth (P569) The most specific date known for the person's birth Time When this property is used with items of living people it may violate privacy; statements should generally not be supplied unless they can be considered widespread public knowledge or openly supplied by the individual themselves. If different dates are stated in different sources, record both dates with references. Values may include uncertain dates such as the 1980s or 19th century. See Help:Dates for more information about formatting dates.
date of death (P570) The most specific date known for the person's birth Time When this property is used with items of living people it's likely to be challenged; as a result those statements have to be supported by a reliable public source as suggested in Wikidata:Living people. If different dates are stated in different sources, record both dates with references. Values may include uncertain dates such as the 1980s or 19th century. See Help:Dates for more information about formatting dates.
place of birth (P19) Geographic location where the person was born Item Do not use if the statement is likely to harm the person being described. Record the most specific place known, whether country, province, city, or even specific location. If different places of birth are stated in different sources, record both places with references.
place of death (P20) Geographic location where the person died Item Do not use if the statement is likely to harm the person being described. Record the most specific place known, whether country, province, city, or even specific location. If different places of death are stated in different sources, record both places with references.
floruit (P1317) Date when the person was known to be active or alive, when birth or death not documented (fl. = "floruit") Time If different dates are stated in different sources, record both dates with references. Values may include uncertain dates such as the 1980s or 19th century. See Help:Dates for more information about formatting dates.
honorific prefix (P511) Word or expression used before a name, in addressing or referring to a person Item Find appropriate wikidata item for the title of honor (e.g. Sir)
residence (P551) Geographic location where the person is or has been resident Item When this property is used with items of living people it may violate privacy; statements should generally not be supplied unless they can be considered widespread public knowledge or openly supplied by the individual themselves. Record the most specific place known, whether country, province, city, or even specific location. If different places of residence are stated in different sources, record both places with references.
sex or gender (P21) Sex or gender identity of the person Item When this property is used it may violate privacy; statements should generally not be supplied unless they can be considered widespread public knowledge or openly supplied by the individual themselves. Find appropriate wikidata item for the gender identity (e.g. Female).
languages spoken, written or signed (P1412) Language(s) the person speaks, writes or signs, including the native language(s) Item Find appropriate wikidata item for the language(s) (e.g. English).
native language (P103) Language(s) the person has learned from early childhood Item Do not use if the statement is likely to harm the person being described. Find appropriate wikidata item for the language(s) (e.g. English).
field of work (P101) Field of specialization of a person or organization Item Use to indicate the area in which a person worked (e.g. philosophy).
occupation (P106) Occupation of the person no Use to designate a specific profession or occupation (e.g. writer).
member of (P463) Organization, club or musical group to which the person belongs Item Do not use if the statement is likely to harm the person being described. Do not use for membership in ethnic or social groups, nor for holding a position (e.g. member of the Parliament).
birth name (P1477) Full name of the person at birth, if different from their current, generally used name Monolingual text When this property is used with items of living people it may violate privacy; statements should generally not be supplied unless they can be considered widespread public knowledge or openly supplied by the individual themselves. Must be associated with a language and may repeat for multiple languages.
pseudonym (P742) Alias used by the person String Do not use if the statement is likely to harm the person being described.
mother (P25), father (P22), sibling (P3373), spouse (P26), child (P40), unmarried partner (P451) Immediate family type relationships Item Use to record relationships among people.
archives at (P485) Harry Ransom Center (Q5671855) Item n/a
personal library at (P9419) Harry Ransom Center (Q5671855) Item n/a
has works in the collection (P6379) Harry Ransom Center (Q5671855) Item n/a
on focus list of Wikimedia project (P5008) WikiProject PCC Wikidata Pilot/Harry Ransom Center (Q105936481) Item n/a
Library of Congress authority ID (P244) Library of Congress identifier for persons, organizations, events, places, titles, and subject headings External identifier Enter only the ID No., not the whole URI (e.g.: n79022935)
Union List of Artist Names ID (P245) Identifier from the Getty Union List of Artist Names External identifier Enter only the ID No., not the whole URI (e.g.: 500115588)
VIAF ID (P214) Identifier for the Virtual International Authority File database External identifier Enter only the ID No., not the whole URI (e.g.: 120062731)
SNAC ARK ID (P3430) Identifier for items in the Social Networks and Archival Context system External identifier Enter only the ID No., not the whole URI (e.g.: w6b38jcj)


Corporate bodies

Property Value Data type Usage note
label Most common name that the corporate body is known by String Enter name in direct order without any qualifying information. Might or might not be the same as the full name. Must be associated with a language and may repeat for multiple languages.
description Brief description about the corporate body String The description is designed to disambiguate items with the same or similar labels. Think of it as the continuation of "This corporate body is/was...". Should start with lower case unless is a proper noun.
also Known as Alternative forms of the name for the corporate body String Use it to record variant forms of the name, such as differences in spelling, punctuation, abbreviations (e.g. acronyms), older forms of the name.
instance of (P31) Class of which this subject is a particular example and member Item There are many wikidata items which represent the type of entity being described from very generic (e.g. organization (Q43229), public company (Q891723), business (Q4830453), society (Q8425)) to very specific (e.g. photographic studio (Q672070))
industry (P452) Specific industry of the organization Item Use to document the industry in which a corporate body operates.
field of work (P101) Field of specialization of the organization Item Use when the corporate body acts in a sector or field that is not an industry.
inception (P571) Date of creation of the corporate body Time Values may include uncertain dates such as the 1980s or 19th century. See Help:Dates for more information about formatting dates.
dissolved, abolished or demolished date (P576) Date at which the corporate body ceased to exist Time Values may include uncertain dates such as the 1980s or 19th century. See Help:Dates for more information about formatting dates.
headquarters location (P159) City where the corporate body's headquarters is or has been situated Item Use for corporate bodies with many locations or branches.
street address (P6375) Full street address where subject is located Monolingual text Use for corporate bodies which can be associated with one street address. Include building number, city/locality, post code, but not country.
located in the administrative territorial entity (P131) Territory within an Administrative entity where the corporate body is located Item
country (P17) Country associated with the corporate body Item
official website (P856) URL of the official homepage (current or former) URL This should include the http(s):// or other appropriate prefix. A corporate entity should only have one official website per language/country.
founded by (P112) Founder or co-founder of the corporate body Item Use to record relationship between people and the corporate body
owned by (P127) Owner of the corporate body Item Use to record relationship between people and the corporate body
part of (P361) Larger entity of which the corporate body is part of Item Use to record hierarchical relationships among corporate bodies
archives at (P485) Harry Ransom Center (Q5671855) Item n/a
on focus list of Wikimedia project (P5008) WikiProject PCC Wikidata Pilot/Harry Ransom Center (Q105936481) Item n/a
Library of Congress authority ID (P244) Library of Congress identifier for persons, organizations, events, places, titles, and subject headings External identifier Enter only the ID No., not the whole URI (e.g.: n79022935)
Union List of Artist Names ID (P245) Identifier from the Getty Union List of Artist Names External identifier Enter only the ID No., not the whole URI (e.g.: 500115588)
VIAF ID (P214) Identifier for the Virtual International Authority File database External identifier Enter only the ID No., not the whole URI (e.g.: 120062731)
SNAC ARK ID (P3430) Identifier for items in the Social Networks and Archival Context system External identifier Enter only the ID No., not the whole URI (e.g.: w6b38jcj)

The "archives at" property

edit

Link to Property talk:P485.


Qualifiers for "archives at"

 
RDF mapping diagram for HRC "archives at" property
Property Value Data type Usage note
title (P1476) Collection title Monolingual text Use the title as found on the HRC Collections db "Collection title" field. Always include a collection title
inventory number (P217) Alpha numeric code for the collection Monolingual text Find this code on the HRC Collections db

When adding the archives at (P485) property, always include the collection finding aid as a source (see HRC Collections as statement sources below)


Who is using the archives at (P485) property? here

The "personal library at" property

edit

Link to Property talk:P9419.


For the purpose of this project, we identify a "personal library" as a collection of more than fifty titles that once belonged to a post-1800 individual and twenty five for pre-1800 individuals. These parameters are based on the definition given of an author's library by Richard Oram and Joseph Nicholson on their compilation Location and Bibliographical Guide to Writers' Libraries (Oram & Nicholson, 2014). The wider semantics of the personal library at (P9419) property will allow us to use the property not just with author's libraries currently held at the Ransom Center, but also with other types of personal libraries.

We welcome the input of the Special Collections community in fine tuning best practices for the use of personal library at (P9419).

Sources:

  • Oram, R., & Nicholson, J. (2014). Collecting, curating, and researching writers’ libraries : a handbook / edited by Richard W. Oram, with Joseph Nicholson. Rowman & Littlefield.
  • The Library chronicle of the University of Texas. (1944). Library, University of Texas. (https://babel.hathitrust.org/cgi/mb?a=listis&c=758668079)


Qualifiers for "personal library at"

 
RDF mapping diagram for HRC "personal library at" property
Property Value Data type Usage note
title (P1476) Collection title Monolingual text Use the title as found on the HRC Collections db "Collection title" field. Always include a collection title
inventory number (P217) Alpha numeric code for the collection Monolingual text Find this code on the HRC Collections db

Since there are no Book Collection Records on public display at the institution Website, source statements for personal library at (P9419) will have to be provided by adding an offline reference.


Who is using the personal library at (P9419) property? here


The "has works in the collection" property

edit

Link to Property talk:P6379.


Qualifiers for "has works in the collection"

Do not include any qualifiers.

When adding the has works in the collection (P6379) property, always include the collection finding aid as a source (see HRC Collections as statement sources below).


Who is using the has works in the collection (P6379) property? here

Statement references

edit

Statements added by Ransom Center project team should always include references citing items or collections within the Ransom Center holdings. More information about working with sources in Wikidata can be found on Help:Sources.


Citing a collection that has a published finding aid

Property Value Data type Usage note
reference URL (P854) https://norman.hrc.utexas.edu/fasearch/findingAid.cfm?eadid=[EAD Num.] URL EAD number can be obtained on the Finding Aids Management database
retrieved (P813) DD Month YYYY Time Date or point in time that the finding aid was consulted to retrieve the information


Citing a collection that does not have a published finding aid

Property Value Data type Usage note
? Collection title Monolingual text Check with public services regarding collection/item citation
inventory number (P217) Alpha numeric code for the collection String Find this code on the HRC Collections db


Citing a book

Property Value Data type Usage note
stated in (P248) Edition of the book used as a source Item Use directions on Help:Sources#Books to create new items for works and editions if the item does not exist in wikidata

Bulk uploads

edit

The Ransom Center has tested the use of Quickstatements for the addition of new "archives at" and "personal library at" statements. There is very robust documentation about the use of the Import CSV commands in Help:QuickStatements.


Workflow

  • Copy/paste the template table below on a spreadsheet, fill it out following the sample values, and save it as a .csv file
  • Open the .csv file with notepad and review. You will notice that additional quotations have been added around the datatype properties, and that is fine, since this is what the Import CSV commands is expecting. Your text should look like this:


qid,P485,qal217,qal1476,S854,s813
Q3208122,Q5671855,"""MS-54100""","en:""Wilson Barrett Papers""","""https://norman.hrc.utexas.edu/fasearch/findingAid.cfm?eadid=01178""",+2021-09-01T00:00:00Z/11


  • If there are extra quotations before and after the 'en:', you can use the find and replace function to fix it
  • Create a new command batch for wikidata in quick statements and copy/paste the .csv commands from the notepad
  • Select the "Import CSV command" button and then hit the "Run" button


Import CSV commands for "archvies at"

The following is the template to use for import CSV commands.

qid P485 qal217 qal1476 S854 s813
Q3208122 Q5671855 "MS-54100" "en:""Wilson Barrett Papers" "https://norman.hrc.utexas.edu/fasearch/findingAid.cfm?eadid=01178" +2021-09-01T00:00:00Z/11


Import CSV commands for "personal library at"

The following is the template to use for import CSV commands.


Import CSV commands for "on focus list of Wikimedia project"

The following is the template to use for import CSV commands.

qid P5008
Q3208122 Q105936481

Quality Control and metrics

edit

The following queries have been defined to keep track of project deliverables and provide Quality Control

Querying items


Querying qualifiers



Querying references



Local discovery testing

edit

The following queries have been defined to test local discovery features


Project questions, ideas and further research

edit

Additional properties useful for expanding HRC Wikidata integration


Useful resources

edit

WikiProjects