Wikidata:Property proposal/WorldCat Entities ID

WorldCat Entities ID edit

Originally proposed at Wikidata:Property proposal/Generic

Descriptionidentifier for a person, work, or place from the WorldCat Entities linked data service
RepresentsWorldCat Entities (Q112122720)
Data typeExternal identifier
Domainitem
Allowed values[a-zA-Z0-9]{26}
Example 1Stacey Abrams (Q7595813)E39PCjBmpqCRgMjGFXqMvKBFJC
Example 2Helen Clark (Q180383)E39PBJrcqvXdm3kkwGr7HVG8md
Example 3Gone with the Wind (Q2875)E39PCG49K6QCxWwHvtYhKMKkMq
Example 4The Handmaid's Tale (Q1541914)E39PCGm6vG9GDfJ46yBj6yC4YK
Example 5Yuba County (Q196014)E39PBJmtyb8rhVKTTQkMvc9VYP
Example 6Eritrea (Q986)E39PBJfrJjjP9djpqRMMWPCPcP
Sourcehttps://id.oclc.org/worldcat/entity
Planned useadding to existing and new items for persons, works, and places
Expected completenessalways incomplete (Q21873886)
Formatter URLhttps://id.oclc.org/worldcat/entity/$1
See alsoOCLC work ID (P5331), WorldCat Identities ID (superseded) (P7859)
Applicable "stated in"-valueWorldCat Entities (Q112122720)

Motivation edit

WorldCat Entities (Q112122720) is a linked data service from OCLC for persons, works, and places, funded in part by a grant from the Mellon Foundation, in partnership with an advisory group of 28 libraries. The system allows users to browse through different languages and explore the way each entity links to other external vocabularies and authority files for further context. UWashPrincipalCataloger (talk) 20:41, 24 May 2022 (UTC)[reply]

Discussion edit

@UWashPrincipalCataloger, Pteropotamus, Emwille, Sheilatb, Jimfhahn, Kiwigirl3850: @Clements.UWLib, BeLucky, Epìdosis:   Done WorldCat Entities ID (P10832)--Alexmar983 (talk) 16:52, 19 June 2022 (UTC)[reply]

Nice! @UWashPrincipalCataloger, Pteropotamus, Emwille, Sheilatb, Jimfhahn, Kiwigirl3850:

Can someone provide some info about it? How many items? What does it link to? Is there a dump or SPARQL endpoint?

From URLs like this it's obvious that it's PARTLY driven by Wikibase:

--Vladimir Alexiev (talk) 16:11, 1 July 2022 (UTC)[reply]

I was on the advisory board while it was being developed from Feb 2020 - Dec 2021. Part of my work on the advisory board was evaluating the interface and APIs.
The EMI service has people and works as their focus; but places also hung off of those entities.
The data is enriched with Wikidata and the initial system began as wikibase. Though I understood that much of the wikibase code for the front-end is completely replaced with an OCLC standard front end stack. I think they used Amazon Neptune for the SPARQL.
Eventually API access will be available, I did evaluate a version of the API where we could use SPARQL statements against the service. I think that the API will not be a free service though.
As for the number of entities in the system -- OCLC made use of previous experimentation with their Work IDs from worldcat.org. According to a 2014 presentation there could have been ~360 million work IDs then: https://www.oclc.org/research/events/2017/12-12.html
OCLC's worldcat is made up of 2 billion items though there is not a 1:1 match of work to items. Rather many items relate to a single work. OCLC identities represented people identifiers and their associated works: http://www.worldcat.org/identities/
I also believe this infrastructure (OCLC EMI) to be a place for sustainable access to work and people IDs for bibliographic data. There is alot more that I'd like to know about the service, too -- but I'd be inclined to await a webinar from OCLC with more concrete details about numbers of entities and API access plans. Jimfhahn (talk) 17:52, 1 July 2022 (UTC)[reply]