Wikidata:WikiProject Alba amicorum National Library of the Netherlands/Album owners

This project is part of the Wikidata efforts of the Koninklijke Bibliotheek Nederland

Retrieve NTA, VIAF and Wikidata ids from ISNI id

(see https://plein.kb.nl/thoughts/18515)

Goal: starting from a person name and an ISNI identifier, find the corresponding NTA, VIAF and Wikidata identifiers

For instance, we start with ISNI=0000000029132470 for Marcgravius, Christianus

Method 1: SPARQL NTA on data.bibliotheken.nl

Go to http://data.bibliotheken.nl/sparql? and copy-paste this query:

PREFIX schema: <http://schema.org/>
SELECT DISTINCT
  (GROUP_CONCAT(DISTINCT(?isni1); separator="") as ?isni)
  (GROUP_CONCAT(DISTINCT(?nta1); separator="") as ?nta)
  (GROUP_CONCAT(DISTINCT(?viaf1); separator="") as ?viaf)
  (GROUP_CONCAT(DISTINCT(?wikidata1); separator="") as ?wikidata)
WHERE {
  # All sameAs links of the matched person
  ?person schema:sameAs ?x.
  # Strip the base URI from each link; STRAFTER yields "" when the base does not match
  BIND(STRAFTER(STR(?person),"http://data.bibliotheken.nl/id/thes/") AS ?nta1).
  BIND(STRAFTER(STR(?x),"http://www.isni.org/isni/") AS ?isni1).
  BIND(STRAFTER(STR(?x),"http://viaf.org/viaf/") AS ?viaf1).
  BIND(STRAFTER(STR(?x),"http://www.wikidata.org/entity/") AS ?wikidata1).
  {
    # Find the NTA person that has a sameAs link containing the ISNI
    SELECT DISTINCT ?person ?id
    WHERE {
      ?person schema:mainEntityOfPage/schema:isPartOf <http://data.bibliotheken.nl/id/dataset/persons> .
      ?person schema:sameAs ?id.
      # STR() makes the regex match on the IRI text rather than on the IRI itself
      FILTER(regex(STR(?id), '0000000029132470', 'i')).
    }
  }
}

Running this query takes some time.

A JSON response can be selected from the Results Format dropdown.
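To run Method 1 for many ISNIs without copy-pasting, the query can be sent from a script. The following is a minimal sketch, not a tested client for this endpoint: it assumes the endpoint accepts a standard SPARQL protocol GET request with a `query` parameter and honours the `Accept: application/sparql-results+json` header. The names `build_query`, `parse_bindings` and `lookup` are made up for this illustration, and the query text is essentially the one above with the ISNI replaced by a placeholder.

```python
import json
import re
import urllib.parse
import urllib.request

ENDPOINT = "http://data.bibliotheken.nl/sparql"  # endpoint named in Method 1

# Essentially the query from Method 1; __ISNI__ is a placeholder of this sketch.
QUERY_TEMPLATE = """
PREFIX schema: <http://schema.org/>
SELECT DISTINCT
  (GROUP_CONCAT(DISTINCT(?isni1); separator="") as ?isni)
  (GROUP_CONCAT(DISTINCT(?nta1); separator="") as ?nta)
  (GROUP_CONCAT(DISTINCT(?viaf1); separator="") as ?viaf)
  (GROUP_CONCAT(DISTINCT(?wikidata1); separator="") as ?wikidata)
WHERE {
  ?person schema:sameAs ?x.
  BIND(STRAFTER(STR(?person),"http://data.bibliotheken.nl/id/thes/") AS ?nta1).
  BIND(STRAFTER(STR(?x),"http://www.isni.org/isni/") AS ?isni1).
  BIND(STRAFTER(STR(?x),"http://viaf.org/viaf/") AS ?viaf1).
  BIND(STRAFTER(STR(?x),"http://www.wikidata.org/entity/") AS ?wikidata1).
  {
    SELECT DISTINCT ?person ?id
    WHERE {
      ?person schema:mainEntityOfPage/schema:isPartOf <http://data.bibliotheken.nl/id/dataset/persons> .
      ?person schema:sameAs ?id.
      FILTER(regex(STR(?id), '__ISNI__', 'i')).
    }
  }
}
"""


def build_query(isni):
    """Return the SPARQL query for one ISNI (16 characters, last may be X)."""
    isni = isni.replace(" ", "").upper()
    # Validating the input also keeps arbitrary text out of the query string.
    if not re.fullmatch(r"\d{15}[\dX]", isni):
        raise ValueError("not a 16-character ISNI: %r" % isni)
    return QUERY_TEMPLATE.replace("__ISNI__", isni)


def parse_bindings(result):
    """Flatten a SPARQL JSON result into a list of {variable: value} dicts."""
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in result["results"]["bindings"]
    ]


def lookup(isni, endpoint=ENDPOINT, timeout=120):
    """Send the query over HTTP (assumed interface, see the text above)."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": build_query(isni)})
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_bindings(json.load(response))
```

Calling `lookup("0000000029132470")` would then return one dictionary with the keys `isni`, `nta`, `viaf` and `wikidata`, provided the endpoint behaves as assumed. The generous timeout reflects that the query is slow.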

Method 2: Webscrape ISNI website

Scrape the ISNI web page for ISNI=0000000029132470 using the Chrome plugin from webscraper.io

The other two methods seem to be much more work or slower than this one, so this appears to be the quickest method

Method 3: From VIAF datadump

Download http://viaf.org/viaf/data/viaf-20210802-links.txt.gz (1.2 GB zipped, 7.7 GB unzipped, 125 million lines) and process it on your own machine

Data looks like this:

http://viaf.org/viaf/100001249	PTBNP|835968
http://viaf.org/viaf/100001249	ISNI|0000000069727399
http://viaf.org/viaf/100009494	PTBNP|85905
http://viaf.org/viaf/100012436	Identities@https://www.worldcat.org/identities/viaf-100012436
http://viaf.org/viaf/100012436	ISNI|0000000068921720
http://viaf.org/viaf/100012436	PTBNP|867238
http://viaf.org/viaf/10001407	Wikipedia@https://cs.wikipedia.org/wiki/Pavel_Hrach
http://viaf.org/viaf/10001407	Identities@https://www.worldcat.org/identities/lccn-nb2002057643
http://viaf.org/viaf/10001407	LC|nb2002057643
http://viaf.org/viaf/10001407	SUDOC|188652647
http://viaf.org/viaf/10001407	WKP|Q46479622
http://viaf.org/viaf/10001407	ISNI|0000000055120174
http://viaf.org/viaf/10001407	BNF|13491530
http://viaf.org/viaf/10001407	NKC|jo20010086910
....
(125 million lines like these)
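Processing the dump does not require unzipping it first or loading it into memory. The sketch below streams the gzipped file and returns all links of the VIAF cluster that contains a given ISNI. It rests on two assumptions taken from the sample above: each line is a VIAF URI followed by whitespace and a `SOURCE|id` (or `Label@url`) link, and all lines of one cluster are adjacent. The function names and the `viaf_id` key are made up for this illustration. In the sample, `WKP` carries the Wikidata id; the source code under which NTA identifiers appear is not shown in the sample, so inspect the returned keys rather than assuming one.

```python
import gzip
import re
from itertools import groupby

# 'SOURCE|id' or 'Label@url': split at the first '|' or '@'.
LINK_PATTERN = re.compile(r"([^|@]+)[|@](.*)")


def parse_line(line):
    """Split one dump line into (viaf_uri, source, local_id)."""
    viaf_uri, link = line.split(None, 1)
    link = link.strip()
    match = LINK_PATTERN.fullmatch(link)
    if match is None:
        return viaf_uri, "", link  # unexpected shape: keep the raw text
    return viaf_uri, match.group(1), match.group(2)


def find_cluster(lines, isni):
    """Return {source: [ids]} for the first cluster linked to `isni`, else None.

    Assumes that the lines of one VIAF cluster are contiguous.
    """
    records = (parse_line(line) for line in lines if line.strip())
    for viaf_uri, group in groupby(records, key=lambda record: record[0]):
        links = {}
        for _, source, local_id in group:
            links.setdefault(source, []).append(local_id)
        if isni in links.get("ISNI", []):
            # Add the bare VIAF number taken from the cluster URI.
            links["viaf_id"] = [viaf_uri.rsplit("/", 1)[-1]]
            return links
    return None


def find_in_dump(path, isni):
    """Stream the gzipped links file without unpacking it to disk."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return find_cluster(handle, isni)
```

For example, `find_in_dump("viaf-20210802-links.txt.gz", "0000000029132470")` scans the file once and stops at the first matching cluster. To look up many ISNIs, a single pass that checks each cluster against a set of wanted ISNIs would avoid rereading 125 million lines per lookup.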

Method 4: Using OpenRefine

Method 5: Custom API via grlc.io + GitHub

  • Docs: