Property talk:P6597
Documentation
ID of corresponding entry in the DFD online dictionary of family names
The Digital Dictionary of Surnames in Germany (Q61889795) publishes new entries every two weeks, on the first and fifteenth day of the month. With every update, two lists are also updated on the DFD server:
- http://www.namenforschung.net/alle.csv – list of all published entries
- http://www.namenforschung.net/neu.csv – list of newly published entries from the last update
Both lists are formatted like the upload format for M’n’M (not yet adjusted to the M’n’M update of February 2021). The Mix’n’match catalog will soon be updated every two weeks from the first list.
Additional matching is possible with an external workflow, which produces a list ready for QuickStatements. This workflow is intended to only make unequivocal 1:1 matches and avoid constraint violations.[1-9][0-9]{0,5}|10[0-9]{5}
”: value must be formatted using this pattern (PCRE syntax). (Help)List of violations of this constraint: Database reports/Constraint violations/P6597#Format, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P1705, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P282, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P31, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Entity types
List of violations of this constraint: Database reports/Constraint violations/P6597#Scope, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Item P407, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#language
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'de' language, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'en' language, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P6597#Label in 'es' language, search, SPARQL
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
there should be just one native label (P1705) statement on items (Help)
Violations query:
SELECT ?item (COUNT(DISTINCT ?st) as ?count) (COUNT(DISTINCT str(?nl)) as ?count2) (GROUP_CONCAT(DISTINCT str(?nl); separator=", ") as ?nls) WHERE { ?item wdt:P6597 [] . ?item p:P1705 ?st . ?st ps:P1705 ?nl . } GROUP BY ?item HAVING (?count2 > 1) ORDER BY DESC(?count2) ?item LIMIT 500
List of this constraint violations: Database reports/Complex constraint violations/P6597#single native label
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
check native label (P1705) and label in English (en) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"en") as ?en_label) FILTER NOT EXISTS { ?item rdfs:label ?en_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#en label should match P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
check native label (P1705) and label in German (de) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"de") as ?de_label) FILTER NOT EXISTS { ?item rdfs:label ?de_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#de label should match P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
check native label (P1705) and label in Spanish (es) (Help)
Violations query:
SELECT ?item ?nl { ?item wdt:P6597 ?value . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl),"es") as ?es_label) FILTER NOT EXISTS { ?item rdfs:label ?es_label } }
List of this constraint violations: Database reports/Complex constraint violations/P6597#es label should match P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
labels would generally be available in several languages (Help)
Violations query:
SELECT ?item ?nl (COUNT(*) as ?count) { ?item wdt:P6597 ?value . ?item rdfs:label ?l . ?item wdt:P1705 ?nl . } GROUP BY ?item ?nl HAVING ( ?count < 9)
List of this constraint violations: Database reports/Complex constraint violations/P6597#labels in several languages
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
Default English language description for Latin script family names is "family name" (Help)
Violations query:
SELECT ?item ?en_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?en_desc . FILTER( lang(?en_desc)="en" && !CONTAINS( ?en_desc, "family name" ) ) ?item wdt:P282 wd:Q8229 }
List of this constraint violations: Database reports/Complex constraint violations/P6597#en description to include "family name"
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
Default German language description for Latin script family names is "Familienname" (Help)
Violations query:
SELECT ?item ?de_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item schema:description ?de_desc . FILTER( lang(?de_desc)="de" && !CONTAINS( ?de_desc, "Familienname" ) ) ?item wdt:P282 wd:Q8229 }
List of this constraint violations: Database reports/Complex constraint violations/P6597#de description to include "Familienname"
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
Japanese description format is generally "姓 (<P1705 value>)" (Help)
Violations query:
SELECT ?item ?nl ?ja_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ja_desc . FILTER(lang(?ja_desc)="ja" && !CONTAINS( ?ja_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ja description to include P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
Russian description format is generally "фамилия - <P1705 value>" (Help)
Violations query:
SELECT ?item ?nl ?ru_desc { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . ?item schema:description ?ru_desc . FILTER(lang(?ru_desc)="ru" && !CONTAINS( ?ru_desc, ?nl ) ) ?item wdt:P282 wd:Q8229 . } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ru description to include P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
Native label should be an alias in Russian (Help)
Violations query:
SELECT ?item ?nl ?alt { ?item wdt:P6597 [] . hint:Prior hint:rangeSafe true . ?item wdt:P1705 ?nl . BIND( strlang(str(?nl), "ru") as ?alt) FILTER NOT EXISTS { ?item skos:altLabel ?alt } } LIMIT 200
List of this constraint violations: Database reports/Complex constraint violations/P6597#ru alias to include P1705 value
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
family names used as P734 values, but without a P6597 statement. Selection by nationalities (Help)
Violations query:
SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { VALUES ?c { wd:Q183 wd:Q16957 wd:Q713750 } ?p wdt:P27 ?c . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of this constraint violations: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, nat)
![](http://upload.wikimedia.org/wikipedia/commons/thumb/a/a6/Pictogram_voting_comment.svg/40px-Pictogram_voting_comment.svg.png)
family names used as P734 values, but without a P6597 statement. Selection by place of birth (Help)
Violations query:
SELECT ?item ?l ?count ?sample WITH { SELECT ?item (COUNT(DISTINCT ?p) as ?count) (SAMPLE(?p) as ?sample) { ?p wdt:P19 / wdt:P17 wd:Q183 . hint:Prior hint:rangeSafe true . ?p wdt:P734 ?item . ?p wdt:P31 wd:Q5 . } GROUP BY ?item HAVING ( ?count > 20 ) } as %a WHERE { INCLUDE %a FILTER NOT EXISTS { ?item wdt:P6597 [] } ?item rdfs:label ?l . FILTER(lang(?l) = "de" ) } ORDER BY DESC(?count) LIMIT 100
List of this constraint violations: Database reports/Complex constraint violations/P6597#Most frequent P734 values without property (Germany, POB)
Statistics
editNames used in family name (P734)
editFrequency of uses of names as family name (P734)-values in Wikidata.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
WDQS | PetScan | TABernacle | Find images | Recent changesrange | names | total_items | sample |
---|---|---|---|
0 | 21562 | 0 | Isakovic |
1 | 7667 | 7667 | Ayik |
2-4 | 11189 | 34321 | Jansch |
5-9 | 3947 | 28690 | Degenkolb |
10+ | 7204 | 155830 | Umbach |
50+ | 1532 | 106419 | Hertzberg |
100+ | 1640 | 345663 | Lockwood |
500+ | 277 | 194590 | Yates |
1000+ | 247 | 492388 | Johnston |
5000+ | 20 | 126392 | Hall |
10000+ | 5 | 65766 | Johnson |
Names used in family name (P734) (Germany)
editFrequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Germany (Q183) only.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
WDQS | PetScan | TABernacle | Find images | Recent changesrange | names | total_items | sample name | sample person |
---|---|---|---|---|
0 | 36099 | 0 | Aaron | |
1 | 7884 | 7884 | Arntzen | Helmut Arntzen |
2-4 | 7173 | 20670 | Angel | Tina Angel |
5-9 | 1634 | 11783 | Eisenmann | Hans Eisenmann |
10+ | 2092 | 41828 | Gehrke | Hans-Joachim Gehrke |
50+ | 247 | 16628 | Nickel | Rafael Nickel |
100+ | 150 | 26844 | Schumann | Conrad Schumann |
500+ | 9 | 5992 | Schneider | Bernd Schneider |
1000+ | 2 | 3327 | Schmidt | Willi Schmidt |
Names used in family name (P734) (Austrian)
editFrequency of uses of names as family name (P734)-values in Wikidata. Items with country of citizenship (P27) = Austria (Q40) only.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
WDQS | PetScan | TABernacle | Find images | Recent changesrange | names | total_items | sample name | sample person |
---|---|---|---|---|
0 | 49721 | 0 | Wilkinson | |
1 | 2723 | 2723 | Lamm | Erich Lamm |
2-4 | 1995 | 5699 | Brown | Vanessa Brown |
5-9 | 406 | 2923 | Abeles | Otto Abeles |
10+ | 397 | 7233 | Walter | Anton Walter |
50+ | 34 | 2232 | Leitner | Thea Leitner |
100+ | 14 | 1830 | Huber | Franz Jägerstätter |
Discussion
editIs item-requires-statement constraint (Q21503247) with language of work or name (P407) necessary?
editIt seems to me that item-requires-statement constraint (Q21503247) with language of work or name (P407) is used as a convenient way to check the completeness of family name (Q101352) items? The applicability of Digital Dictionary of Surnames in Germany ID (P6597) is independent of language of work or name (P407), in my opinion. Wouldn’t it be better to rely on EntitySchema:E734 for this check?
(A colleague raised the concern that this constraint might give the impression that the Digital Dictionary of Surnames in Germany ID (P6597) statement is erroneous, and I see the point.) @Jura1
Julian Jarosch (digicademy) (talk) 17:19, 19 December 2019 (UTC)
- Eventually every item for a family name should have one or several such statements, but, as one can see on Property talk:P734/numbers/values for values of P734, we are far from that.
- Beyond the format constraint, I don't think constraints indicate that, but I suppose we could change it to a mere suggestion (done that). --- Jura 07:00, 21 January 2020 (UTC)
- Great, thank you! Julian Jarosch (digicademy) (talk) 09:59, 21 January 2020 (UTC)
I have removed the restriction conflicts-with constraint (Q21502838) because the dictionary also contains Ukrainian, Russian, Japanese names etc. and an error is displayed when merging them, see e.g. Antonjuk (Q107453552)/Antoniuk (Q12784938) and cf. http://www.namenforschung.net/id/name/300261/1 or conflicts-with constraint (Q21502838) etc. --HarryNº2 (talk) 13:29, 8 August 2022 (UTC)