Wikidata:WikiProject Books/Book data model

General cases

Property | Work | Edition (original/translated) | Exemplar (manuscript) | Exemplar (printed)
instance of (P31) | Mandatory [1] | Mandatory [2] | Mandatory [3] | Mandatory [3]
has edition or translation (P747) | Optional | na | na | na
edition or translation of (P629) | na | Mandatory | na | na
exemplar of (P1574) | na | na | Mandatory [4] | Mandatory [5]
title (P1476) | Mandatory | Mandatory [6] | Mandatory [6] | Mandatory [6]
subtitle (P1680) | Mandatory if applicable | Mandatory if applicable [6] | Mandatory if applicable [6] | Mandatory if applicable [6]
place of publication (P291) | na | Mandatory | na | - [7]
location of discovery (P189) | na | na | Mandatory if applicable | na
publication date (P577) | na | Mandatory | na | - [7]
inception (P571) | Mandatory [8] | na | - [7] | - [7]
time of discovery or invention (P575) | na | na | Mandatory if applicable | na
author (P50) | Mandatory | Mandatory if different from work | - [7] | - [7]
author of foreword (P2679) | na | Mandatory if applicable | na | - [7]
author of afterword (P2680) | na | Mandatory if applicable | na | - [7]
editor (P98) | na | Mandatory if applicable | na | - [7]
contributor to the creative work or subject (P767) | Mandatory if applicable | - [7] | - [7] | - [7]
publisher (P123) | na | Mandatory | na | - [7]
translator (P655) | na | Optional | Optional | - [7]
illustrator (P110) | na | Optional | Optional | - [7]
printed by (P872) | na | Optional [9] | na | - [7]
named after (P138) | na | na | Optional | Optional
language of work or name (P407) | Mandatory | Mandatory | Mandatory if applicable | - [7]
follows (P155) | Mandatory if applicable | - [7] | - [7] | - [7]
followed by (P156) | Mandatory if applicable | - [7] | - [7] | - [7]
Identifiers
translator (P655) | na | Optional | - [6] | - [6]
Content data
genre (P136) | Optional | - [6] | - [6] | - [6]
movement (P135) | Optional | - [6] | - [6] | - [6]
main subject (P921) | Optional | - [6] | - [6] | - [6]
characters (P674) | Optional | - [6] | - [6] | - [6]
narrative location (P840) | Optional | - [6] | - [6] | - [6]
based on (P144) | Optional | - [6] | - [6] | - [6]
inspired by (P941) | Optional | - [6] | - [6] | - [6]
na: not applicable (the property should never be used in this class of items)
mandatory: the property must always be present in the item, with a value, "no value", or "unknown value"
mandatory if applicable: the property must be present whenever a value exists
optional: the property may be present in the item (with a value) or absent
[1] With value work (Q386724)
[2] With value version, edition or translation (Q3331189)
[3] With value individual copy of a book (Q53731850)
[4] Link to the work item in the case of a unique exemplar
[5] Link to the edition item in the case of several exemplars
[6] In the original language for original editions/exemplars and in the translation language for translated editions/exemplars
[7] Values are retrieved from the higher-level items
[8] Earliest known date of writing, or the date when the manuscript was finished and available for reading, a significant time before the publication date. Application example: manuscript
[9] If different from "publisher"
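
The requirement levels in the legend can be checked mechanically. The following is a minimal sketch, assuming an item is represented as a plain dictionary from property ID to a list of values; that representation, the `REQUIREMENTS` and `check_item` names, and the `Q_WORK` identifier are hypothetical and only serve the illustration. Only a few rows from the Work and Edition columns are transcribed, and "mandatory if applicable" is left out because a missing statement may be legitimate there.

```python
# Requirement levels transcribed from a few rows of the table above
# (Work and Edition columns only). Partial, illustrative subset.
REQUIREMENTS = {
    "work": {
        "P31": "mandatory",    # instance of
        "P1476": "mandatory",  # title
        "P571": "mandatory",   # inception
        "P50": "mandatory",    # author
        "P407": "mandatory",   # language of work or name
        "P629": "na",          # edition or translation of
        "P577": "na",          # publication date
        "P123": "na",          # publisher
    },
    "edition": {
        "P31": "mandatory",    # instance of
        "P629": "mandatory",   # edition or translation of
        "P1476": "mandatory",  # title
        "P291": "mandatory",   # place of publication
        "P577": "mandatory",   # publication date
        "P123": "mandatory",   # publisher
        "P747": "na",          # has edition or translation
        "P571": "na",          # inception
    },
}


def check_item(statements, level):
    """Return (missing_mandatory, unexpected_na) as sorted property ID lists.

    Per the legend, a mandatory property counts as present even when it
    holds a "no value" or "unknown value" marker, so only the presence
    of the property key is checked, not its content.
    """
    rules = REQUIREMENTS[level]
    missing = sorted(
        p for p, rule in rules.items() if rule == "mandatory" and p not in statements
    )
    unexpected = sorted(
        p for p, rule in rules.items() if rule == "na" and p in statements
    )
    return missing, unexpected


# Hypothetical edition item: no publisher or place of publication,
# and an inception date that does not belong at the edition level.
edition = {
    "P31": ["Q3331189"],
    "P629": ["Q_WORK"],  # hypothetical work item ID
    "P1476": ["Example title"],
    "P577": ["1900"],
    "P571": ["1899"],
}
missing, unexpected = check_item(edition, "edition")
print("missing mandatory:", missing)
print("should not be present:", unexpected)
```

For this sample item the check reports publisher (P123) and place of publication (P291) as missing, and inception (P571) as present although it is marked na for editions.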

Special cases

...

General principles for data structure

  • The classification applies the FRBR system, using three levels to describe written documents:
    • work level
    • edition level
    • exemplar level
  • When only one edition or only one exemplar exists, a work item is still necessary. If no separate item is created for the work, the minimal solution is to use the same item for both the work and edition data, or for both the work and exemplar data. In that case the item must carry both statements, instance of work AND instance of edition, or instance of work AND instance of exemplar, to ensure reliable data extraction.
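
As an illustration of why both instance of statements matter for extraction, here is a minimal sketch that derives an item's level or levels from its instance of (P31) values. The three class IDs are the ones given in the table footnotes; the function names are hypothetical, and the sketch assumes exact matches only, so it ignores any subclasses an item might use instead.

```python
# Class items named in the footnotes of the general-cases table.
WORK = "Q386724"        # work
EDITION = "Q3331189"    # version, edition or translation
EXEMPLAR = "Q53731850"  # individual copy of a book

LEVEL_BY_CLASS = {WORK: "work", EDITION: "edition", EXEMPLAR: "exemplar"}


def levels_of(instance_of_values):
    """Return the set of levels an item declares through its P31 values."""
    return {LEVEL_BY_CLASS[q] for q in instance_of_values if q in LEVEL_BY_CLASS}


def is_combined_item(instance_of_values):
    """True when one item holds work data together with edition or exemplar data."""
    levels = levels_of(instance_of_values)
    return "work" in levels and bool(levels & {"edition", "exemplar"})


print(levels_of([WORK, EDITION]))
print(is_combined_item([WORK, EDITION]))
print(is_combined_item([EDITION]))
```

An extraction tool could use such a test to decide whether it must follow a link to a separate work item or can read the work-level data from the same item.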

General guidance for extracting all information for an item

Principle 1

Data retrieval has to follow a top-down process in order to collect all the data relevant to an item:
  • Exemplar case: ...
  • Edition case: collect all data from the work item linked to the edition item, then collect data from the edition item. Where the work item and the edition item disagree, the data from the edition item override the data from the work item.
  • Work case: collect all data from the work item.
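
The edition case above can be sketched as follows. This is only an illustration under stated assumptions: items live in an in-memory dictionary keyed by item ID, each item maps property IDs to lists of values, and all `Q_...` identifiers and the `collect_for_edition` name are hypothetical. The link from edition to work is read from edition or translation of (P629), as in the table.

```python
def collect_for_edition(edition_id, items):
    """Collect data for an edition: work data first, edition data on top."""
    edition = items[edition_id]
    collected = {}
    # Step 1: start from the work item(s) linked through P629.
    for work_id in edition.get("P629", []):
        collected.update(items[work_id])
    # Step 2: apply the edition's own statements; where both levels state
    # the same property, the edition's values replace the work's values.
    collected.update(edition)
    return collected


# Hypothetical sample data: a work and one translated edition of it.
items = {
    "Q_WORK": {
        "P31": ["Q386724"],
        "P50": ["Q_AUTHOR"],
        "P1476": ["Original title"],
    },
    "Q_EDITION": {
        "P31": ["Q3331189"],
        "P629": ["Q_WORK"],
        "P1476": ["Translated title"],
        "P655": ["Q_TRANSLATOR"],
    },
}

result = collect_for_edition("Q_EDITION", items)
print(result["P50"])    # author inherited from the work item
print(result["P1476"])  # title taken from the edition item
```

The work case needs no merging at all: the statements of the work item are the result.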

Principle 2

All properties have to be checked at each level of the data collection. Because of several exceptions, no property can be dedicated to a single level. For example, author (P50) cannot be restricted to the work level: in some special cases it is used at both the edition level and the work level.

Principle 3

Following Principle 2, data from the different levels have to be aggregated: for example, if author (P50) is used at both the edition level and the work level, the retrieval can end up with several values for the same property. In that case, the value from the lowest level overrides the others.
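
A minimal sketch of this resolution rule for a single property, which also records the level the winning value came from. The `resolve` helper, the list-of-pairs input shape, and the `Q_...` identifiers are assumptions made for the example, not part of the data model.

```python
def resolve(prop, levels):
    """Return (level_name, values) from the lowest level that states `prop`.

    `levels` is ordered top-down, e.g. [("work", {...}), ("edition", {...})].
    Every level is inspected (Principle 2), and a value found at a lower
    level replaces one found higher up (Principle 3). Returns None when
    no level states the property.
    """
    winner = None
    for name, statements in levels:
        if prop in statements:
            winner = (name, statements[prop])
    return winner


# Hypothetical data: author stated at both levels, publisher only on the edition.
levels = [
    ("work", {"P50": ["Q_AUTHOR_A"]}),
    ("edition", {"P50": ["Q_AUTHOR_B"], "P123": ["Q_PUBLISHER"]}),
]

print(resolve("P50", levels))
print(resolve("P123", levels))
print(resolve("P110", levels))
```

Keeping the level name next to the values makes it possible to see afterwards whether a result was inherited from the work or stated on the lower-level item.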