Wikidata:WikiProject ELTeC

The main aim of this project is building Wikidata entities for the novels from ELTeC: European Literary Text Collection level-2. The guideline model for activities is the successful use case SrpELTeC@Wikidata, that will be used for following languages that have level-2 already published: English, Portuguese, French, Slovenian, German, Hungarian. Tasks involved for the achievement of this goal include the following processing steps of novels with level-2 annotations: 1) the preparation of metadata of the ELTeC collection for import into Wikidata, 2) import of data and 3) analysis of imported dataset using set of SPARQL queries. Main entities involved in these tasks are: text collection, authors, literary work, issue. Metadata will be extracted from the element <TEIHeader> for all novels. Data preparation, existence check and disambiguation will be done using OpenRefine, a powerful tool for cleaning data and transforming it from one format into another; as well as extending it with web services and external data. Preparation for import into Wikidata will be done using QuickStatements. For statistical analysis, evaluation and visualisation several SPARQL queries will be embedded in a dedicated website


 Info You can use the AutoEdit tool to quickly add label and description on WikiProject ELTeC in many languages.


Property overview edit

Work item properties edit

Works should be instances of written work (Q47461344) or one of its subclasses, we use literary work (Q7725634)

Title ID Data type Description Examples Inverse
instance ofP31Iteminstance of: that class of which this subject is a particular example and member; different from P279 (subclass of); for example: K2 is an instance of mountain; volcano is a subclass of mountain (and an instance of volcanic landform)The Sign of Four <instance of> literary work-
titleP1476Monolingual textoriginal title and title: published name of a work, such as a newspaper article, a literary work, piece of music, a website, or a performance workThe Sign of Four <title> The Sign of the Four-
authorP50Itemauthor, writer and creator: main creator(s) of a written work (use on works, not humans); use P2093 (author name string) when Wikidata item is unknown or does not existThe Sign of Four <author> Arthur Conan Doyle-
language of work or nameP407Itemlanguage: language associated with this creative work (such as books, shows, songs, broadcasts or websites) or a name (for persons use "native language" (P103) and "languages spoken, written or signed" (P1412))The Sign of Four <language of work or name> English-
has edition or translationP747Itemversion, edition or translation, translated edition and source text: link to an edition of this itemThe Sign of Four <has edition or translation> The Sign of Four : ELTeC editionedition or translation of
form of creative workP7937Itemform of art, album type, musical form, type of musical work/composition and form of creative work: structure of a creative workThe Sign of Four <form of creative work> novel-
main subjectP921Itemtopic, matter and subject: primary topic of a work (see also P180: depicts)The Party Journalist <main subject> radio propagandastatement is subject of
charactersP674Itemfictional character: characters which appear in this item (like plays, operas, operettas, books, comics, films, TV series, video games)The Sign of Four <characters> Sherlock Holmespresent in work
narrative locationP840Itemnarrative location: the narrative of the work is set in this locationImpure Blood <narrative location> Vranje-
imageP18Commons media fileillustration and image: image of relevant illustration of the subject; if available, also use more specific properties (sample: coat of arms image, locator map, flag image, signature image, logo image, collage image)The Sign of Four <image> The Sign of Four cover 1892.jpg-
VIAF IDP214External identifierVIAF ID: identifier for the Virtual International Authority File database [format: up to 22 digits]Impure Blood <VIAF ID> 208715862-

Edition item properties edit

Editions should be instances of version, edition or translation (Q3331189) or one of its subclasses.

Title ID Data type Description Examples Inverse
instance ofP31Iteminstance of: that class of which this subject is a particular example and member; different from P279 (subclass of); for example: K2 is an instance of mountain; volcano is a subclass of mountain (and an instance of volcanic landform)The Sign of Four : ELTeC edition <instance of> version, edition or translation-
edition or translation ofP629Itemversion, edition or translation: is an edition or translation of this entityThe Sign of Four : ELTeC edition <edition or translation of> The Sign of Fourhas edition or translation
language of work or nameP407Itemlanguage: language associated with this creative work (such as books, shows, songs, broadcasts or websites) or a name (for persons use "native language" (P103) and "languages spoken, written or signed" (P1412))Aus dem Leben einer Frau : ELTeC ausgabe <language of work or name> German-
authorP50Itemauthor, writer and creator: main creator(s) of a written work (use on works, not humans); use P2093 (author name string) when Wikidata item is unknown or does not existThe Sign of Four : ELTeC edition <author> Arthur Conan Doyle-
titleP1476Monolingual textoriginal title and title: title of this particular editionThe Tower <title> Der Turm-
place of publicationP291Itemplace of publication and place of first publication: geographical place of publication of the edition (use 1st edition when referring to works)Folle-Farine : VWWP edition (digital edition) <place of publication> Bloomington-
publication dateP577Point in timepublication date: date or point in time when a work was first published or released48-tól Világosig (első kiadás) <publication date> 1680-
imageP18Commons media fileillustration and image: image of relevant illustration of the subject; if available, also use more specific properties (sample: coat of arms image, locator map, flag image, signature image, logo image, collage image)Gutenberg Bible <image> Gutenberg Bible, Lenox Copy, New York Public Library, 2009. Pic 01.jpg-
number of pagesP1104Quantitypage, leaf, page of plates, leaf of plates, unnumbered page of plates and number of pages: number of pages in an edition of a written work; see allowed units constraint for valid values to use for units in conjunction with a numberArabela : edicija ELTeC <number of pages> 100-
publisherP123Itempublisher and publisher: organization or person responsible for publishing books, periodicals, printed music, podcasts, games or softwareImpure blood: ELTeC edition <publisher> Distant Reading for European Literary History-
full work available at URLP953URLdigital library: URL of a web page containing the full body of this itemImpure blood: ELTeC edition <full work available at URL> https://distantreading.github.io/ELTeC/srp/SRP19101.html-
distributed byP750Itemmedia distributor: distributor of a creative work; distributor for a record label; news agency; film distributorImpure blood: ELTeC edition <distributed by> Zenodo-
copyright licenseP275Itemlicense: license under which this copyrighted work is releasedAm Jenseits : ELTeC ausgabe <copyright license> Creative Commons Attribution 4.0 International-
copyright statusP6216Itemcopyright status: copyright status for intellectual creations like works of art, publications, software, etc.Am Jenseits : ELTeC ausgabe <copyright status> copyrighted, dedicated to the public domain by copyright holder-

Author item properties edit

Author should be instances of human (Q5).

Title ID Data type Description Examples Inverse
instance ofP31Iteminstance of: that class of which this subject is a particular example and member; different from P279 (subclass of); for example: K2 is an instance of mountain; volcano is a subclass of mountain (and an instance of volcanic landform)Honoré de Balzac <instance of> human-
sex or genderP21Itemsex of humans: sex or gender identity of human or animal. For human: male, female, non-binary, intersex, transgender female, transgender male, agender, etc. For animal: male organism, female organism. Groups of same gender use subclass of (P279)Honoré de Balzac <sex or gender> male-
name in native languageP1559Monolingual textname and full name: name of a person in their native languageHonoré de Balzac <name in native language> Honoré de Balzac-
date of birthP569Point in timedate of birth: date on which the subject was bornHonoré de Balzac <date of birth> 1799-
date of deathP570Point in timedate of death: date on which the subject diedHonoré de Balzac <date of death> 1850-
native languageP103Itemfirst language: language or languages a person has learned from early childhoodHonoré de Balzac <native language> French-
imageP18Commons media fileillustration and image: image of relevant illustration of the subject; if available, also use more specific properties (sample: coat of arms image, locator map, flag image, signature image, logo image, collage image)Honoré de Balzac <image> Honoré de Balzac (1842) detail.jpg-
writing languageP6886Itemlanguage in which the writer has written their workHonoré de Balzac <writing language> French-
languages spoken, written or signedP1412Itemlanguage proficiency: language(s) that a person or a people speaks, writes or signs, including the native language(s)Honoré de Balzac <languages spoken, written or signed> French-
occupationP106Itemoccupation, profession and by occupation or profession: occupation of a person; see also "field of work" (Property:P101), "position held" (Property:P39)Honoré de Balzac <occupation> writer-

Queries edit

Collections in ELTeC Collection edit

Subpages edit

Participants edit