User:Bargioni/CC literature box

CC Lit Box is a user script that lets you add Colon Classification (P8248) to literary authors and works, on the basis of the properties available in their items.

It is based on the Colon Classification (Q1110558) by Shiyali Ramamrita Ranganathan.

Authors: User:Carlobia and User:Bargioni.

Installation

To use this gadget, add the following code to your common.js page:

importScript( 'User:Bargioni/CC_Lit_Box_conf.js' );
importScript( 'User:Bargioni/CC_Lit_Box.js' );

Tutorial about CC

Author Number

This gadget automatically creates and records the property Colon Classification (P8248), i.e. the Colon Classification class number, for literary authors such as Shakespeare, Dante, Dumas and Tolstoy.

For the creation of the class number, three properties of the person (human (Q5)) are mandatory:

  • First, occupation (P106) must be one of poet (Q49757), playwright (Q214917), novelist (Q6625963), or writer (Q36180); otherwise the classification is not applicable and the gadget does not work (a warning window appears in the upper right corner of the screen). When an author has more than one of these occupations, the gadget itself does not suggest a preference order, so the classifier must choose one, based on the following criteria: the role for which the author is best known (e.g., Dante is mostly known as a poet, Shakespeare as a playwright, Tolkien as a novelist, and so on); otherwise, reference sources can help, starting with the author's Wikipedia page; lastly, the author's production can be analysed.
  • Second, a language must be associated with the person. The preference order is native language (P103), writing language (P6886), languages spoken, written or signed (P1412), as suggested by the gadget.
  • Third, date of birth (P569) (with precision year, month or day) must be associated with the person.

If one or more mandatory properties are missing, add them to the item and reload the page; the gadget will then work.

A warning message appears if the property Colon Classification (P8248) is already recorded in the item, as it should not be duplicated.
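The checks described above can be sketched in a few lines of JavaScript. This is only an illustration under stated assumptions, not the code of CC_Lit_Box.js: the function names `claimValues` and `checkAuthorPrerequisites` and the sample entity are hypothetical, and the input is assumed to follow the standard Wikidata entity JSON layout (`claims`, `mainsnak`, `datavalue`), where a time precision of 9 means year, 10 month and 11 day.

```javascript
// Hypothetical sketch, not taken from CC_Lit_Box.js.
const LITERARY_OCCUPATIONS = new Set(['Q49757', 'Q214917', 'Q6625963', 'Q36180']);
// Language properties in the preference order given above.
const LANGUAGE_PROPERTIES = ['P103', 'P6886', 'P1412'];
const YEAR_PRECISION = 9; // Wikidata time precision: 9 = year, 10 = month, 11 = day

// Return the values of all statements of a property that actually carry a value.
function claimValues(entity, property) {
  const statements = (entity.claims && entity.claims[property]) || [];
  return statements
    .filter((s) => s.mainsnak && s.mainsnak.snaktype === 'value')
    .map((s) => s.mainsnak.datavalue.value);
}

function checkAuthorPrerequisites(entity) {
  const missing = [];

  // 1. At least one of the four literary occupations.
  const occupations = claimValues(entity, 'P106')
    .map((v) => v.id)
    .filter((id) => LITERARY_OCCUPATIONS.has(id));
  if (occupations.length === 0) missing.push('P106');

  // 2. First language found, following the preference order.
  let language = null;
  for (const property of LANGUAGE_PROPERTIES) {
    const values = claimValues(entity, property);
    if (values.length > 0) {
      language = { property, id: values[0].id };
      break;
    }
  }
  if (language === null) missing.push('language');

  // 3. Date of birth with at least year precision.
  const birth = claimValues(entity, 'P569').find((v) => v.precision >= YEAR_PRECISION) || null;
  if (birth === null) missing.push('P569');

  // A class number that is already recorded should not be duplicated.
  const alreadyClassified = claimValues(entity, 'P8248').length > 0;

  return { ok: missing.length === 0, missing, occupations, language, birth, alreadyClassified };
}

// Illustrative sample item (made-up data in the Wikidata JSON layout).
const sampleAuthor = {
  claims: {
    P106: [{ mainsnak: { snaktype: 'value', datavalue: { value: { id: 'Q49757' } } } }],
    P103: [{ mainsnak: { snaktype: 'value', datavalue: { value: { id: 'Q652' } } } }],
    P569: [{ mainsnak: { snaktype: 'value', datavalue: { value: { time: '+1265-00-00T00:00:00Z', precision: 9 } } } }],
  },
};

console.log(checkAuthorPrerequisites(sampleAuthor));
```

The sketch only reports which occupations qualify; as explained above, choosing among several qualifying occupations remains a decision for the classifier.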

The user of the gadget must always check that the main or preferred occupation, the native or relevant language, and the birth year shown in the box are correct. If the proposed values are not the preferred or correct ones, they can be changed. After this check, the user can save the proposed class number for the author.

Work Number

In a second step, CC Lit Box was updated to allow classification of the works by a specific author. The author's class number is taken as the basis for each of his or her works, and a progressive number following chronological order is proposed for each work. Before creating CC work numbers, the user must check carefully that each Wikidata item really describes a work (and not one of its editions) and that all the data associated with the work are correct (creation date, language, title in original language, and so on).

Work class numbers that are ready are shown in black, while work class numbers missing one or more facets are shown in red. To change the status from red to black, the work item must be edited and completed in Wikidata. If CC Lit Box also shows green numbers, some work class numbers were created previously, before the classification of all the works by that author was complete, which prevents their correct chronological arrangement.
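As a rough illustration of the chronological numbering and the colour statuses, consider the following JavaScript sketch. It is not the gadget's implementation: the field names (`year`, `language`, `title`, `recordedNumber`), the choice of which facets count as required, and the handling of undated works are all assumptions made for the example, and the notation that joins the author's class number to the work number is deliberately left out.

```javascript
// Hypothetical sketch: field names and the set of required facets are
// assumptions for illustration, not the gadget's internal data model.
function workStatus(work) {
  // A number recorded earlier is flagged green.
  if (work.recordedNumber) return 'green';
  const complete =
    Number.isInteger(work.year) && Boolean(work.language) && Boolean(work.title);
  // Complete works are ready (black); works missing a facet are red.
  return complete ? 'black' : 'red';
}

function proposeWorkSequence(works) {
  // Only works with a creation year can be placed in chronological order.
  const dated = works.filter((w) => Number.isInteger(w.year));
  const undated = works.filter((w) => !Number.isInteger(w.year));
  dated.sort((a, b) => a.year - b.year);

  const proposals = dated.map((w, index) => ({
    id: w.id,
    sequence: index + 1, // progressive number in chronological order
    status: workStatus(w),
  }));
  // Undated works get no sequence number until the item is completed.
  for (const w of undated) {
    proposals.push({ id: w.id, sequence: null, status: workStatus(w) });
  }
  return proposals;
}

// Made-up sample data.
const sampleWorks = [
  { id: 'work-C', year: 1610, language: 'en', title: 'Third' },
  { id: 'work-A', year: 1595, language: 'en', title: 'First' },
  { id: 'work-B', year: 1600, language: null, title: 'Second' },
  { id: 'work-D', year: null, language: 'en', title: 'Undated' },
];

console.log(proposeWorkSequence(sampleWorks));
```

In this sketch a work without a creation year cannot be numbered at all, which mirrors the need to complete the work item in Wikidata before its class number can turn from red to black.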

Edition and items. The book number

A further gadget for the editions of an author's works will be created as soon as possible.

WC4Lit

The gadget also offers the option of expressing a class number in the Wikidata Classification for Literature (WC4Lit), a new classification scheme that is better suited to the data available on Wikidata and more expressive and understandable for classifiers and users (documentation will follow as soon as possible).

Things to know

The quality of the data currently available in Wikidata is a major drawback for this project. Even though the classification process is automated, data must usually be checked and curated manually. The gadget would be much more useful if quality data, such as data from well-formed library catalogs, were imported into Wikidata automatically or semi-automatically.

Items related to an author can be only partially included in the box because they are described in different ways. Works with the value literary work (Q7725634) for the property instance of (P31) are included; works with the values written work (Q47461344) or, e.g., novel (Q8261) for P31 are not. The gadget checks this property and handles only items that are literary works (instance of (P31): literary work (Q7725634)).

In fact, written work (Q47461344) is more generic than literary work (Q7725634); the former would include Darwin's On the Origin of Species, to which CC Lit Box would not apply. On the other hand, novel (Q8261) is more precisely a genre (genre (P136)) of a literary work (Q7725634).
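The instance of (P31) check described above can be sketched as follows. This is a minimal illustration assuming the standard Wikidata entity JSON layout; the helper names `instanceOfIds` and `isHandledWork` and the sample items are hypothetical and are not taken from CC_Lit_Box.js.

```javascript
// Hypothetical sketch, not taken from CC_Lit_Box.js.
const LITERARY_WORK = 'Q7725634';
const WRITTEN_WORK = 'Q47461344';

// Collect the item ids used as values of instance of (P31).
function instanceOfIds(entity) {
  const statements = (entity.claims && entity.claims.P31) || [];
  return statements
    .filter((s) => s.mainsnak && s.mainsnak.snaktype === 'value')
    .map((s) => s.mainsnak.datavalue.value.id);
}

// A work is handled if at least one P31 value is among the accepted classes.
function isHandledWork(entity, acceptedClasses = [LITERARY_WORK]) {
  return instanceOfIds(entity).some((id) => acceptedClasses.includes(id));
}

// Small helper to build made-up sample items.
function sampleItem(classId) {
  return {
    claims: {
      P31: [{ mainsnak: { snaktype: 'value', datavalue: { value: { id: classId } } } }],
    },
  };
}

console.log(isHandledWork(sampleItem('Q7725634'))); // literary work
console.log(isHandledWork(sampleItem('Q8261'))); // novel
console.log(isHandledWork(sampleItem('Q47461344'), [LITERARY_WORK, WRITTEN_WORK])); // wider filter
```

Passing a wider list of accepted classes, as in the last call, is one way to picture a filter that also admits written work (Q47461344).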

If you change instance of (P31) (using this property is classifying!) of work items not yet included in the box to literary work (Q7725634), you will see them appear in the box (please note that this usually takes up to 10-15 minutes, due to the lag on Wikidata).

Lastly, CC Lit Box was recently set up to include written works as well (this seems to be by far the more frequent value, even for literary works). The new workflow and its pros and cons are still to be evaluated.

Bibliography

The relevance of Colon Classification for the arrangement of FRBR entities, and the bell curve expressing the distribution of content relevance obtained by Colon Classification, are described in:

  • Carlo Bianchini (2010). "FRBR prima di FRBR". JLIS.it (in Italian). 1 (1): 11–39. doi:10.4403/JLIS.IT-31. hdl:11571/222144. ISSN 2038-1026. Wikidata Q58098635. 
  • Carlo Bianchini (2011). "Organising knowledge with the filiatory sequence of the Colon Classification". JLIS.it. 2 (2): 1-21, 1-4. doi:10.4403/JLIS.IT-4710. ISSN 2038-1026. Wikidata Q58380307. 
  • Carlo Bianchini (2012). "Arrangement of FRBR Entities in Colon Classification Call Numbers". Cataloging & Classification Quarterly. 50 (5–7): 473–493. doi:10.1080/01639374.2012.679877. ISSN 0163-9374. Wikidata Q56479623. 
  • Carlo Bianchini; Claudio Gnoli; Luca Giusti (2017). "The APUPA bell curve: Ranganathan's visual pattern for knowledge organization". Les Cahiers du numérique. 13 (1): 49–68. doi:10.3166/LCN.13.1.49-68. ISSN 1622-1494. Wikidata Q54553753. 

The feasibility of an algorithm to produce class numbers for editions and items of the works of some authors, starting from the LOD of BnF, was investigated in the following works:

  • Carlo Bianchini (2019). Dal web semantico all'indicizzazione per soggetto. Un caso di studio su data.bnf.fr e Colon classification. Viaggi a bordo di una parola (in Italian). pp. 15–31. ISBN 978-88-7812-276-5. Wikidata Q73512494. 
  • Caterina Licul (2019). L’applicazione dei Linked Open Data alla produzione dei numeri di libro (thesis). Udine: University of Udine.
  • Carlo Bianchini (2020). From semantic web to faceted classification – a case study and five lines of future research. Mirna Willer: Festschrift. pp. 173–191. ISBN 978-953-331-274-3. Wikidata Q95586772. 

A full description of the project and the gadget is available in: