Wikidata:WikidataCon 2017/Submissions/From hundreds manual infoboxes to 10 core wikidata-powered infoxboxes. A WPCA experience

 This is an Open submission for WikidataCon 2017 that has not yet been reviewed by the members of the Program Committee.

Submission no. 24
Title of the submission

From hundreds manual infoboxes to 10 core wikidata-powered infoxboxes. A cawiki experience


Author(s) of the submission
E-mail address

https://www.wikidata.org/wiki/Special:EmailUser/Amadalvarez

Country of origin

Catalonia, Spain

Affiliation, if any (organisation, company etc.)

Amical Wikimedia


Type of session

Depending on the interest of the audience, and others making submissions on Wikidata infoboxes, this could be organised as a talk, workshop, demo, round table, or discussion. In particular, if others working on Wikidata infoboxes also want to present on their work, then this would work as a comparison of approaches/discussion about Wikidata infoboxes in Wikipedias more generally.

Length of session

30 min.

Ideal number of attendees

20-40


Abstract

When we started 2 years ago implementing wikidata to existing infoboxes, we realized that it was huge job because we had hundreds of hyperspecialized infoboxes (regions, professions, church, castle, building, ...). Then, we decided to build core infoboxes that gathered the best features and parameters from all the infoboxes with a similar topic. These new infoboxes work with intense use of wikidata in addition to the same manual parameters they had. De facto, the publishers have stopped entering manual parameters, it is easy to use, and the guarantee of maintenance of volatile contents is higher than before.

Today, 10 "core infoboxes" serve 55% of all cawiki articles and have left behind as obsolete almost 300 infoboxes. All of them include dynamic maps based on OSM (when applied), multivalued properties integrating multiple qualifiers in the edition of the property. Additionally, there are 30-40 highly specialized infoboxes -attending 5-10% of cawiki articles- that have not been able to concentrate, but have been powered with the same wikidata functionalities.

This project has required the homogenization of the names of the parameters that represent the same concept, creating a kind of data dictionary of manual parameters and then, rename the old parameter names in articles in order to run with the new infobox. Our Wikidata module has evolved and, at the moment, it is able to do small editing treatments for the combination property-qualifiers; Manage the gender to return correctly the description of occupations or positions when women; Recover bottom-up hierarchical structures such as taxon or higher territorial units, etc.

The empiric experience allows audience to see (and copy the specific solution) how to handle hundreds of properties.

What will attendees take away from this session?
  1. How to improve fast and easily their infoboxes with WD and other gadgets from the real solutions of catalan wikipedia
  2. How to handle difficult data structures of WD, for instance, "position held" or "heritage status", specially "world heritage", etc.
  3. How WD information has been included in cawiki infoboxes that covers almost 70% of thematic scope
Slides or further information
Special requests

I am applying for a scholarship to attend the conference.


Interested attendees edit

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest.

  1. -- YULdigitalpreservation (talk) 18:21, 25 July 2017 (UTC)[reply]
  2. -- Jsamwrites (talk)
  3. seav (talk) 17:55, 30 July 2017 (UTC)[reply]
  4. -- JakobVoss (talk)
  5. Daniel Mietchen (talk) 07:44, 31 July 2017 (UTC)[reply]
  6. Jklamo (talk) 00:05, 1 August 2017 (UTC)[reply]
  7. --Micru (talk) 11:26, 11 August 2017 (UTC)[reply]
  8. Gikü (talk) 15:15, 21 August 2017 (UTC)[reply]