Wikidata:WikidataCon 2017/Submissions/Describing Software So We Can Cite Software

Watch the recording
Submission no. 15
Title of the submission
Describing Software So We Can Cite Software

Author(s) of the submission

Katherine Thornton, Finn Årup Nielsen

E-mail address

katherine.thornton@yale.edu

Country of origin

USA, Denmark

Affiliation, if any (organisation, company etc.)

Yale University Library, Technical University of Denmark


Type of session
Length of session

45 min

Ideal number of attendees

10+

EtherPad for documentation
https://etherpad.wikimedia.org/p/WikidataCon-Describing-Software

Abstract

When researchers publish scientific articles some include information about software used in their research. In traditional publication models the rare occasions in which research software is provided along with a publication, the links between those research outputs are not always easily maintained. Funders, publishers, scholars, librarians, and other information professionals need ways to describe software that is used in research just as they need to describe scientific publications.

Connecting scientific articles and the software used in their creation is not currently well-supported in scholarly communication. This is changing as more funders and publishers are mandating that research data and research software be published alongside research papers. A recent announcement from the Wellcome Trust is a new policy on research data and research software. Several other similar initiatives are underway to promote awareness of the need for software citation.

We will report on exploratory research into statements about software that can be mined out of publications indexed by PubMed. We will describe the workflows we created to add statements to the Wikidata items for scientific articles from our study to describe the software that was used in the study. We will share findings about how to reuse existing ontologies to further refine statements we can make about what aspect of the research process a particular piece of software was used. This research will then be used to enrich or inform existing recommendations about software citation, software curation, research data citation, and research data curation. This will help us create data models within Wikidata that will support our work to describe complex research objects relevant to scholarly communication, including software citation involving dereferencable Wikidata URIs.

Practices of description, curation, and structuring are emerging in the Wikidata community that will support precise software description according to data models that are currently being developed. We are working toward increasing the breadth of citable software (describing versions, describing configured software environments, etc.) and toward the breadth of aspects of the research process (data collection, data analysis, document typesetting, figure creation, data set used, etc). Wikidata can have properties to describe as many parts, facets, processes, perspectives on the research process as we create for ourselves. We have the option to describe complex research outputs collectively in a open collaboration system so that anyone can freely reuse this content.

What will attendees take away from this session?
  1. Hear about how software is described in the research papers collected in PubMed
  1. Get an overview of the current status of how scientific articles are being described in Wikidata
  1. Get an overview of the current status of how software is being described in Wikidata
  1. Gain an understanding of the role Wikidata can play in helping to represent relationships between scientific publications and software and data sets used in research
  1. Expect to learn about how research using Wikidata is already being used to inform Yale University Library’s software preservation decisions
Slides or further information
Special requests

Interested attendees edit

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest.

  1. Daniel Mietchen (talk) 19:05, 31 July 2017 (UTC)[reply]
  2. Andrawaag (talk) 18:01, 17 August 2017 (UTC)[reply]
  3. ...