Wikidata:WikiProject Journalists
The goal of this project is to organize and add information about journalists to Wikidata. Wikimedia relies on high quality sources to use as references. Many of these sources are written by people who are not in Wikidata. How can we help fix that?
For now we will simply pose some questions and fill them in as we go.
Existence?
editShould this project exist or are there other projects that it should be merged with?
Current State
editWhat is the current state of journalist data in Wikidata?
What are the most common items and properties?
editItems
editProperties
editWhat are some useful SPARQL queries that can be used to assess the current data?
editWhat does the ontology look like?
editSub-classes of journalist graph
https://angryloki.github.io/wikidata-graph-builder/?property=P279&item=Q1930187&mode=reverse
What is the Gender gap?
editGender gap report from Denelezh
SQID Report on Q1930187 (journalist)
editReasonator Report on Q1930187 (journalist)
editScholia
editScholia is a linked data project focused on academic publications but sometimes is able to generate interesting reports for journalists. A good showcase of what is possible if enough journalist linked data is put into Wikidata. For example,
Deduplication and Record Linkage
editThere are already on the order of 100k journalists in Wikidata. Any attempt to add new data in bulk will need to resolve collisions between incoming and existing journalists. This is a very common problem and solutions typically involve Record linkage. Can we design record linkage solutions for existing databases? Will we need a custom record linkage model for each database we try to incorporate or are there common features that we can use across multiple databases?
Currently the closest thing with have to a unique id for them is their twitter ID!
OpenRefine
editThe OpenRefine tool is one well supported method of doing this.
Existing Databases
editWhat existing databases of journalists exist and how can we integrate their data?
Muck Rack
editGood visability on google, seems to have a page for every journalist and on that page has summary of who they written for, excerpts of thie work, links to thier social media and thier twitter feed.
The jounalist can take ownership of each page and corrections are delt with via a chat mechnaism that can actioned with a few hours.
The unique ID of the page is proprietarty and the links they show are to properietry sites too such as twitter, I would like to see them add and use a open cross-platform id
The Factual
editNot publicly available or offered commercially, but they maintain an internal database of journalists.
Standards
editHow should journalism data be structured?
editWhat information do we need about journalists, publishers, newspapers?
What is the best way to handle freelancers?
Should we link news sites to their ratings on
Can we incorporate data from the Wikipedia:Reliable_sources/Perennial_sources or the other way around?
Is there a uniqueid for us to use for each jounalist from an open and independant organisation
editfor example instead of a "twitter handle" which has become the de-facto "uniqueID" it should be something like :-
- Integrated Authority File, ISNI,VIAF or Worldcat
Also judging by how disorganised most jouranlist media presence can be, the id will have needed to be given them automatically rather than something they had to apply for.
How should we handle referencing?
editRelated External Projects
editJournalList
edit"A Networked List of News Publishers" --https://journallist.net/
Managers of the trust.txt
framework (see
this video for
a short introduction and comparison to existing "txt" solutions
such robots.txt
and ads.txt
.
Related Wikimedia Projects and Sites
editWhat lessons can we learn from existing projects? How can we collaborate?
- https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Newspapers
- https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Journalism
- https://en.wikipedia.org/wiki/Wikipedia:Reliable_sources/Perennial_sources
Tools
edit- Wikidata:WikiProject_Journalists/Creating_an_entry_using_Wikidata_for_Firefox Example of adding a journalist using Wikidata for Firefox
- The Media Directory, an Observable notebook which lists medias by country and category.