Wikidata talk:WikiProject COVID-19/Archive
Archive: March 2020 edit
This project edit
Hello,
I love the idea of this WikiProject. There were some discussions about structuring it as a task force (see User_talk:Daniel_Mietchen#Would_you_like_to_contribute_to_a_WikiProject_COVID-19_?). And @Daniel Mietchen: has lots of experience in managing WikiProjects in Wikidata. What about we start gathering some content here and maybe move it elsewhere on Wikidata if it is a better option for the community? TiagoLubiana (talk) 01:20, 16 March 2020 (UTC)
- I think the most important thing is to get going and to inform all the relevant communities/ places. --Daniel Mietchen (talk) 01:45, 16 March 2020 (UTC)
Should regional outbreaks be classified as pandemics or epidemics? edit
I made a wikidata list of all pandemics. See en:Talk:Pandemic#Wikidata list of pandemics. Should regional outbreaks, such as 2020 COVID-19 pandemic in North Dakota (Q87746410), be considered instances of pandemics, or just epidemics and part of a specific pandemic? Tomastvivlaren (talk) 07:19, 16 March 2020 (UTC)
- @Tomastvivlaren: Great point. My view is: there is only one instance of a pandemic. It is one pandemic, by definition. I guess you can classify as an instance of an outbreak, as it has been done (see COVID-19 pandemic in Brazil (Q86597695). Maybe there could be a more specific item. Something in the likes of a "local manifestation of a pandemic", a subclass of "outbreak". What do you think? TiagoLubiana (talk) 14:33, 16 March 2020 (UTC)
- Up, something like local manifestation of a pandemic (Q87767992) TiagoLubiana (talk)
- We already have it modelled here https://www.wikidata.org/wiki/Wikidata:WikiProject_COVID-19/Data_models/Outbreaks as disease outbreak (Q3241045). Shouldn't you use that? --Hannolans (talk) 19:23, 14 April 2020 (UTC)
Mining COVID19 research using [R] and Wikidata edit
- Project components: R, textmining, Wikidata, COVID19, open access publications, preprints, closed access publications.
A group being put together by Peter Murray Rust is looking to build an integrated open resource of current published information & data on COVID19 (see the petermr/openVirus github repo for details). It should also eventually be relevant to other topics.
A few example subgoals:
- Immediate term: be able to edit wikidata from R via the API
- Short term: pull all COVID19 published papers from ePMC
- textmine for main topics and broader topics (all topics should match to wikidata items)
- write main topics to Wikidata items for those published papers
- publish broader topics of each paper in a separate open database
- Medium term: as above, but for bioRxiv, medRxiv, SciElo, Redalc etc.
Medium term: as above, but for paywalled articles
- Medium term: more in-depth text analysis
- Long term: make process applicable to other topics
Please let us know if you'd be interested in being involved. T.Shafee(evo&evo) (talk) 08:53, 16 March 2020 (UTC)
- @Evolution and evolvability: Oh, awesome that Peter is acting on this too! I will be glad to help. TiagoLubiana (talk) 14:30, 16 March 2020 (UTC)
Particularly keen to create dictionaries from Wikimedia resources. Currently
- Links in Wikipedia pages (very noisy)
- Categories. Often coherent lists
- Tables in WP pages. variable
- Templates. Look very good - writing code ATM
see https://github.com/petermr/tigr2ess Tigr2ess tutorial (will add link later) Petermr (talk) 04:09, 17 March 2020 (UTC)
- For the 'edit API', a simple solution is to have the RDF generate QuickStatements. This approach has worked for me very well for years now. Of course, a Wikidata bot written in / for R would be awesome. --Egon Willighagen (talk) 07:21, 17 March 2020 (UTC)
- @Egon Willighagen: Agree, making quickstatements tables should be a good intermediate solution. Is it possible to submit to quickstatements via API (the way SourceMD used to)? Will add an issue ot the repo now (link and link). T.Shafee(evo&evo) (talk) 00:55, 19 March 2020 (UTC)
- There is QS Batch. Magnus has replied in both issues. I used that to submit 3M edits (only took 2 months to complete ;-) --Vladimir Alexiev (talk) 20:02, 23 March 2020 (UTC)
- @Egon Willighagen: Agree, making quickstatements tables should be a good intermediate solution. Is it possible to submit to quickstatements via API (the way SourceMD used to)? Will add an issue ot the repo now (link and link). T.Shafee(evo&evo) (talk) 00:55, 19 March 2020 (UTC)
For dictionaries I would actually start from MeSH descriptor ID (P486). https://www.ncbi.nlm.nih.gov/mesh/2049999 (from January) and https://www.ncbi.nlm.nih.gov/mesh/2050205 (from February) are two new C-numbers (i.e. MeSH supplementary topics). The point I would make is that they each come with a list of aliases. https://www.ncbi.nlm.nih.gov/mesh/2023153 from 2018 is the relevant genus: the aliases there are not going to be so useful. https://www.ncbi.nlm.nih.gov/mesh/68018352 from 1994 likewise: but the narrower terms given on those pages deserve attention. In general reverse lookup can translate MeSH C-numbers (as for D-numbers) into the items carrying them: respectively SARS-CoV-2 (Q82069695) (added today by me), COVID-19 (Q84263196) added 17 February. (I use https://tools.wmflabs.org/wikidata-todo/resolver.php?prop=486 for reverse lookup, though there are other ways.)
Generally a good way to compile broad-based dictionaries is to extract the label and aliases for items, using a SPARQL query. A VALUES query based on a list of items would be easy to update, as things progress. Charles Matthews (talk) 16:55, 17 March 2020 (UTC)
- Thanks for this! Everyone note the difference between MESH Descriptors, Concepts and Terms. Best described here https://www.wikidata.org/wiki/Wikidata:Property_proposal/MESH_Concept_ID --Vladimir Alexiev (talk) 20:06, 23 March 2020 (UTC)
For alpha users. Have written code to extract references from Wikipedia templates into dictionary form. Initial examples ``` String[] templates = {"Baltimore_(virus_classification)", "Antiretroviral_drug", "Virus_topics"}; ``` using https://github.com/petermr.ami3 will convert these Wikipedia templates into structured dictionaries. Templates of species, drugs, organizations, etc. will be useful. Please suggest templates that could be useful and I'll convert them.Petermr (talk) 09:46, 20 March 2020 (UTC)
- Hello, @Petermr:. Great work! I'm with Ami the Kangaroo here cheering for us. On the Wikipedia Wikiproject on COVID19 (en:Wikipedia:WikiProject_COVID-19) there are many templates directly related to COVID19, for example, the project tempĺate (en:Template:WikiProject_COVID-19). I hope this helps somehow. Best, TiagoLubiana (talk) 18:15, 21 March 2020 (UTC)
Numeric data edit
There are various data that can be represented on Wikidata or in the Tabular Data format on Wikimedia Commons, see, e.g., [1]. It is somewhat difficult to collect. Hospitalizations are apparently not available from Danish authorities, but scattered in the news media; longitudinal Danish data is hard to get because the page where it is presented is overwritten daily. I suppose that is the same problem for other countries. — Finn Årup Nielsen (fnielsen) (talk) 14:31, 16 March 2020 (UTC)
- Great point. The fact that the page is overwritten everyday might be good. Perhaps we can get legacy pages from a service such as webarchive [2] and standardize the refs? TiagoLubiana (talk) 14:41, 16 March 2020 (UTC)
- Yes, exactly! I recommend using the Wayback Machine (Q648266) too, and in fact was doing that for COVID-19 pandemic in the Netherlands (Q86756826) already. --Egon Willighagen (talk) 09:44, 17 March 2020 (UTC)
- For the numeric data in Austria (Q40) on COVID-19 pandemic in Austria (Q86847911) i am starting to use the following structure of references including data published by the Federal Ministry of Health (Q1006381):
- * reference URL (P854) → https://www.sozialministerium.at/Informationen-zum-Coronavirus/Neuartiges-Coronavirus-(2019-nCov).html
- * publisher (P123) → Federal Ministry of Health (Q1006381)
- * publication date (P577) → 2020-03-17
- * archive URL (P1065) → [3]
- * archive date (P2960) → 2020-03-17
- * retrieved (P813) → 2020-03-17
- * language of work or name (P407) → German (Q188)
- --Mfchris84 (talk) 14:33, 17 March 2020 (UTC)
- There is a similar discussion on the English Wikipedia. --Daniel Mietchen (talk) 10:53, 17 March 2020 (UTC)
- An option for standardized, global source of information are the WHO Novel Coronavirus (2019-nCoV) Situation Reports, at least for number of cases (P1603) and number of deaths (P1120). You can easily find them (at least now) in this Scholia page: for the COVID-19 pandemic. I have made a few python scripts to parse the WHO table and convert them to quickstatements (WHO to Wikidata on GithHub). For now, it is something like:
- outbreak_qid +number of deaths (P1120) + total_deaths + point in time (P585) + who_report_date + stated in (P248) + who_report_qid
- outbreak_qid +number of cases (P1603) + total_cases + point in time (P585) + who_report_date + stated in (P248) + who_report_qid
- If you have any contributions, please feel free to make either there or here. I can already send the batch for the yesterday numbers, but I thought about consulting you first. TiagoLubiana (talk) 18:14, 17 March 2020 (UTC)
Collection SPARQL queries as a book edit
Hi all, some time ago I started playing with SPARQL embedded in Markdown, to create an online book. Since someone was asking about queries for the virus, I started this book: https://egonw.github.io/SARS-CoV-2-Queries/ --Egon Willighagen (talk) 19:38, 16 March 2020 (UTC)
Capturing Special Announcements in a Wikibase edit
Following the announcement of schema.org of their release version 7.0, I took the liberty to set up a wikibase where I tried to follow the same schema. The special announcement schema was created to deal with the CORVID-19 Pandemic
The Wikibase is running on WBStack and still very much an early prototype. I was not able to fully following the schema proposed by schema.org, since Wikibase only accepts one datatype per property, but I was able to solve that by using wikibase items, where the items will capture the other possible data types.
I am reaching out to this group for scrutiny. Please feel free to test run. I will also continue working on it, but if the model is mature, I will try set up a Wikibase that can be used to collect these special announcements for real. --Andrawaag (talk) 21:49, 17 March 2020 (UTC)
- @Andrawaag: I took a look at it, and it looks quite useful. I'm not able to give technical feedback on it, but I liked the idea and the implementation. Collecting those announcements can be super informative, indeed. TiagoLubiana (talk) 14:19, 18 March 2020 (UTC)
I made a second prototype of a Wikibase for specialAnnouncement, this time with a wikibase property for each data type included in a schema.org property. e.g isBasedOnUrl (http://schema.org/Product as item) and isBasedOnUrl (url).
To move forward it would be nice to have a set of special announcements to see which approach fits best. Who has examples? --Andrawaag (talk) 08:03, 19 March 2020 (UTC)
Semantic Scholar's COVID-19 Open Research Dataset edit
News from Semantic Scholar (Q22908627):
Semantic Scholar has partnered with several leading research groups to prepare the COVID-19 Open Research Dataset (CORD-19), a unique resource of over 29,000 scholarly articles about COVID-19, SARS-CoV-2, and related coronaviruses.
Do be aware of the associated licence.
I have a contact there, if anyone needs something that is not already publicly available. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:21, 18 March 2020 (UTC)
How the CORD dataset was made. And they'll keep it up to date https://www.whitehouse.gov/briefings-statements/call-action-tech-community-new-machine-readable-covid-19-dataset/. Many people are working off that dataset. I know of 2 RDF representations. So is it a goal to mirror CORD in Wikidata and link it to the rest of WD? Only makes sense if we can keep it up to date. Vladimir Alexiev (talk) 20:15, 23 March 2020 (UTC)
biological processes edit
This is a heads up on my personal project to import papers on the molecular biology of SARS-Cov-2 replication in humans. These will have a common main subject claim added, so that they can later be searched. The ultimate goal is to provide a collection of references for statements on the biology, practically the base of biocuration. --SCIdude (talk) 07:53, 19 March 2020 (UTC)
- @SCIdude: That is awesome. Do you want to document what you are doing (so more people can help) somewhere around this project? For example, what do you think about a new tab about scholarly research? TiagoLubiana (talk) 23:27, 19 March 2020 (UTC)
- The result is continously visible in the query https://www.wikidata.org/wiki/Wikidata:WikiProject_COVID-19/Queries#with_main_subject_molecular/structural_biology_of_any_SARS-related_strain. --SCIdude (talk) 14:03, 20 March 2020 (UTC)
- @SCIdude:, please take note of this project: https://covid.pages.uni.lu/map_curation Let's try to coordinate all our work. --Egon Willighagen (talk) 13:22, 21 March 2020 (UTC)
- The result is continously visible in the query https://www.wikidata.org/wiki/Wikidata:WikiProject_COVID-19/Queries#with_main_subject_molecular/structural_biology_of_any_SARS-related_strain. --SCIdude (talk) 14:03, 20 March 2020 (UTC)
wirvsvirushackathon edit
Hello,
at the Weekend starting on tomorrow there is a hackathon. Maybe it is possible and you are interested in attending. More information you can find on that page. https://wirvsvirushackathon.org/ I have seen it a few minutes ago and I think maybe it is interesting for you. --Hogü-456 (talk) 21:55, 19 March 2020 (UTC)
- Here you can add the project if you want. I think it could help the other people. [4] --Hogü-456 (talk) 21:59, 19 March 2020 (UTC)
- @Hogü-456: Awesome! Good contribution. I was not able to add the project to the list. If someone wants to use Wikidata, great, I can make myself available to help any way possible. Just leave a message at my talk page. TiagoLubiana (talk) 17:55, 21 March 2020 (UTC)
How to represent putative treatments? edit
There are some drugs that appear in the scientific literature as putative treatments for COVID-19. For example, in the english Wikipedia page is said:
- "Authorities recommend chloroquine[108] and the Chinese 7th edition guidelines include interferon, ribavirin, chloroquine, or umifenovir.[109]."
Do you think that using drug or therapy used for treatment (P2176) with these drugs is suitable? TiagoLubiana (talk) 17:47, 21 March 2020 (UTC)
New branch of the project: statements! edit
Notified participants of WikiProject COVID-19
Hello,
I am quite excited about this new branch of the project called statements (Wikidata:WikiProject_COVID-19/Statements). The idea is to gather the statements behind core publications and Wikipedia articles. Why doing that?
- Check which core informations can and cannot be currently represented in Wikidata.
- Identify missing items and properties.
- Update Wikidata items so they capture the core informations of Wikipedia.
- Start to work on templates for, in the future, automatically create pages in Wikipedia for projects/languages that do not have pages for those items (See en:Wikipedia:Mbabel).
I hope you like the idea and contribute! It is still experimental, so new ideas of how to organize that are super welcome.
TiagoLubiana (talk) 02:04, 20 March 2020 (UTC)
I am participating in Hack the Crisis hackathon initiated by the the Finnish government and startups. I would like to reach out to this group for collaboration. I can introduce resources that Wikidata offers to the participants or propose ways in which they can contribute to it. Please help me focus on the most essential needs. You can also join the hackathon if you have the time. Cheers, Susanna Ånäs (Susannaanas) (talk) 07:19, 20 March 2020 (UTC)
- @Susannaanas: I only saw that now, I am sorry! The project is still organizing itself. I would say that a dire need is a trustable and updated source of number of number of cases (P1603) and number of deaths (P1120) for auto updating thee items for the specific locations of the pandemics. Infos on the Wikidata:WikiProject_COVID-19/Items#Genes,_genomes_and_proteins would also help a lot. Also, the queries here Wikidata:WikiProject_COVID-19/Queries can be interesting for the hackathon participants. If you think of any other ways this project can colab with hackathons, that would be great, and please let us know. If there is time, I might even write a page about this integration. I wish you a great hackathon! TiagoLubiana (talk) 18:03, 21 March 2020 (UTC)
How to indicate someone being the index case (Q1639798) in their country? edit
As the title says. --Trade (talk) 12:43, 20 March 2020 (UTC)
- subject --> part of --> national epidemic; qualifier: subject has role --> index case. --SCIdude (talk) 13:56, 20 March 2020 (UTC)
- @Trade:, @SCIdude:. Good question and ideas. Adding to the discussion, it seems that there is a different solution already modelled. See for example Thomas Eric Duncan (Q18249266) index case of (P1677) Ebola virus disease in the United States (Q17486597) and the property has index case (P1660). TiagoLubiana (talk) 17:46, 21 March 2020 (UTC)
Basic terminology edit
Hi all! I'm collecting basic terminology that people would need in order to be able to write articles about this pandemic as part of the same hackathon that Susanna Ånäs (Susannaanas) was talking about above this. If you'd like to add items to the table, all you need to do is add in the qid from Wikidata for the concept. If the concept doesn't have a qid, it can be added to the list below the table. I appreciate any and all help! -Yupik (talk) 07:42, 21 March 2020 (UTC)
- @Yupik: Awesome dashboard! I saw that most concepts were already "QIDed". Amazing. I have added your dashboard to Wikidata:WikiProject_COVID-19/Items. I am also trying to convert the Wikipedia pages to a "structured format", so to identify the concepts and relations used in such pages. Many of the concepts are already in your list, but perhaps you will enjoy taking a look at this page for example. TiagoLubiana (talk) 17:40, 21 March 2020 (UTC)
- @TiagoLubiana: Thank you, that is a great page! Susannaanas has made wonderful visualization of COVID-19-related topics, which should help out with the structured format. Would you be interested in helping us translate some hygiene-related bits and piece into Portuguese btw? -Yupik (talk) 18:23, 21 March 2020 (UTC)
- @Yupik: Sure, great! I have added it to my today's todo list. TiagoLubiana (talk) 19:06, 21 March 2020 (UTC)
- @TiagoLubiana: Thank you! -Yupik (talk) 19:35, 21 March 2020 (UTC)
Be careful when adding values from WHO via quickstatements edit
Hi, I recently discovered that quickstatements has what I believe is a bug that makes it ADD qualifiers and references to an EXISTING value if it already exist. This is very unfortunate.
Please check the pages you do not:
- add conflicting point-in-time statements
- lump references that refers has different dates under the same value statement
Please ensure that you remember to:
- add "refine time" qualifiers (for WHO 00:00)
- add timezone information (for WHO CET=UTC+1)
Details here: https://www.wikidata.org/wiki/User_talk:TiagoLubiana#Messy_edits_to_Q84081576 Thanks in advance for your attention to details.--So9q (talk) 19:50, 21 March 2020 (UTC)
Warning: Wrong_numbers_from_WHO_suspected edit
https://www.wikidata.org/wiki/Talk:Q84104992#Wrong_numbers_from_WHO_suspected --So9q (talk) 10:53, 23 March 2020 (UTC)
- Hello @So9q:. Apparently the WHO reports are quite messy themselves. It is surprising to me, as I would expect them to be a reliable source. I will avoid using WHO information to update Wikidata pages. If possible, could explain the problems you are locating in detail? WHO numbers report confirmed case numbers that are sent by local health organizations to WHO. TiagoLubiana (talk) 18:04, 23 March 2020 (UTC)
- The latest case numbers from Italy seem way off. So much that I suspect they are equivalent to the total cases. I therefore marked them with determination method=unknown. I trust the national sites over WHO, but not all countries have published their statistics, e.g. I just cannot find the numbers from Bosnia Hercegovina and the sources in WP are bogus with numbers newer than the retrieval date.--So9q (talk) 21:14, 23 March 2020 (UTC)
- I misinterpreted the italian source it seems. WHO numbers for Italy are correct, but lack behind.--So9q (talk) 11:01, 25 March 2020 (UTC)
- I don't think there is any problem with including the numbers reported by WHO as long as the statements specify they come from WHO. ·addshore· talk to me! 12:20, 24 March 2020 (UTC)
- The latest case numbers from Italy seem way off. So much that I suspect they are equivalent to the total cases. I therefore marked them with determination method=unknown. I trust the national sites over WHO, but not all countries have published their statistics, e.g. I just cannot find the numbers from Bosnia Hercegovina and the sources in WP are bogus with numbers newer than the retrieval date.--So9q (talk) 21:14, 23 March 2020 (UTC)
- Update: also wrong numbers from WHO reported for India, see https://www.wikidata.org/wiki/Wikidata:WikiProject_India/COVID-19_task_force. --So9q (talk) 09:43, 25 March 2020 (UTC)
WikiProject Clinical Trials edit
I am writing to invite anyone here to join in developing Wikidata:WikiProject Clinical Trials.
There are already COVID-19 clinical trial studies in progress. There will be increasing interest in gaining information about these studies from Wikidata.
Wikidata already has items for 320,000 trials with a ClinicalTrials.gov ID (P3098), which is 99% of all the trials. Right now I would like to request review and support for these property proposals:
- Wikidata:Property proposal/study type - a basic classification
- Wikidata:Property proposal/Chinese Clinical Trial Registry identifier - the Chinese government registry
If we can stay on top of COVID-19 trials then we can have a really interesting complete collection. Thanks. Blue Rasberry (talk) 17:30, 23 March 2020 (UTC)
Redundancy on COVID-19 pandemic (Q81068910) edit
Hello,
There is some redundancy in how the {{Q|Q81068910}] page is currently modeled, and that is annoying me a little bit.
We have country (P17) and has part(s) (P527) modelling the same cases, i.e. local instances of the global pandemic. I suggest that we remove all the country (P17) statements from the page, leaving only the links on has part(s) (P527), as these are clearer.
Do you Support or Oppose to that change?
Thanks TiagoLubiana (talk) 16:32, 24 March 2020 (UTC)
- Support--So9q (talk) 09:37, 25 March 2020 (UTC)
- Support--AntoineLogean (talk) 09:48, 28 March 2020 (UTC)
APIs for COVID-19 Data edit
Hi. I'm the PM for APIs at Wikimedia Foundation in the Core Platform Team. One thing I've noticed is that we're building up a big corpus of geographical data for COVID-19 cases and deaths. I've seen a couple of data analysts using the data on various WPs and Wikidata. I wonder if it would be helpful to tabulate that data and make it available as JSON or CSV feeds. Is this happening? Worth doing? --EProdromou (WMF) (talk) 16:46, 24 March 2020 (UTC)
- Hello, @EProdromou:, and thanks for your message. As of today, data about COVID-19 cases and deaths on Wikidata are still far from standardized. There are 3 main sources of numbers that seem to be used here and on Wikipedia, WHO situation reports, which are official but delayed, the Johns HOpkins dashboard and worldometers. There isn't a consensus on how this knowledge should be represented in Wikidata too. An API that gave fully referenced information on cases, deaths and recoveries (including location, source of information, specific time to which it refers to) would be quite useful for automatic, reliable updating of Wikidata items for when we reach such a consensus. The data is kind of already available at the Johns Hopkins dashboard github. What would be the source of the API you are thinking about? Thanks again! TiagoLubiana (talk) 22:03, 24 March 2020 (UTC)
- One thing that may be useful re: APIs, if you could give any advice over at openVirus I-15 on APIs to deposit from [R] to wikidata, since none of us over at that project have much experience with the wikidata API. T.Shafee(evo&evo) (talk) 03:12, 25 March 2020 (UTC)
- @talk:EProdromou (WMF): Hello, thank you again for the message. This meeting never happened, but it is still a very important and open topic. Would you still be interest on talkin more about that? Thanks TiagoLubiana (talk) 13:09, 4 May 2020 (UTC)
GIFs edit
Hello to all,
I started translating some of the GIFs about this topic to Portuguese. I think content is better absorbed if it is in the reader's language. I uploaded the .psd files to a folder in Github, so anyone (with Photoshop) can download them and translate the texts to their language without the task of cleaning the rasterized text in each frame. Let me know if you need help with them. Good contributions, Ederporto (talk) 16:51, 24 March 2020 (UTC)
droplet spread transmission edit
I tagged three potentially identical items (I'm the one who "said to be the same as") that may warrant merging: droplet infection (Q15304517), droplet infection (Q871752), and droplet infection (Q1557568). I'm not a health care expert, nor a data ontology enthusiast, so I invite others to take a look and see if they can be straightened out, merged, and/or clarified. Cheers, -Animalparty (talk) 21:11, 24 March 2020 (UTC)
- @Animalparty: I would think these three items can be merged, as they are wiki pages of different languages (fi/de/sv/cs/hu/pl) of the same mechanism. --Zhenqinli (talk) 21:43, 24 March 2020 (UTC)
- @Animalparty: Thanks for pointing that out. I believe all but droplet infection (Q15304517) can be removed. They have different names from other languages, true, but the instanciation and other relations seem weird. I have made the requests for deletion at Wikidata:Requests_for_deletions, but I would not oppose merging and fixing (as per @Zhenqinli:). TiagoLubiana (talk) 21:47, 24 March 2020 (UTC)
How are we standardizing the deaths by SARS-CoV-2 edit
How are we standardizing the deaths by SARS-CoV-2? I am using for cause_of_death=COVID-19 and I am also adding "significant_event=2020 COVID-19 pandemic in the United States". Is this what we are standardizing on? See Orlando Staton (Q88505446) and Floyd Cardoz (Q88584577). How many deaths entries have we recorded so far? --RAN (talk) 21:29, 25 March 2020 (UTC)
- @Richard Arthur Norton (1958- ): So far, I haven't seen any standardization per se. It seems that cause of death (P509) COVID-19 (Q84263196) does the job. [Wikidata:WikiProject_COVID-19/Queries#Image_grid_of_individuals_who_have_died_from_COVID-19_ordered_by_Wikidata_item_completeness This SPARQL query] might be of interest to you. This other query points to around 100 people in Wikidata that have died from COVID-19:
The following query uses these:
- Items: COVID-19 (Q84263196)
- Properties: cause of death (P509)
SELECT ?item ?itemLabel WHERE { ?item wdt:P509 wd:Q84263196. SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". } }
. TiagoLubiana (talk) 02:09, 26 March 2020 (UTC)
- This is complicated as people can die of other complications because they are weakened by the infection. Does this mean they die from the virus? Maybe not. Maybe we should model it something like: cause_of_death=COVID-19_or_during_COVID-19_infection which is broader.--So9q (talk) 10:38, 26 March 2020 (UTC)
- I believe that if people die from complications that arose due to COVID-19, both things could be related as cause of death (one is more proximal, and other more distal, but both sound to me like causes). There is a further insteresting aspect about this: what about COVID-19 negative deaths that happened during the pandemic? I bet that many deaths are happening (or will happen) due to the collapse of healthcare system. Something to represent that the pandemic contributed to their death, albeit not directly. I would endorse any modelling of that kind. TiagoLubiana (talk) 19:46, 26 March 2020 (UTC)
- Wikidata shouldn't be an endless spiral of reductionism. Practicality should trump technicality. I think a convention of "disease died from" = "cause of death" is most sensible. If someone is shot to death, is cause of death the bullet? The blood loss? The organ failure? Are cancer deaths due to cancerous cells, tumors, organ failure, or simply cancer (Q12078)? With COVD-19 deaths, the proximate (and practical) cause of death is COVID-19 (Q84263196), even if the ultimate cause of death is pneumonia (Q12192) (which in turn causes pulmonary aspiration (Q737510) which causes asphyxia (Q193840) which causes brain death (Q223867)...). Add qualifiers if needed. -Animalparty (talk) 22:23, 26 March 2020 (UTC)
- See below for how AIDS was handled to take care of such a problem with "AIDS" and "AIDS related death", which needs to harmonized. I am not sure two were needed but it does cover the problem under discussion. There are two causes now at AIDS and we need to standardize on one. --RAN (talk) 02:11, 27 March 2020 (UTC)
- See this question about U07.1 vs U07.2 usage on en.Wikipedia. These two codes are temporary emergency ICD-10 codes. U07.1 is for lab-confirmed COVID-19 and U07.2 is for clinically or epidemiologically defined COVID-19. The Polish medical authorities presently only classify U07.1 deaths as COVID-19 deaths; U07.2 deaths are excluded. It seems to me that it would probably be best to have two separate wikidata items for U07.1 and U07.2 (which at some time in the future - months? - will be renamed to standard ICD-10 codes), which are subclasses of COVID-19 (Q84263196). Some health agencies/medical sources may separate these, some (at least one) ignore U07.2, and some may give just the two together. Boud (talk) 15:55, 1 April 2020 (UTC) (Update: the Polish health agency issuing the instructions for disease classification added U07.2 as a COVID-19 classification as of 1 April 2020. The main Polish centrist newspaper said that there has been a lot of fuss about the issue. See the article for details and a ref.) Boud (talk) 00:22, 2 April 2020 (UTC)
- See below for how AIDS was handled to take care of such a problem with "AIDS" and "AIDS related death", which needs to harmonized. I am not sure two were needed but it does cover the problem under discussion. There are two causes now at AIDS and we need to standardize on one. --RAN (talk) 02:11, 27 March 2020 (UTC)
- Wikidata shouldn't be an endless spiral of reductionism. Practicality should trump technicality. I think a convention of "disease died from" = "cause of death" is most sensible. If someone is shot to death, is cause of death the bullet? The blood loss? The organ failure? Are cancer deaths due to cancerous cells, tumors, organ failure, or simply cancer (Q12078)? With COVD-19 deaths, the proximate (and practical) cause of death is COVID-19 (Q84263196), even if the ultimate cause of death is pneumonia (Q12192) (which in turn causes pulmonary aspiration (Q737510) which causes asphyxia (Q193840) which causes brain death (Q223867)...). Add qualifiers if needed. -Animalparty (talk) 22:23, 26 March 2020 (UTC)
1918-1920 flu pandemic deaths needs some TLC edit
We only have 285 1918 flu pandemic deaths formatted so they show up in a search, any ideas how to find the missing ones to harmonize them? --RAN (talk) 19:21, 26 March 2020 (UTC)
The following query uses these:
- Properties: cause of death (P509)
SELECT ?item ?itemLabel WHERE { ?item wdt:P509 wd:Q178275. SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". } }
- @ Richard Arthur Norton (1958- ): Hmm, good question. Maybe query for dead people in that time that cause of death is influenza (Q2840). Finding a reliable database would me, in my opinion, the best approach. Also, cause of death (P509) 1918-1920 flu pandemic (Q178275) does not seem super precise. If it is to be consistent with the current pandemic modelling, we would need an item for the flu itself, and another for the pandemic. TiagoLubiana (talk) 19:55, 26 March 2020 (UTC)
- I was just running the very query you suggested! Eventually we can harmonize all disease agent deaths the same way, I was just looking at HIV deaths too. --RAN (talk) 20:16, 26 March 2020 (UTC)
- We could also try to peruse relevant Wikipedia categories or perhaps mine relevant Wikipedia articles directly with queries like these:
- The following query uses these:
- Items: human (Q5)
- Properties: instance of (P31) , date of death (P570) , cause of death (P509)
SELECT DISTINCT ?item ?itemLabel ?dod ?cod ?codLabel WHERE { { SELECT DISTINCT ?item WHERE { SERVICE wikibase:mwapi { bd:serviceParam wikibase:endpoint "en.wikipedia.org" . bd:serviceParam wikibase:api "Generator" . bd:serviceParam mwapi:generator "search" . bd:serviceParam mwapi:gsrsearch "\"Spanish flu\"" . bd:serviceParam mwapi:gsrlimit "max" . ?item wikibase:apiOutputItem mwapi:item . } } LIMIT 100 } ?item wdt:P31 wd:Q5. OPTIONAL {?item wdt:P570 ?dod .} OPTIONAL {?item wdt:P509 ?cod .} SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". } } ORDER BY ASC(?dod)
While we are working on pandemic death data, can someone help me with the AIDS deaths. I know this is tangential, but people will be comparing notable people death rates. Currently the cause of death for is split between "AIDS" and "death from AIDS-related complications", I think we can switch the 200 with "AIDS" to the more specific "death from AIDS-related complications", which has about 500. See Keith Haring (Q485635) as an example to get the property numbers. Can someone help with the switch. I do not know how to automate something like this. --RAN (talk) 01:44, 27 March 2020 (UTC)
- @Richard Arthur Norton (1958- ): What you say seems correct to me but before doing this at scale I think it would be worthwhile to propose this as a data model for biographies. I started this at Wikidata:WikiProject_COVID-19/Data_models#Biographies. Perhaps for Keith Haring medical condition (P1050) should be HIV/AIDS (Q12199), the cause of death (P509) should be death from AIDS-related complications (Q4651894), and manner of death (P1196) should be natural causes (Q3739104). What do you think? How would the comparable situation work for COVID-19? Blue Rasberry (talk) 13:34, 28 March 2020 (UTC)
- I agree 100%! Is there a way to automate the task, I am not familiar with the tools for making changes on a large scale. I also have a list of 1918-1920 flu deaths that need to be harmonized. --RAN (talk) 18:40, 28 March 2020 (UTC)
- @Richard Arthur Norton (1958- ): The automation is the easy part. Either someone can help you with it, or you might be surprised at the options for trying it yourself. The much more difficult part of Wikidata is learning enough to have the conversations which are supporting evidence to justify harmonization. You are spot on with the HIV/AIDS comparison and we could ping Wikidata:WikiProject LGBT into a discussion. That might be enough, or additionally, we could also do an example for the 1918 flu pandemic. If you have like 3 similar cases, have an example for each, and can recruit someone to speak about each one and agree, then that makes the argument to reconcile them all. In general, people hesitate to do anything large scale without supporting evidence that the proposed massive change is worthwhile. I have done this before and I myself am not so sure about how to do things right in this case, but in general, having a greater number of examples and feedback from multiple communities of editors makes a stronger case. Thoughts? Blue Rasberry (talk) 20:16, 31 March 2020 (UTC)
Need two new property for "number of hospitalized cases" and for "number of home cases" edit
Hi all,
France (SpF) will no longer give, from 26 March 2020, the number of "total cases" (hospitalised + home) by region every day, but probably only once a week.
while waiting France (SpF) only give every day the number of "hospitalized cases" (number of people hospitalized in hospital).
what do I use as a property in "number of cases" P1603 to attach them?
quantity (P1114) with criteria used (P1013) ?
Or do I have to apply for two new properties :
- Wikidata:Property_proposal/number_of_hospitalized_cases
- Wikidata:Property proposal/number of home cases
thanks for answers--Viruscorona2020 (talk) 05:43, 27 March 2020 (UTC)
Notified participants of WikiProject COVID-19--- Jura 00:50, 28 March 2020 (UTC)
- @Viruscorona2020: Try adding qualifier valid in place (P3005) hospital (Q16917)? Dhx1 (talk) 05:37, 8 April 2020 (UTC)
- @Dhx1 please go to show Q83873593#P1603 and Q83873593#P1120 : for hospital (Q16917) , valid in place (P3005) didnt' but location (P276) did, thanks--Viruscorona2020 (talk) 05:48, 8 April 2020 (UTC)
Enter covid-19 affected and death cases for Switzerland at the canton level edit
As part of the HackZurich hackathon (https://codevscovid19.slack.com) I would like to make use of the open data provided by the canton of Zurich and enter the time development of infected and death cases for each Swiss canton. To start, I would do this manually so that I get a good understanding of the underlying metamodel. Then it would be nice to have a bot.
If I take as model the case of China, from the page https://www.wikidata.org/wiki/Q83872271 they have added an "includes" statement that links national outbreak with local outbreak instances. So I would need to create first for each canton a page "2019-2020 coronavirus outbreak in Swiss canton XX" and then, on this new page enter the time development via the property "number of cases" and "number of death". Is it correct ? – The preceding unsigned comment was added by AntoineLogean (talk • contribs).
- @AntoineLogean: You can create just the canton you are interested in. See Wikidata:WikiProject COVID-19/Data models/Outbreaks for properties (this is still being worked on). --- Jura 08:49, 28 March 2020 (UTC)
- @Jura: Not sure I follow you. If a take https://www.wikidata.org/wiki/Q87144034 as model is it ok? – The preceding unsigned comment was added by AntoineLogean (talk • contribs).
- Sure. I'd avoid the valid in place (P3005)-qualifier. Not sure what the "ranking"-qualifier is for. --- Jura 14:41, 28 March 2020 (UTC)
- @Jura: I have used quickstatements to create the canton specific items. Now I am blocked by an error when I try to add the number of death. I use the following commands:
Q88938850 P1120 1 P58 +2020-02-27T00:00:00Z/11 P854 "https://gd.zh.ch/internet/gesundheitsdirektion/de/themen/coronavirus.html#title-content-internet-gesundheitsdirektion-de-themen-coronavirus-jcr-content-contentPar-textimage_7"
But it does not accept the P58 statement. I have tried many things. I am stuck. Any ideas what is wrong ?- @AntoineLogean: Hello. For quickstatements, the following command should work:
- Q88938850|P1120|1|P585|+2020-02-27T00:00:00Z/11|S854|"https://gd.zh.ch/internet/gesundheitsdirektion/de/themen/coronavirus.html#title-content-internet-gesundheitsdirektion-de-themen-coronavirus-jcr-content-contentPar-textimage_7"
- Sometimes quickstatements runs into errors. You can try clicking on the reset errors button. If your commands are correct, it will work, so I believe you will be able to make it. I just run this line in QS (here) and had to reset a couple times. Sorry to see that people are dying in canton Zurich also. Lived there for a couple months, beautiful location, great people. TiagoLubiana (talk) 15:26, 30 March 2020 (UTC)
- By the way, I saw that the reference link page is constantly updated. This might be a problem. A solution might be using webarchive or other fixed reference sources. See the discussion here for some ideas.TiagoLubiana (talk) 15:26, 30 March 2020 (UTC)
- :: @TiagoLubiana: many thanks! Once I succeed to import the data I will work on the references. But I still get errors by the point in time statements see here my batch https://tools.wmflabs.org/editgroups/b/QSv2/27921/ I tried to reset the error but it does not seems to work. What other alternative do I have apart Quickstatements to perform a batch import ?
- @AntoineLogean: Hello. For quickstatements, the following command should work:
- @Jura: I have used quickstatements to create the canton specific items. Now I am blocked by an error when I try to add the number of death. I use the following commands:
- Sure. I'd avoid the valid in place (P3005)-qualifier. Not sure what the "ranking"-qualifier is for. --- Jura 14:41, 28 March 2020 (UTC)
- @Jura: Not sure I follow you. If a take https://www.wikidata.org/wiki/Q87144034 as model is it ok? – The preceding unsigned comment was added by AntoineLogean (talk • contribs).
Feedback on data model for countermeasures/strategies against the outbreak edit
Hello, I have been trying to make a data model for documenting the countermeasures/strategies that different actors, such as governments, implement against the outbreak. I now hope to get some feedback in this forum about if it is a useful model and what can be improved.
- First, I used the manual Non-pharmaceutical public health measures for mitigating the risk and impact of epidemic and pandemic influenza (Q88222459) from the WHO to create items for different categories of countermeasures, such as border closure (Q88707306), against outbreaks.
- Second, I created countermeasures in Sweden during the COVID-19 pandemic (Q88807626) as has part(s) (P527) to COVID-19 pandemic in Sweden (Q84081576)
- Third, I started creating items (only three so far) for each of the countermeasures that the Swedish authorities have taken, and connected them as has part(s) (P527) to countermeasures in Sweden during the COVID-19 pandemic (Q88807626).
(I read on Wikidata:WikiProject_COVID-19/Data_models that Wikidata:WikiProject Policies has a role, but I do not think it serves any purpose for what I am trying to do; that project seems to deal with policies in the sense of a written document, not as strategies)
If you want to have a look, countermeasures in Sweden during the COVID-19 pandemic (Q88807626) is probably the best way to start.
I am quite new to Wikidata, any advice would be helpful and appreciated! JoranL (talk) 21:23, 29 March 2020 (UTC)
- @JoranL: Hello and thanks for your contributions. That is some modelling work going on regarding countermeasures at the property proposal Wikidata:Property_proposal/countermeasure. Your opinion there would be of great value, as you already gave some thought into this issue. It is still very open, so I think every idea matters. TiagoLubiana (talk) 15:08, 30 March 2020 (UTC)
- @TiagoLubiana: Many thanks, I hadn't seen that, I will join the discussion there.
Archive: April 2020
Talk on Wikidata, COVID-19 and Education edit
Hello everyone,
Tomorrow there will be a talk discussing the Wikimedia projects and how they can help the efforts to tackle the current pandemic.
I will copy the Facebook communication ipsis litteris here.
"The board of the Wikipedia & Education User Group invites you to attend our user group's next Open Meeting, one week from today, on Thursday, April 2, at 15:00 UTC, as always via Zoom. We'll be discussing the Wikimedia & Education community's response to the COVID-19 pandemic. Guest speakers include:
- Nichole Saad and Melissa Guadalupe Huertas from the WMF Education Team will talk about their strategy and how you can help. (https://lists.wikimedia.org/pipermail/education/2020-March/002511.html)
- User:TiagoLubiana, a graduate student in Computational Biology from the University of São Paulo, and a leading editor at Wikidata:WikiProject COVID-19, will discuss Wikidata's work around COVID-19 and how the Wikimedia and education community can help. (https://www.wikidata.org/wiki/Wikidata:WikiProject_COVID-19)
Per usual, the board will provide an update on user group activities, and we'll offer an opportunity for others to briefly share what they've been up to in light of the COVID-19 pandemic. Join us!
What: Wikipedia & Education User Group Open Meeting When: Apr 2, 2020 15:00 UTC
Where: https://zoom.us/j/759620545 Meeting ID: 759 620 545" This is the facebook post made by (@LiAnna_(Wiki_Ed):) [Facebook post] I am going to present a little bit about this project, and share a few ideas on how the Education community might help. If anyone has insights or tips, I can add anything you feel might be important to the presentation. Best, TiagoLubiana (talk) 03:13, 2 April 2020 (UTC)
I have joined the call at 15h00 (Zürich time) but I was alone ☹️ Antoine Logean (talk) 18:20, 2 April 2020 (UTC)
- @AntoineLogean: Sorry about the confusion. The time was 3:00 PM Coordinated Universal Time (UTC) which is
5:00 PM in Zürich, Switzerland. TiagoLubiana (talk) 20:06, 2 April 2020 (UTC)
A logo for this wikidata project ? edit
- @AntoineLogean: Nice idea and cool logo! Is this the final version? Maybe we could leave this open for some days, so people can share their thoughts and perhaps bid other logos. What do you think? TiagoLubiana (talk) 14:14, 2 April 2020 (UTC)
Sure. It is just a draft anyway. Antoine Logean (talk) 18:12, 2 April 2020 (UTC)
- Looks great to me. --Daniel Mietchen (talk) 03:43, 6 April 2020 (UTC)
- Beautiful, @AntoineLogean: !
- @AntoineLogean, TiagoLubiana: My recommendations would be:
- Have as svg rather than jpg
- I quite like the hand-drawn look, but it doesn't quite work at the sizes we'd be using (200px and 20px wide versions). Options:
- Lean into it more so it's obvious at small size
- Revert to a cleaner basic shapes look
- Use the colour and order of bars in the main wikidata logo
- Happy to assist is useful. T.Shafee(evo&evo) (talk) 01:00, 7 April 2020 (UTC)
- @AntoineLogean, TiagoLubiana: My recommendations would be:
- @AntoineLogean, TiagoLubiana, Daniel Mietchen: Hearing the recommendations from @Evolution and evolvability:, I built this version, let me know what you all think. File:Wikiproject COVID-19 - logo.svg Ederporto (talk) 20:28, 14 April 2020 (UTC)
- There is also this version used in the enwiki File:SARS-CoV-2 (Wikimedia colors).svg Ederporto (talk) 20:31, 14 April 2020 (UTC)
- @Ederporto: Super nice! for me, we could already use that logo. TiagoLubiana (talk) 22:46, 14 April 2020 (UTC)
- @Ederporto: Nice work! I like File:Wikiproject COVID-19 - logo.svg. Although I like the idea of a unified logo for both the WD and WP wikiprojects, I think your design is much nicer than the one being used at WP, and it's reasonable to have a separate logo for the WD project. T.Shafee(evo&evo) (talk) 23:37, 14 April 2020 (UTC)
Voting process edit
In a similar situation, in December 2019 meta:Wikimedians for Sustainable Development had a vote about its logo as documented at meta:Wikimedians for Sustainable Development/Logo. There was community demand at that project for this process. Here at WikiProject COVID-19, I do not see the same level of demand for conversation about the logo. I think it would be acceptable to close this logo discussion after a few comments and use any logo. If a lot of people really requested it, then the process could be as formal as what this other project did. I see no reason to be more complicated than what this other project did. Blue Rasberry (talk) 15:27, 20 April 2020 (UTC)
- I got this idea from user:Daniel Mietchen, who proposed it in the meetup at Wikidata:WikiProject COVID-19/Project meeting. Blue Rasberry (talk) 15:28, 20 April 2020 (UTC)
- @Bluerasberry: Great! I have set up a voting page on Wikidata:WikiProject_COVID-19/Logo. TiagoLubiana (talk) 23:40, 20 April 2020 (UTC)
Wikipedia templates edit
Hello,
One thing that would greatly aid the effort for updating Wikipedia on the COVID situation in languages other than English would be to have proper templates so this information shows up there.
Anyone interested in leading a task force for building such templates? I could try and start it, but I have no previous experience with templates.
Here are some links that might be interesting for the matter:
- Wikidata templates tutorial in Wikipedia
- Project on bridging templates and Wikidata
- Module:WikidataIB
- Module:Wd
- Wikipedia:Wikidata
TiagoLubiana (talk) 01:24, 5 April 2020 (UTC)
- Have you looked into Template:Databox? Currently deployed on 16 Wikipedias. --Daniel Mietchen (talk) 03:47, 6 April 2020 (UTC)
Full individual Singapore cases table proposed for deletion edit
The table of all individual Singapore SARS-CoV-2 cases at en.Wikipedia has been proposed for deletion:
- prior discussion: w:Template talk:2019–20 coronavirus pandemic data/Singapore medical cases#Requested move 28 March 2020
- TfD - deletion discussion: w:Wikipedia:Templates for discussion/Log/2020 April 3
I don't have a strong opinion on this particular table, but if Wikidata people want the data, then probably now is the time to get involved, rather than trying to undelete the page later.
I do worry that this might set a precedent for deleting other COVID-19 related template tables on en.Wikipedia.... Boud (talk) 12:05, 3 April 2020 (UTC)
- @Boud: Thanks for your comment. I do not have a strong opinion on that. I saw that the sources would be there anyway, so if the Wikipedia community decides to remove it, we could build it from the sources in Wikidata later. And I hope this won't set a precedent too. Maybe other people here in the project will have different visions on the matter. TiagoLubiana (talk) 17:26, 3 April 2020 (UTC)
- @TiagoLubiana: By "the sources would be there anyway", do you mean that the Singapore Ministry of Health has well-organised static web pages with the info and there's a fair chance of them remaining stable for a long time? I noticed that the refs in the template proposed for deletion are all repeat refs, so deleting the template will clearly not delete the sources. I seem to remember that using repeat refs only is what I encouraged many many weeks ago - but checking now, I couldn't find or remember where the sources of the refs are, in the sense of full {{cite web|...}} type references. Did you notice where the full wikified sources are? Boud (talk) 00:06, 4 April 2020 (UTC)
Structuring effects and impacts of the pandemic edit
I am trying to recreate / mimick the enwiki COVID-19 template 2019–20 coronavirus pandemic with Wikidata. The template lists issues and impacts. Wikidata has at least facets and effects. They are a mixed bag of items both. Is anyone working on these, do you have suggestions for structuring them? – Susanna Ånäs (Susannaanas) (talk) 00:44, 4 April 2020 (UTC)
- @Susannaanas: Hello, what exactly you are trying to recreate? Would it be some way to link COVID-19 pandemic (Q81068910) and template items such as human rights issues related to the COVID-19 pandemic (Q87796964) and media coverage of the COVID-19 pandemic (Q88293962)? TiagoLubiana (talk) 01:49, 5 April 2020 (UTC)
- I am preparing to make a dashboard of COVID-19 topics for translation and editing purposes, see here: https://wikidocumentaries.wmflabs.org/wiki/COVID-19_Global_Dashboard. Translators find the enwiki template a useful overview. I am thinking how to regenerate something similar by querying Wikidata. For that purpose many topics would need more structure in Wikidata. Arranging topics can be a purpose that this dashboard could also serve. – Susanna Ånäs (Susannaanas) (talk) 07:32, 5 April 2020 (UTC)
- @Susannaanas, TiagoLubiana: See Template:Zika for a prototype. --Daniel Mietchen (talk) 03:53, 6 April 2020 (UTC)
- Thank you this is a really useful example! – Susanna Ånäs (Susannaanas) (talk) 06:47, 6 April 2020 (UTC)
- I would like to be able to produce this dynamically from Wikidata. It is especially difficult with key personalities or organizations whose connection to the outbreak is not recorded in Wikidata, such as World Health Organization (Q7817). – Susanna Ånäs (Susannaanas) (talk) 07:15, 6 April 2020 (UTC)
- So I am wondering if there should be a "tag" to add to relevant topics and then more ontology-oriented people could specify the links more precisely. The tag can be a focus list, for example. Any support, or if it has already been done? How could wikipedians add this tag easily, a gadget? – Susanna Ånäs (Susannaanas) (talk) 07:30, 6 April 2020 (UTC)
Wikidata is one of the topics on the Biohackathon covid-19-bh20 edit
The biohackahton is an annual hackathon with editions in Japan and Europe. Given the COVID-19 pandemic, there will be a virtual edition from April 5th until April 11th, specifically aimed at tackling the COVID19. The program is similar to the usual wikimedia hackathons where in idea's are pitched and then progress is self-organise and regularly reported to the wider audience. Amoung the different topics, there is a one on. Wikidata and Wikibase, which will give us the opportunity to align many other resources with Wikidata. You can join this hackathon by adding your name to [5] and joining Slack and the #wikidata channel there.
The program start with the 2-minute introductions on Sunday, after which different subprojects, sometimes together with other groups, will emerge. --Andrawaag (talk) 06:01, 4 April 2020 (UTC)
- @Andrawaag: That is most welcome. Any update / projects that Wikidatans in particular should join? Sj (talk)
Graphs from queries edit
Just to let you know, User:Abbe98 is showing some potential in this video of how the results from a query could be used in a graph. Ainali (talk) 17:57, 4 April 2020 (UTC)
- @Abbe98, Ainali: Looks promising. Have you found that Phabricator ticket? --Daniel Mietchen (talk) 04:29, 6 April 2020 (UTC)
- @Daniel Mietchen: Yes it's T226250.
What should the default Scholia queries on COVID 19 be? edit
Currently, Scholia has two logical pages on COVID-19:
The topic page returns mostly sensible results, though much of the author information is missing. The disease page returns a list of clinical trials (only 15?) and a list of "Possibly related diseases", which is very misleading (based on symptom overlap), with the final query apparently broken. The question is therefore: What queries should it ideally present, and what tasks need to be done to ensure those queries return accurate results? T.Shafee(evo&evo) (talk) 10:46, 6 April 2020 (UTC)
Notified participants of WikiProject Scholia
- Hi @T.Shafee:, agreed on the "Possibly related diseases". It should say what the query does: "Diseases with similar symptoms". Do you want to file an issue for that, or shall I? --Egon Willighagen (talk) 14:07, 7 April 2020 (UTC)
- @Egon Willighagen: Good point. I've added it to the existing scholia I-851. T.Shafee(evo&evo) (talk) 23:45, 7 April 2020 (UTC)
Lockdown being the best solution to curb the spread of covid-19 edit
Hi@joshpowon88 enforcing a lockdown to countries is the best solution as the spread of corona virus will be limited hence people contaminated with the virus will help reduce the population or number of people to be contaminated by the virus by staying at home and also quarantinig themselves
Current data edit
Should we set preferred rank for the latest value? --Infovarius (talk) 15:11, 8 April 2020 (UTC)
- @Infovarius: Good point. This recently has been put into discussion at Wikidata_talk:WikiProject_COVID-19/Data_models/Outbreaks. I believe that there are cases in which preferred rank is useful, specially for the integration with Wikidata pages (the simple syntax in this Wikipedia article only works for preferred rank. However, we have not reached any consensus. TiagoLubiana (talk) 22:51, 8 April 2020 (UTC)
Modeling case counts + disease outcomes + line-list data edit
Data used for modeling the spread of an outbreak includes
- Test coverage: reports of how many tests are done, according to what model (sentinel testing, testing the sickest, testing all potential candidates) [and perhaps: average time for test results, test facility # and type]
- Test outcomes: breakdown of tests into positive/negative/inconclusive/pending
- Treatment of illness: breakdown of treatment - isolation at home, hospitalization (fever clinic, in-patient, ICU), on ventilator
- Outcomes: in quarantine / released; in hospital / discharged; death
- Line-list data: time + location of infection (w/ degree of certainty), current treatment/outcome, contacts traced?, list of contacts (pseudonymized: by ID)
- Local policy implementation (taxonomy of policies by geography)
- Questions
- What of the above are we currently able to capture?
- What do we need new properties for?
Testing + outcome + poilicy data seem important and easy to verify. Line list data is essential for modeling and maps like Hong Kong's city map, but much larger (1000s of entries for one epidemic).
Sj (talk) 15:54, 8 April 2020 (UTC)
Properties and style edit
- How should we capture faceted data? E.g. "total tests" but also positive, negative, and pending tests (which should add to the same)
- What constraints/qualifiers are appropriate?
- In use now: for deaths - location? seems wrong. for all data points - refine-date (for last-update time)
Proposal subproject: a data registry edit
We could use a registry for datasets and time-series data that are essential to understanding the pandemic, and in wide use on the projects. see Wikidata:WikiProject COVID-19/Registry for details. Sj (talk) 20:20, 10 April 2020 (UTC)
Wikidata Lab XXII WikiProject COVID-19 edit
Hello all,
Tomorrow I will present in a Wikidata Lab about the project Covid-19 and would like your help!
The Wikidata Lab is part of a series of events for building capacity related to Wikidata editing here in Brazil. The event is going to be in English this time, even though the page for the event is in portuguese:
- Everyone is welcome to join.
- Many Wikidata editors will be there and any suggestions of which parts of the project need more editing would be really appreciated.
If you have a suggestion of an activity, please comment below.
Thanks! TiagoLubiana (talk) 15:20, 13 April 2020 (UTC)
- That was super, Tiago. We could use more like this, maybe casual working sessions w/ open audio channels (sth like discord?) Sj (talk) 14:15, 14 April 2020 (UTC)
- @Sj: Yes, we should definitely set up this kind of things! Perhaps regular open meetings? So we can more easily advance some points. TiagoLubiana (talk) 16:40, 14 April 2020 (UTC)
Regular project meetings / group editing? edit
Hello,
Inspired by the virtual bio hackathon and today's Wikidata lab (and Sj suggestions) I believe we could benefit of having some (open!) meetings with the participants of this project and any other interested editors. I am raising this discussion so we can share opinions about formats and platforms for this, and possible dates/times for those that would be interested in participating.
Notified participants of WikiProject COVID-19
TiagoLubiana (talk) 16:47, 14 April 2020 (UTC)
- I would join video meetups daytime EST/New York. Blue Rasberry (talk) 17:02, 14 April 2020 (UTC)
- Yes! (EEST) – Susanna Ånäs (Susannaanas) (talk) 17:49, 14 April 2020 (UTC)
- I can update Africa data every night --Nehaoua (talk) 17:54, 14 April 2020 (UTC)
- Most evenings b/t 2200-2330 I am updating US Covid data on the covidtracking project (in a spreadsheet! yow... until we can do it in WD itself) but otherwise free in the evenings, from 1600 ET onward. Sj (talk) 20:22, 14 April 2020 (UTC)
- I would be interested! (anytime will do) ·addshore· talk to me! 22:19, 14 April 2020 (UTC)
- Please count me in. --Csisc (talk) 22:55, 14 April 2020 (UTC)
- I'd love to tag along! --Librarian lena (talk) 21:03, 15 April 2020 (UTC)
- @Susannaanas, Bluerasberry, Nehaoua, Sj, Addshore, Csisc: Great! Timing might be a bit complicated, as he have a 7-hour gap between EEST to EST. Can we have a call, then, on monday? We need some time to organize and be sure that people are aware. I suggest we make an 1 hour call at 15:00 UTC which is:
- 12:00 EST/New York
- 13:00 Brazil
- 18:00 EEST
- @Susannaanas, Bluerasberry, Nehaoua, Sj, Addshore, Csisc: Great! Timing might be a bit complicated, as he have a 7-hour gap between EEST to EST. Can we have a call, then, on monday? We need some time to organize and be sure that people are aware. I suggest we make an 1 hour call at 15:00 UTC which is:
- Maybe User:Sj can make an exception for this first call? If not, we can think about other times . I am also not sure which tool would be the best (suggestions would be awesome). Eventually we can create a page for the calls, with open agendas for the meetings and so on. TiagoLubiana (talk) 22:09, 15 April 2020 (UTC)
- TiagoLubiana: The time is adequate. It is 16:00 in Sfax, Tunisia. I propose to use Google Meet. --Csisc (talk) 10:21, 16 April 2020 (UTC)
- Time is great for me Blue Rasberry (talk) 20:43, 16 April 2020 (UTC)
- @Susannaanas, Bluerasberry, Nehaoua, Sj, Addshore, Csisc: Great! I created a page for the first Wikiproject Call on Monday, 15:00 UTC . Just to confirm, I actually was mistaken and 15:00 UTC is 11:00 AM in New York. We can move also further discussions to that page. TiagoLubiana (talk) 16:44, 17 April 2020 (UTC)
- That works for me. I can make time regularly at 15:00 UTC, just noting that I'm naturally also online doing data-work in the evenings ET. Sj (talk) 18:58, 17 April 2020 (UTC)
- TiagoLubiana: That works for me. --Csisc (talk) 21:08, 17 April 2020 (UTC)
- That works for me. I can make time regularly at 15:00 UTC, just noting that I'm naturally also online doing data-work in the evenings ET. Sj (talk) 18:58, 17 April 2020 (UTC)
- Maybe User:Sj can make an exception for this first call? If not, we can think about other times . I am also not sure which tool would be the best (suggestions would be awesome). Eventually we can create a page for the calls, with open agendas for the meetings and so on. TiagoLubiana (talk) 22:09, 15 April 2020 (UTC)
Stay at home orders and other such "suggestions" edit
I just spotted stay-at-home order issued in the United States in response to the COVID-19 pandemic (Q88509703). Has anyone else been modeling these shelter in place and stay at home orders for other countries in some way? I had a quick look but this is the only one I could see. ·addshore· talk to me! 22:48, 14 April 2020 (UTC)
- @Addshore: there is organized response related to outbreak (P8045) with some uses. --- Jura 22:53, 14 April 2020 (UTC)
- Thanks, yes 6 usages so far, hopefully we can continue to model and add these! ·addshore· talk to me! 22:56, 14 April 2020 (UTC)
- There's this: Data_models/Curfews_and_lockdowns – Susanna Ånäs (Susannaanas) (talk) 12:52, 20 April 2020 (UTC)
People who died from COVID-19 edit
Wikidata:Lists/corona_virus_deaths got quite a lot of visits the other day [6]. --- Jura 06:58, 15 April 2020 (UTC)
Preprints listed in Wikidata? edit
I'm not finding many preprints listed as items in wikidata. Is there any policy against this? E.g. there are a few hundred preprints relating to COVID-19 in BioRXiv (example table). Would it be reasonable to start creating items on them and if so, are there any appropriate tools? T.Shafee(evo&evo) (talk) 10:24, 15 April 2020 (UTC)
- @Evolution and evolvability: There are already around 192 preprint articles from biorxiv on Wikidata: https://w.wiki/MtQ. You can use d:Help:QuickStatements. John Samuel (talk) 11:52, 15 April 2020 (UTC)
- @Jsamwrites: Thanks - useful to know. There are probably about 105 preprints total, so quite a gappy dataset currently! Having the full covid-19 set up would be a good start. I guess it's just a case of trying to export from bioRxiv, medRxive etc in a way that can be formatted into a QS. I have to admit, I was hoping that there was a SourceMD equivalent somewhere that I didn't know about! T.Shafee(evo&evo) (talk) 12:21, 15 April 2020 (UTC)
Hospital admissions (entry) and discharges (exit) edit
How do I enter the numbers for:
- number of hospital admissions per day : P8049 + P5822 ?
- number of hospital discharges per day : P8049 + P2225 ?
--Antidote2020 (talk) 03:00, 18 April 2020 (UTC)
Cross-reference literature database of WHO ? edit
Notified participants of WikiProject COVID-19
Should we do this? If yes, with catalog code (P528), described at URL (P973) or a dedicated property? --- Jura 15:18, 18 April 2020 (UTC)
- Support dedicated property, --Antidote2020 (talk) 03:20, 19 April 2020 (UTC)
- Support dedicated property too, but I'd be okay with any of the ways TiagoLubiana (talk)
Ok, thanks for your input. I proposed it at Wikidata:Property proposal/COVIDWHO ID. --- Jura 09:42, 21 April 2020 (UTC)
Graphs are temporarily unavailable due to technical issues. |
It's at COVIDWHO ID (P8150) --- Jura 08:41, 29 April 2020 (UTC)
Co-topic graph edit
In the same way that scholia calculates a co-author graph, I've been working on visualising co-topic graphs for topics that appear together as main subject (P921)s in publications. See the initial results below drawing from wikidata and visualising using [R] and D3:
- Website: https://ts404.shinyapps.io/TopicNetwork
- Source code: https://github.com/TS404/TopicNetwork
- Future plans: https://github.com/petermr/openVirus/issues/40
Feedback and ideas always appreciated! T.Shafee(evo&evo) (talk) 02:36, 20 April 2020 (UTC)
SARS-CoV-2 Cell Lines to Wikidata edit
Hello all, Does anyone here if previous work has been done on Wikidata regarding cell lines?
I was thinking about:
- 1 - adding CLO (http://www.clo-ontology.org/) cell lines to WIkidata properly citing the ontology. Wikidata uses a CC0 license, while CLO is CC-BY. Does that make it impossible to auto add the cells?
- 2 - Catalog cell lines used for SARS-CoV-2 (as per https://web.expasy.org/cellosaurus/sars-cov-2.html) and match them to articles by using the "describes a project that uses (P4510)" to relevant articles.
Step 1 is not needed for step 2, but it would make it more generalizable.
Thanks, TiagoLubiana (talk) 21:04, 21 April 2020 (UTC)
Tabular case data edit
Wikimedia Commons has the ability to store tabular case data for this outbreak in a format that's more convenient than a series of Wikidata statements. This should make it easier to keep the data more up-to-date, especially at the subnational and local level. On the English Wikipedia, w:Template:Medical cases chart is able to automatically parse these tables into bar charts. I've proposed Wikidata:Property proposal/tabular case data to formally link items about outbreaks to the corresponding case data. – Minh Nguyễn 💬 02:06, 29 April 2020 (UTC)
- @Mxn: Nice! There are some Wiki communities that are using data stored in properties (e.g. fr:Modèle:Infobox_Épidémie), so a full migration to tabular would not be desirable. But indeed, I do not like the huge number of values for number of deaths (P1120) and number of cases (P1603) that are currently used to track this data. I support fully that we additionally have a count tabular data. What about making a property that would accept number of deaths (P1120), number of cases (P1603), number of recoveries (P8010) and so on as qualifiers? In that way, we could have legacy information on the tabular format, and current information on the specialized properties. TiagoLubiana (talk) 22:38, 29 April 2020 (UTC)
Reader Vector Insight on Covid readership edit
Hi all, I wanted to highlight another piece of data that we were able to develop within the Foundation. Collaborating with User:MGerlach (WMF), we were able to develop a list that helps us learn from reader behavior, which articles that folks are likely to engage with beyond the “obvious” topics related to Covid. By tracing the paths that readers take while reading Wikipedia, we can see that a lot of readers are asking a lot of questions about contextual topics related to the public health response to the pandemic.
We published this data on a page on meta, and we would appreciate any feedback on how such information might be useful for you as editors or organizers. Astinson (WMF) (talk) 18:21, 29 April 2020 (UTC)
- @Astinson (WMF): Hello, congrats on the super interesting work. @Susannaanas: has been working with gathering items related to covid-19 on the Focus list page. Susanna, I've tagged you here because I believe this is both interesting to you and you are likely able to give good feedback on it :) . Also, @Daniel_Mietchen: is interested in navigation templates for items related to COVID-19 (as happened for the WP Zika Corpus) and might be interested in this discussion. TiagoLubiana (talk) 22:29, 29 April 2020 (UTC)
- @TiagoLubiana: Thanks for the ping. I had an initial look, and it seems very useful indeed. Will leave more detailed feedback over there. --Daniel Mietchen (talk) 23:59, 29 April 2020 (UTC)
- @TiagoLubiana: Thank you for my part as well! These topics are tied to each other. The focus list tries to capture topics that wikipedians have connected to the COVID-19 topics, and these connections have been queried through pagelinks and categorymembers in different language Wikipedias. Pinpointing items that don't have a stated connection to the "COVID-19 ontology" I am envisioning those connections could more easily be designed and made into data models. Having everything in a neat ontology would allow creating a dynamic navigation template that you mention. I added the items that were identified in the research to the focus list. 119 were new and 81 existing. Cheers, Susanna Ånäs (Susannaanas) (talk) 04:05, 30 April 2020 (UTC)
Hello all,
In the past weeks, me and User:Jvcavv have been adding code to this GitHub repository [7].
Specifically, there are folders related to:
- SARS-CoV-2 related PPI interactions
- Semiautomatic generation of distribution maps
- [https://github.com/lubianat/wikidata_covid19
/tree/master/sandbox/data-by-state updating case and death counts for Brazilian states]
Everyone is welcome to take a look, contribute to the source code, open issues, discuss implementations and so on.
Also, feel free to add your wikidata-related code there too, as this facilitates interactions.