User talk:Egon Willighagen/Archive 2020

Re: Plz check for duplicates

Thank you Egon for the information. While reviewing my last entries, I tried to merge two items (8-Chloro-1-octene and 8-Chlorooct-1-ene), but when I attempt it I receive the following message: "Currently only Qid/Lid is a valid input", and I can't continue. Do you know how to solve it?--Fran Ara (talk) 09:25, 30 January 2020 (UTC)

Just solved it, using the Q-number instead of the item name. Thank you anyway.--Fran Ara (talk) 09:34, 30 January 2020 (UTC)

DSSTOX compound identifier

I found that we do not have a property for this.--GZWDer (talk) 15:27, 30 January 2020 (UTC)

WikiCite Satellite Cologne 2020

Hi Egon - you have added yourself to the list of people who are interested in the WikiCite Satellite Cologne 2020 meeting. The call for submissions is still open and we would love to see your contribution! Let me know if you have any questions. Best wishes!

Konrad Foerstner (talk) 18:18, 12 February 2020 (UTC)

WikiProject COVID-19 meeting

Hello Egon,

How are you doing? Tomorrow (Monday the 20th) we will have an open group call on Google Meet to discuss the direction of the WikiProject COVID-19 and decide on priorities, to better organize the next steps.

You are more than welcome to join us if you would like (and have the time)!

The talk is scheduled for tomorrow at 15:00 UTC, at this link: https://meet.google.com/tar-ufru-syb

You can find more about it, and perhaps add your name to the list, here: Wikidata:WikiProject_COVID-19/Project_meeting

Thank you very much!

All the best,

TiagoLubiana (talk) 23:50, 19 April 2020 (UTC)

Call Wikiproject COVID-19 tomorrow (Monday, May 4th)

Hello Egon,

How are you doing?

I am here to invite you to the Wikiproject COVID-19 call tomorrow (Monday, May 4th). If you are able to join, the call (at this link: https://meet.google.com/vtp-ighz-npx) will happen tomorrow at 15:00 UTC.

As a note, 15:00 UTC is:

  • 11:00 AM in New York, USA
  • 12:00 PM (noon) in São Paulo, Brazil
  • 4:00 PM in Tunisia
  • 6:00 PM Eastern European Summer Time (EEST)

If you want, also feel free to add topics at the meeting page!

Cheers, TiagoLubiana (talk) 22:29, 3 May 2020 (UTC)

Call Wikiproject COVID-19 tomorrow (Monday, May 18th)

Hello Egon,

How are you doing?

I am here (once again) to invite you to the Wikiproject COVID-19 call tomorrow (Monday, May 18th).

Have you been working on COVID-19 related things in the past few weeks?

If you are able to join, the call (at this link: https://meet.google.com/vtp-ighz-npx) will happen tomorrow at 15:00 UTC.

As a note, 15:00 UTC is:

  • 11:00 AM in New York, USA
  • 12:00 PM (noon) in São Paulo, Brazil
  • 4:00 PM in Tunisia
  • 6:00 PM Eastern European Summer Time (EEST)

If you want, also feel free to add topics at the meeting page!

Cheers, TiagoLubiana (talk) 23:50, 17 May 2020 (UTC)

Wikiproject COVID-19 call tomorrow (1st of June)

Hello, Egon,

How are you doing?

This is a friendly reminder of (and an invitation for) tomorrow's (Monday, 01 of June) Wikiproject meeting, at 15:00 UTC as usual, and at the usual link: https://meet.google.com/vtp-ighz-npx .


I was wondering which gaps you see on Wikidata from a biological perspective (related to COVID-19, more specifically). I am thinking of ways the project can be of greater help to the bio community.

If you want to add something to the agenda in advance, feel free to just edit Wikidata:WikiProject_COVID-19/Project_meeting.

I hope you can make it! :)

Best,

TiagoLubiana (talk) 20:39, 31 May 2020 (UTC)


Invitation for the WikiProject COVID-19 call tomorrow (Monday, 15 of June)

Hello Egon,

How are you? I hope things are all right.

I would like to invite you to tomorrow's (Monday, 15 of June) call of the Wikiproject COVID-19 at 15:00 UTC.

This is the link for the call: https://meet.google.com/vtp-ighz-npx

This is the link for the etherpad: https://etherpad.wikimedia.org/p/wikidata_covid_notes

The WikiProject seems to be losing momentum. The perception of COVID-19 as an enormous problem is fading (at least in Brazil). That might be a sign of things getting better, but it is also very dangerous (as things might not be that much better yet).

In tomorrow's call, I would like to discuss with you two important and related questions:

- How can this WikiProject best serve the anti-COVID-19 effort?

- What can this project offer for the post-COVID-19 Wikidata world?

As usual, if you have any topics to add, you can do so either beforehand at the Project Meeting Page or at the meeting.

I hope you can make it!

All the best,

Tiago

TiagoLubiana (talk) 00:38, 15 June 2020 (UTC)

Interview Invitation

Greetings,

I noticed your editing stats in Wikidata, which led me to look up your profile. Thank you for all the great work!

I’m reaching out to you because I’m working on a research project about understanding what motivates editors like you to contribute to Wikidata. We’re also interested in learning about how you feel your contributions are being used outside of Wikidata. Since you are such an active community member, I thought you might also be interested in helping to build the broader community’s knowledge about Wikidata, and why it matters.

If you’re interested, please fill in a questionnaire, and let’s schedule a time to talk over Zoom, or whichever platform you prefer. The conversation should take about 30 min.

Hope you have a great day,

Chuankaz (talk) 14:49, 14 July 2020 (UTC)

protein family membership

Hi Egon, about the SARS-CoV-2 proteins, Pfam ID is for protein family items, not their members. For the members, if you look at how the bot does it for all other proteins, we say protein-->part of-->family item. --SCIdude (talk) 15:39, 22 July 2020 (UTC)

Hi @SCIdude: sorry for the late reply. I will look into it. --Egon Willighagen (talk) 06:38, 25 July 2020 (UTC)
So far, I:
1. created an EntitySchema for protein family: https://www.wikidata.org/wiki/EntitySchema:E233
2. updated spike glycoprotein [SARS-CoV-2] (Q87917585), but for this I had to create two missing Pfam entries.
This is a problem. How complete is Pfam in Wikidata? --09:07, 25 July 2020 (UTC)
Not very. The canonical database in WD is InterPro. Also, Pfam is a subset of InterPro, as far as I am aware. The InterPro import is also quite recent. As to CoV-2, there is a list of family items at the CoV-2 project to choose from. Moreover, in principle it's already automated: the bot should create these statements from the respective UniProt data, no idea why it does not do it here. --SCIdude (talk) 13:49, 25 July 2020 (UTC)
Okay, then I'll check InterPro ID (P2926) too next time. I now created Coronavirus S2 glycoprotein family (Q97662399), which has an InterPro ID (P2926) already used on another item too. But when doing this, I noticed a third item with the same InterPro ID (P2926), and I wonder if the values should be unique. Should they? Because you created Spike receptor binding domain, protein family (Q83136543), which has the same InterPro ID (P2926) as Spike receptor binding domain (Q24738749). Can you first let me know whether the same InterPro ID (P2926) value should only be on one Wikidata item? If so, is there a reason why that is not a constraint? Should it not? Let's keep talking and get this right. --15:00, 25 July 2020 (UTC)
You have seen the P31 statements on each of the two? One is a domain, one the associated family. InterPro has this ambiguity that its domain entries actually represent families. So there are two items that link to "InterPro domains". If a constraint is needed, a complex constraint could take this ambiguity into account. --SCIdude (talk) 04:24, 26 July 2020 (UTC)
So, how is that modeled right now? From what you say, I'd expect items with an InterPro ID (P2926) to have two different types (instance of (P31)). That may not be easy to express as a constraint, but it should not be hard to do as ShEx. How about creating two ShEx schemas then, one for InterPro Domain and the other for InterPro Family (or something like that)? --Egon Willighagen (talk) 06:27, 26 July 2020 (UTC)
I have no idea about ShEx but the design sounds good. --SCIdude (talk) 07:32, 26 July 2020 (UTC)
Okay, all four incorrect uses have now been fixed, see https://w.wiki/XsY --Egon Willighagen (talk) 11:29, 26 July 2020 (UTC)
Thanks! --SCIdude (talk) 14:15, 26 July 2020 (UTC)

Hi Egon, I finally updated your EntitySchema:E233 to what is the common case in WD. I'll continue with shapes as I want to have a basic set of shapes that define a molbio slice of WD. --SCIdude (talk) 15:52, 12 December 2020 (UTC)

@SCIdude: great! There is a growing overview here: https://www.wikidata.org/wiki/Wikidata:Database_reports/EntitySchema_directory#chemistry --Egon Willighagen (talk) 16:15, 13 December 2020 (UTC)
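
The uniqueness question in this thread (one InterPro ID (P2926) value may legitimately sit on a domain item and on its associated family item, but probably not on two items of the same type) can be sketched as a small check. This is only an illustration, not the EntitySchema or any existing constraint: the `kind` labels stand in for the instance of (P31) values, and the InterPro ID strings and the item `Q-HYPOTHETICAL` are placeholders, because the thread does not give the actual values.

```python
from collections import defaultdict


def duplicate_uses(items):
    """Group items by (InterPro ID, kind) and return the groups with more than one item.

    `items` is an iterable of (qid, interpro_id, kind) tuples, where `kind`
    is a stand-in for the item's instance of (P31) value.
    """
    by_key = defaultdict(list)
    for qid, interpro_id, kind in items:
        by_key[(interpro_id, kind)].append(qid)
    return {key: qids for key, qids in by_key.items() if len(qids) > 1}


# Placeholder InterPro IDs: the real values are not stated in the thread.
ITEMS = [
    ("Q83136543", "IPR-EXAMPLE-1", "family"),  # Spike receptor binding domain, protein family
    ("Q24738749", "IPR-EXAMPLE-1", "domain"),  # Spike receptor binding domain
    ("Q97662399", "IPR-EXAMPLE-2", "family"),  # Coronavirus S2 glycoprotein family
    ("Q-HYPOTHETICAL", "IPR-EXAMPLE-2", "family"),  # hypothetical second family item
]

if __name__ == "__main__":
    for (interpro_id, kind), qids in duplicate_uses(ITEMS).items():
        print(f"{interpro_id} is used on several {kind} items: {', '.join(qids)}")
```

With this grouping the domain/family pair is accepted, while two family items sharing one ID are reported for review.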

P31/P279 cycle on fatty acyl-CoA (Q33616143)

It looks like fatty acyl-CoA (Q33616143) is a subclass of acyl-CoA (Q2699537) which is an instance of fatty acyl-CoA (Q33616143). Should these two items be merged? --Cdo256 (talk) 17:01, 9 September 2020 (UTC)

No, this needs a fix. Acyl is the residue of any carboxylic acid, of which fatty acids are a subset. You can also see this expressed in ChEBI as "fatty acyl-CoA is a Acyl-CoA". It is true that most acyl-CoAs in biological systems are fatty, and so these are often used as synonyms, but that usage is wrong. --SCIdude (talk) 04:39, 10 September 2020 (UTC)
Done. Thanks for noticing! --SCIdude (talk) 04:55, 10 September 2020 (UTC)
Yes, that circular subclassing sounds like a bug to me too. I agree with the explanation of SCIdude. And thanks for reporting indeed. --Egon Willighagen (talk) 05:31, 10 September 2020 (UTC)
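
The kind of loop reported in this thread can be found mechanically. The following is a minimal sketch, assuming the instance of (P31) and subclass of (P279) statements have already been extracted into a plain edge list; the function name `find_cycle` is made up for this illustration and is not part of any Wikidata tool.

```python
from collections import defaultdict

# The two statements from the report, as (child, property, parent).
EDGES = [
    ("Q33616143", "P279", "Q2699537"),  # fatty acyl-CoA subclass of acyl-CoA
    ("Q2699537", "P31", "Q33616143"),   # acyl-CoA instance of fatty acyl-CoA (the faulty statement)
]


def find_cycle(edges):
    """Return one cycle as a list of QIDs (first == last), or None if there is none."""
    graph = defaultdict(list)
    for child, _prop, parent in edges:
        graph[child].append(parent)

    visiting, done = set(), set()

    def visit(node, path):
        if node in visiting:
            # We came back to a node on the current path: that closes a cycle.
            return path[path.index(node):] + [node]
        if node in done:
            return None
        visiting.add(node)
        path.append(node)
        for parent in graph.get(node, []):
            cycle = visit(parent, path)
            if cycle:
                return cycle
        path.pop()
        visiting.discard(node)
        done.add(node)
        return None

    for start in list(graph):
        cycle = visit(start, [])
        if cycle:
            return cycle
    return None


if __name__ == "__main__":
    print(find_cycle(EDGES))
```

Dropping the faulty P31 edge from `EDGES` makes `find_cycle` return `None`, which corresponds to the fix described above.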

Compounds without defined stereochemistry

Hi, in December 2019 you wrote a script to find all compounds with missing stereochemistry. What is the current status of this? Is it possible to use this script to change instance of (P31) chemical compound (Q11173) to instance of (P31) group of stereoisomers (Q59199015) + subclass of (P279) chemical compound (Q11173) (bot/QS), at least for more than a few hundred, maybe a few thousand items? Wostr (talk) 18:41, 25 September 2020 (UTC)

@Wostr:, thanks for asking. I had the code running on our group's Jenkins server, but the 2019 cyber attack at Maastricht University took it out. While our work was not directly involved, we were affected by the new regulations and are still recovering from that. The scripts are online here: https://github.com/egonw/ons-wikidata/tree/master/Wikidata The output of the scripts so far has always been new statements; I do not currently know how to use QuickStatements to remove statements (only theoretically). It's been a bit busy here, and I am not sure when I will get to it again, but I will try to run the script later this weekend and put the output somewhere (and link here). --Egon Willighagen (talk) 08:38, 26 September 2020 (UTC)
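
For illustration, the retyping asked about above could be expressed as QuickStatements commands roughly as follows. This is a sketch and not the script from the repository linked above: it assumes the tab-separated QuickStatements V1 syntax, in which (as far as the QuickStatements help page describes it) a line prefixed with a minus sign removes the matching statement. The function name `retype_commands` is hypothetical, and the syntax should be checked on the sandbox item before running any batch.

```python
# Wikidata identifiers taken from the question above.
P_INSTANCE_OF = "P31"
P_SUBCLASS_OF = "P279"
Q_CHEMICAL_COMPOUND = "Q11173"
Q_GROUP_OF_STEREOISOMERS = "Q59199015"


def retype_commands(qids):
    """Build tab-separated QuickStatements V1 lines for each item in `qids`.

    Per item: remove P31 chemical compound (assumed "-" prefix syntax),
    then add P31 group of stereoisomers and P279 chemical compound.
    """
    lines = []
    for qid in qids:
        lines.append(f"-{qid}\t{P_INSTANCE_OF}\t{Q_CHEMICAL_COMPOUND}")
        lines.append(f"{qid}\t{P_INSTANCE_OF}\t{Q_GROUP_OF_STEREOISOMERS}")
        lines.append(f"{qid}\t{P_SUBCLASS_OF}\t{Q_CHEMICAL_COMPOUND}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Q4115189 is the Wikidata sandbox item, used here only as a safe example.
    print(retype_commands(["Q4115189"]))
```

The list of QIDs would come from the output of the stereochemistry script; the sketch only formats the commands and does not decide which items qualify.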

We sent you an e-mail

Hello Egon Willighagen/Archive 2020,

Really sorry for the inconvenience. This is a gentle note to request that you check your email. We sent you a message titled "The Community Insights survey is coming!". If you have questions, email surveys@wikimedia.org.

You can see my explanation here.

MediaWiki message delivery (talk) 18:45, 25 September 2020 (UTC)

Items for PubMed articles

Items for PMID < 32240000 are 100% complete and those for PMID < 33260000 are 99% complete. Please do not create any more items. Searching for the PMID in the search box (like this) will find the item.--GZWDer (talk) 17:01, 3 December 2020 (UTC)
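
The target of the "like this" link is not preserved here, so the following is only one possible way to look up an existing item before creating a new one. It assumes that PubMed ID is property P698 and uses the `haswbstatement` search keyword; the function name `pmid_search_url` is made up for this sketch.

```python
from urllib.parse import urlencode

PUBMED_ID_PROPERTY = "P698"  # assumption: the Wikidata property for PubMed ID


def pmid_search_url(pmid):
    """Return a Wikidata search URL that looks for items carrying the given PMID."""
    pmid = str(pmid).strip()
    if not pmid.isdigit():
        raise ValueError(f"not a PMID: {pmid!r}")
    query = f"haswbstatement:{PUBMED_ID_PROPERTY}={pmid}"
    return "https://www.wikidata.org/w/index.php?" + urlencode({"search": query})


if __name__ == "__main__":
    print(pmid_search_url(32240000))
```

If the search returns an item, that item should be reused instead of creating a duplicate.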