User:ProteinBoxBot/Protein family bot
Objective edit
This bot function should add and update Wikidata items for protein family (Q417841), protein domain (Q898273), active site (Q423026), binding site (Q616005), supersecondary structure (Q7644128), post-translational protein modification (Q898362), structural motif (Q3273544) items, and create links between proteins and these items.
Introduction edit
This bot is part of a family of bots to capture and maintain Genes, Diseases and Drugs in Wikidata. This builds upon the ongoing work of incorporating all genes and proteins into wikidata. Adding protein family information would allow several new use cases and would allow linking classes of proteins together across species and querying proteins by function.
Properties edit
On items
Property | Datatype | Explanation |
---|---|---|
subclass of (P279) | item | hierarchy |
instance of (P31) | item | type of item (protein family, etc) |
InterPro ID (P2926) | external-id |
On proteins
Property | Datatype | Explanation |
---|---|---|
subclass of (P279) | item | member of protein family |
has part(s) (P527) | item | contains a ... |
Data sources edit
Output edit
Counts of number of proteins grouped by taxon of proteins that are subclass of a protein family link
Counts of interpro items by type link