User:Fnielsen/Autolists/Datasets
< User:Fnielsen | Autolists
Dataset used in works.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
WDQS | PetScan | TABernacle | Find images | Recent changes | Query:select DISTINCT ?item where { ?work wdt:P4510 ?item . ?item wdt:P31/wdt:P279* wd:Q1172284 . }
OWL ontology
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Simple Knowledge Organization System | https://www.w3.org/TR/skos-reference/ | |||||||
The Data Cube vocabulary | https://www.w3.org/TR/vocab-data-cube/ | http://purl.org/linked-data/cube |
Wiktionary language edition
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
German Wiktionary | German | https://de.wiktionary.org/ | ||||||
English Wiktionary | 2002-12-12 | English multiple languages |
Creative Commons Attribution-ShareAlike 3.0 Unported | https://en.wiktionary.org/ |
bibliographic database
edit
biological database
edit
chemical database
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
PubChem | English | PubChem in 2021: new data content and improved web interfaces The Bioregistry Nucleic Acids Research (NAR) database |
free content | http://pubchem.ncbi.nlm.nih.gov | ||||
ChEMBL | The ChEMBL database in 2017 The Bioregistry |
Creative Commons Attribution-ShareAlike 3.0 Unported | https://www.ebi.ac.uk/chembl/ http://www.ebi.ac.uk/chembl |
|||||
GNPS | English | Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking | https://gnps.ucsd.edu/ |
clinical trials registry
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
ClinicalTrials.gov | English | http://www.clinicaltrials.gov | ||||||
International Clinical Trials Registry Platform | 2005 | https://www.who.int/ictrp |
data set
edit
database
edit
digital library
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Wikisource | 2003-11-24 | https://wikisource.org/ | ||||||
Project Gutenberg | 1971-07-04 | multiple languages | Unlicense | https://gutenberg.org | ||||
PubMed Central | en:PMC | English | Open Science Thesaurus The varying openness of digital open science tools |
http://www.ncbi.nlm.nih.gov/pmc/ https://www.ncbi.nlm.nih.gov/pmc/ |
||||
Europeana | 2008-11-20 | Dictionary of Common Goods | https://www.europeana.eu | |||||
HathiTrust | 2008 | Free Software Directory | https://www.hathitrust.org/ | https://tapor.ca/tools/1461 https://marketplace.sshopencloud.eu/tool-or-service/VUsxa0 |
free and open-source software
edit
free software
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
World Atlas of Language Structures | en:WALS | 2008 | Creative Commons Attribution 4.0 International | http://wals.info | ||||
Wikibase | GNU General Public License, version 2.0 or later | https://wikiba.se/ |
graph database
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Blazegraph | awesome RDF github page | GNU General Public License, version 2.0 proprietary license |
https://blazegraph.com/ | |||||
Stardog | awesome RDF github page OntoCommons Report D4.3 |
proprietary license | https://www.stardog.com |
image database
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
CBCL Face Database | http://cbcl.mit.edu/software-datasets/FaceData2.html | http://www.ai.mit.edu/courses/6.899/lectures/faces.tar.gz | ||||||
imSitu | Situation Recognition: Visual Semantic Role Labeling for Image Understanding | http://imsitu.org/ | https://s3.amazonaws.com/my89-frame-annotation/public/of500_images.tar |
image dataset
edit
knowledge base
edit
knowledge graph
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Artificial Intelligence Knowledge Graph | AI-KG: An Automatically Generated Knowledge Graph of Artificial Intelligence | |||||||
CaLiGraph | http://caligraph.org/ |
knowledge graph of science
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Microsoft Academic Graph | 2015-06-05 | English | https://www.microsoft.com/en-us/research/project/microsoft-academic-graph/ | |||||
Open Research Knowledge Graph | en:ORKG | http://orkg.org/ |
lexical database
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
WordNet | 1998 | English | WordNet: An Electronic Lexical Database WordNet: a lexical database for English |
BSD licenses | https://wordnet.princeton.edu/ | |||
FrameNet | 1997 | English | FrameNet: Theory and Practice | https://framenet.icsi.berkeley.edu/fndrupal/ | https://framenet.icsi.berkeley.edu/fndrupal/WhatIsFrameNet | |||
VerbNet | English | https://verbs.colorado.edu/verbnet/ | ||||||
NorthEuraLex | Creative Commons Attribution-ShareAlike 4.0 International | http://northeuralex.org/ |
online database
edit
ontology
edit
open-access repository
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
CiteSeer | http://citeseer.ist.psu.edu | |||||||
Figshare | 2011-01-12 | Free Software Directory Open Science Thesaurus The varying openness of digital open science tools Directory of Open Access Preprint Repositories |
https://figshare.com/ | https://tapor.ca/tools/1045 https://marketplace.sshopencloud.eu/tool-or-service/mdEbYT |
open-source software
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Virtuoso Universal Server | awesome RDF github page OntoCommons Report D4.3 |
GNU General Public License, version 2.0 proprietary license |
https://virtuoso.openlinksw.com/ | |||||
Apache Jena Fuseki | Apache Software License 2.0 | https://jena.apache.org/documentation/fuseki2/index.html |
organization
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
World Register of Marine Species | en:WoRMS | 2008 | English | Creative Commons Attribution 4.0 International | https://www.marinespecies.org | |||
Orphanet | 1997 | English French Spanish German Italian Portuguese Dutch Polish |
Representation of rare diseases in health information systems: the Orphanet approach to serve a wide range of end users The Bioregistry |
Creative Commons Attribution-NoDerivs 3.0 Unported | https://orpha.net |
question-answering dataset
edit
semantic network
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
GermaNet | German | http://www.sfs.uni-tuebingen.de/GermaNet/ | ||||||
ConceptNet | https://www.conceptnet.io/ |
software
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
BabelNet | multiple languages | https://babelnet.org/ | https://marketplace.sshopencloud.eu/tool-or-service/CEgdSF | |||||
BridgeDb | Providing gene-to-variant and variant-to-gene database identifier mappings to use with BridgeDb mapping services The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services |
https://www.bridgedb.org/ https://bridgedb.github.io/ |
text corpus
edit
trait database
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
AmphiBIO | ||||||||
TRY | 2007 | TRY - a global database of plant traits | http://www.try-db.org/ |
treebank
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Penn Treebank | English | https://catalog.ldc.upenn.edu/ldc99t42 | ||||||
Hamburg Dependency Treebank | German |
video streaming service
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
YouTube | en:YT | 2005-02-14 | multiple languages | Lentapedia Free Software Directory |
end-user license agreement | https://www.youtube.com/ | ||
PlayStation Now | 2014 | https://www.playstation.com/ps-now |
voice dataset
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Common Voice | 2017-06-19 | multiple languages | Common Voice: A Massively-Multilingual Speech Corpus | Creative Commons CC0 License | https://commonvoice.mozilla.org/ | |||
LibriSpeech | Librispeech: An ASR corpus based on public domain audio books | Creative Commons Attribution 4.0 International | ||||||
VoxPopuli | VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation | https://github.com/facebookresearch/voxpopuli | ||||||
VoxLingua107 | VoxLingua107: a Dataset for Spoken Language Recognition | http://bark.phon.ioc.ee/voxlingua107/ |
website
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
PubMed | en:PM | 1997 | English | Nucleic Acids Research (NAR) database | https://pubmed.ncbi.nlm.nih.gov/ https://pmlegacy.ncbi.nlm.nih.gov |
|||
Science Daily | 1995 | https://www.sciencedaily.com | ||||||
LibraryThing | 2005-08-29 | LibraryThing: A Review | https://librarything.com/ | |||||
DNA Data Bank of Japan | Creative Commons Attribution 2.1 Japan | http://www.ddbj.nig.ac.jp/ | ||||||
Media Cloud | 2009 | https://mediacloud.org | ||||||
Semantic Scholar | Semantic Scholar Free Software Directory |
https://www.semanticscholar.org | ||||||
Nextstrain | https://nextstrain.org/ | |||||||
Dimensions | 2018-01-15 2014 |
English | Free Software Directory The varying openness of digital open science tools |
https://app.dimensions.ai/discover/publication https://www.dimensions.ai |
word analogy dataset
edit
word net
editartikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
plWordNet | 2005 | Polish | BSD licenses | http://plwordnet.pwr.wroc.pl | ||||
DanNet | 2009 | Danish | DanNet: the challenge of compiling a wordnet for Danish by reusing a monolingual dictionary | MIT License Creative Commons Attribution 4.0 International |
http://www.wordnet.dk/ https://cst.ku.dk/projekter/dannet/ |
http://www.wordnet.dk/owl/instance/ | ||
Arabic WordNet | 2006 | Arabic | The Use of Arabic WordNet in Arabic Information Retrieval | |||||
Chinese WordNet | da:CWN | Constructing chinese wordnet: Design principles and implementation | ||||||
MultiWordnet of Portuguese | en:MWN.PT | Portuguese | ||||||
KeNet | Turkish | Constructing a WordNet for Turkish Using Manual and Automatic Annotation | http://haydut.isikun.edu.tr/kenet.html | |||||
odenet | de:odenet | German | https://ikum.mediencampus.h-da.de/projekt/open-de-wordnet-initiative/ |
word similarity dataset
edit
Misc
editEnd of automatically generated list.