Wikidata:WikiProject Informatics/Language Models

HomeAlgorithmsLanguagesStructuresProtocolsSoftwareHardware
Welcome to the language model (Q3621696) section of the WikiProject Informatics

Language Models

This page documents language model (Q3621696).

List of language models

This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!

WDQS | PetScan | TABernacle | Find images | Recent changes | Query: SELECT DISTINCT ?item WHERE { ?item wdt:P31/wdt:P279* wd:Q3621696 . }
number label Inception Website Repository
1 AI Novelist https://ai-novel.com/
2 ALBERT https://github.com/google-research/ALBERT
3 AlexaTM https://github.com/amazon-science/alexa-teacher-models
4 Amazon CodeWhisperer https://aws.amazon.com/codewhisperer/
5 Auto-GPT https://agpt.co[1] https://github.com/Significant-Gravitas/Auto-GPT
6 BART 2020
7 BLOOM 2021 https://bigscience.huggingface.co/
https://huggingface.co/bigscience/bloom
8 Bidirectional Encoder Representations from Transformer 2018 https://arxiv.org/abs/1810.04805[2] https://github.com/google-research/bert
9 BioBERT
10 BloombergGPT
11 CamemBERT 2019[3] https://camembert-model.fr/
12 ChatGPT 2022-11-30[4] https://chat.openai.com/[5]
13 Chinchilla AI
14 Claude https://claude.ai
https://www.anthropic.com/claude
15 Claude
16 Claude 3 Haiku
17 Claude 3 Opus
18 Claude 3 Sonnet
19 DeBERTa https://github.com/microsoft/DeBERTa
20 DeppGPT https://www.der-postillon.com/2023/05/deppgpt.html
21 Devin
22 DistilBERT
23 Dolly https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html[6] https://github.com/databrickslabs/dolly
24 ELECTRA https://github.com/google-research/electra
25 Ferret https://github.com/apple/ml-ferret
26 FinanceGPT 2022-06 https://financegpt.cloud[7]
https://financegpt.uk
https://financegptlabs.com
27 GPT-1 2018-06-11 https://openai.com/blog/language-unsupervised/[8] https://github.com/openai/finetune-transformer-lm
28 GPT-2 2019-02-14 https://openai.com/blog/better-language-models/[9] https://github.com/openai/gpt-2
29 GPT-3 2020-05-28 https://arxiv.org/abs/2005.14165[10] https://github.com/openai/gpt-3
30 GPT-4 2023-03-14 https://openai.com/product/gpt-4
https://openai.com/gpt-4
31 GPT-J https://6b.eleuther.ai/
32 GPT-SW3 https://huggingface.co/AI-Sweden-Models
33 GPT4-Chan https://github.com/yk/gpt-4chan-public
34 Gemini 2023-03-21 https://gemini.google.com/app
35 Gemini 2023-12-06 https://deepmind.google/technologies/gemini/#introduction
36 Gemini 1.5 Pro
37 Gemini Nano
38 Gemini Pro
39 Gemini Ultra
40 Gemma https://ai.google.dev/gemma
41 Gemma 2B 2024-02 https://ai.google.dev/gemma
42 Grok 2023-11-04 https://grok.x.ai/ https://github.com/xai-org/grok-1[11]
43 InstructGPT
44 Jais https://www.arabic-gpt.ai/
45 LLaMA https://llama.meta.com/ https://github.com/facebookresearch/llama
46 LaMDA 2020
47 Llama 1
48 Llama 2
49 MM1
50 Med-PaLM https://sites.research.google/med-palm/
51 Microsoft Copilot https://copilot.microsoft.com
52 NLLB-200 https://ai.facebook.com/research/no-language-left-behind/ https://github.com/facebookresearch/fairseq/tree/nllb
53 NovelAI https://novelai.net/
54 Open Assistant 2023-04-15[12] https://open-assistant.io https://github.com/LAION-AI/Open-Assistant
55 OpenAI Codex
56 PaLM https://ai.google/discover/palm2/
57 PaLM 2
58 PanGu-Σ
59 Pathways Language Model
60 Phi-2 https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/
61 RoBERTa
62 StructBERT
63 T5 2015 https://arxiv.org/abs/1910.10683[13] https://github.com/google-research/text-to-text-transfer-transformer
64 Titan Text Express
65 Vicuña
66 VideoPoet 2023
67 Wu Dao
68 XLM-RoBERTa
69 XLNet https://github.com/zihangdai/xlnet
70 YandexGPT 2023-05-17 https://yandex.ru/project/alice/yagpt
71 code-davinci-002
72 flan-ul2 https://huggingface.co/google/flan-ul2
73 mT5 2021 https://github.com/google-research/multilingual-t5

∑ 73 items.

End of automatically generated list.


Properties

Title ID Data type Description Examples Inverse
instance ofP31Iteminstance of: that class of which this subject is a particular example and member; different from P279 (subclass of); for example: K2 is an instance of mountain; volcano is a subclass of mountain (and an instance of volcanic landform)Bidirectional Encoder Representations from Transformer <instance of> language model-
inceptionP571Point in timedate of establishment: time when an entity begins to exist; for date of official opening use P1619Bidirectional Encoder Representations from Transformer <inception> 2018-
official websiteP856URLofficial website and home page: URL of the official page of an item (current or former). Usage: If a listed URL no longer points to the official website, do not remove it, but see the "Hijacked or dead websites" section of the Talk pageFacebook <official website> https://www.facebook.com-
named afterP138Itemeponym, memorial society and namesakes: entity or event that inspired the subject's name, or namesake (in at least one language). Qualifier "applies to name" (P5168) can be used to indicate which oneCamembert <named after> Camembert-
software version identifierP348Stringsoftware version and version number: numeric or nominal identifier of a version of a software program or file format, current or pastBugzilla <software version identifier> 4.5.1-
developerP178Itemvideo game developer and software developer: organization or person that developed the itemSuper Mario Bros. <developer> Nintendo Entertainment Analysis & Development-
data sizeP3575Quantityfile size, parameter and data size: size of a software, dataset, neural network, or individual fileSly Cooper and the Thievius Raccoonus <data size> 5.15 gigabyte-
source code repository URLP1324URLrepository and source code: public source code repositoryOpenVPN <source code repository URL> https://gitlab.com/openvpn/openvpn-
copyright licenseP275Itemlicense: license under which this copyrighted work is releasedInkscape <copyright license> GNU General Public License, version 2.0-
described by sourceP1343Itemsource of information: work where this item is describedVladimir K. Zworykin <described by source> Brockhaus Enzyklopädie (19 ed.)-
usesP2283Itemuse: item or concept used by the subject or in the operation (see also instrument [P1303] and armament [P520])painter <uses> paintbrush and paintused by
has useP366Itemuse: main use of the subject (includes current and former usage)book <has use> reading-

Datasets and Benchmarks

List of datasets and benchmarks for QA or NLQ

QA=question answering, NLU=natural language understanding

This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!

WDQS | PetScan | TABernacle | Find images | Recent changes | Query: SELECT ?item { VALUES ?type {wd:Q1172284 wd:Q816747} VALUES ?use {wd:Q1078276 wd:Q1074173} ?item wdt:P31 ?type; wdt:P366 ?use. }
number label Website Repository
1 General Language Understanding Evaluation benchmark https://gluebenchmark.com/
2 Situations With Adversarial Generations https://rowanzellers.com/swag/ https://github.com/rowanz/swagaf
3 Stanford Question Answering Dataset https://rajpurkar.github.io/SQuAD-explorer/

∑ 3 items.

End of automatically generated list.

References