Wikidata:Lexicographical data/Documentation/Languages/en
English
Subclass of | Anglic |
---|---|
Native label | English |
IPA transcription | ˈeɪŋɡlɪʃ, ˈɪŋɡlɪʃ |
Named after | England |
Indigenous to | England |
Influenced by | French, Vulgar Latin, Germanic, Greek, Old Danish |
Linguistic typology | subject–verb–object, nominative–accusative language, stress-timed language, place–manner–time, fusional language |
Has grammatical case | genitive case, nominative case, oblique case |
Has grammatical mood | indicative, subjunctive, interrogative |
Has grammatical gender | masculine, feminine, neuter |
Writing system | Latin script, English orthography |
Language regulatory body | no value |
Signed form | manually coded English |
UNESCO language status | 1 safe |
Ethnologue language status | 1 National |
Studied in | English studies |
Described at URL | https://afbo.info/languages/64 |
History of topic | history of English |
Related category | Category:English pronunciation |
Entry in abbreviations table | англ. |
PyPI trove classifier | Natural Language :: English |
Stack Exchange site URL | https://english.stackexchange.com |
Stack Exchange tag | https://linguistics.stackexchange.com/tags/english |
Wikimedia language code | en |
Opposite of | non-English |
Language code
- en is the main code used
- en-gb for British spellings and en-us for American spellings may be used where they differ
- en-ca is not generally used, since Canadian spellings follow either en-gb or en
- en-in is only available for monolingual text, so it cannot be used for lexemes
- en-au and en-nz are not used because they are not (yet) supported
- Additional variations may be added by using en-x-QID, replacing the QID with that of a particular variety or convention. Due to the breadth of English varieties, these will likely be necessary should one have reason to document a South African English or Philippine English representation, for example.
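The code choices above can be sketched as a small helper. This is only an illustration under stated assumptions: the function names are hypothetical, en-ca is treated here as unusable although the list only says it is not generally used, and Q12345 is a placeholder rather than the item of any real variety.

```python
import re

# Codes the list above describes as usable for English lexemes.
USABLE_CODES = {"en", "en-gb", "en-us"}

# A QID is "Q" followed by a positive integer.
QID_PATTERN = re.compile(r"Q[1-9][0-9]*")


def private_use_code(variety_qid: str) -> str:
    """Build an en-x-QID code for a variety that has no code of its own."""
    if not QID_PATTERN.fullmatch(variety_qid):
        raise ValueError(f"not a QID: {variety_qid!r}")
    return f"en-x-{variety_qid}"


def is_usable_for_lexemes(code: str) -> bool:
    """Return True for en, en-gb, en-us, or a well-formed en-x-QID code."""
    if code in USABLE_CODES:
        return True
    prefix = "en-x-"
    return code.startswith(prefix) and bool(QID_PATTERN.fullmatch(code[len(prefix):]))


# Q12345 is a placeholder QID used purely for illustration.
print(private_use_code("Q12345"))
print(is_usable_for_lexemes("en-in"))
```

A check like this could run before a batch upload, so that representations tagged en-in, en-au or en-nz are caught early.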
Lexical categories
Categories for individual words
- noun (Q1084)
- verb (Q24905)
- adjective (Q34698)
- adverb (Q380057)
- proper noun (Q147276)
- interjection (Q83034)
- conjunction (Q36484)
- preposition (Q4833830)
- pronoun (Q36224)
- grammatical particle (Q184943)
- numeral (Q63116)
- determiner (Q576271)
Categories for word parts
Categories for groups of words (in addition to individual-word categories)
Lemma
For verbs, use the bare infinitive, without 'to': for example, 'run', not 'to run'.
Lemmas should always be lower case except for proper nouns, proper adjectives, demonyms, or adjectives formed from proper nouns.
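The two lemma rules above can be sketched as a normalisation step. The function name, the category strings and the keep_capitals flag are assumptions made for this example. The flag exists because code cannot reliably tell whether an adjective or demonym derives from a proper noun, so the caller has to decide.

```python
def normalize_lemma(text: str, lexical_category: str, keep_capitals: bool = False) -> str:
    """Apply the lemma conventions: no leading 'to' on verbs, lower case by default.

    Set keep_capitals=True for proper adjectives, demonyms and other
    adjectives formed from proper nouns; proper nouns are never lowercased.
    """
    lemma = text.strip()
    # Drop the infinitive marker: 'to run' becomes 'run'.
    if lexical_category == "verb" and lemma.lower().startswith("to "):
        lemma = lemma[3:].lstrip()
    # Lowercase everything that is not exempt from the lower-case rule.
    if lexical_category != "proper noun" and not keep_capitals:
        lemma = lemma.lower()
    return lemma


print(normalize_lemma("to run", "verb"))
print(normalize_lemma("Victorian", "adjective", keep_capitals=True))
```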
Statements
Identifiers
These external identifier properties are currently available for use on English lexemes, and may be helpful as references for data on a lexeme entity. Currently, most English lexemes lack any identifier statements, so it is a good idea to add any of these where applicable.
- Merriam-Webster online dictionary entry (P11130) - Comprehensive monolingual English dictionary with part of speech indications and glosses. ID values are typically identical to the lemma. A specific entry on a shared page can be linked by appending #dictionary-entry-1 to the ID, replacing the number with that of the corresponding section.
- Collins Online English Dictionary entry (P11230) - Monolingual English dictionary. ID values are typically lowercase forms of the lemma, with spaces and punctuation replaced with hyphens.
- Dictionary.com entry (P11228) - Monolingual English dictionary. ID values are typically lowercase forms of the lemma, with spaces and punctuation replaced with hyphens. In some cases additional hyphens are present in order to distinguish entries.
- Oxford English Dictionary entry ID (pre-July 2023) (P5275) - Monolingual English dictionary. Partially paywalled. ID values are typically numerical strings.
- Green's Dictionary of Slang ID (P11481) - Monolingual English slang dictionary. ID values are alphanumeric strings.
- Sri Granth word ID (P7575) - Bilingual English-Punjabi and Punjabi-English dictionary. Includes part of speech indications and glosses in Punjabi Gurmukhi for English lexemes. ID values are typically identical to the lemma.
- Oqaasileriffik online dictionary ID (P5912) - Trilingual dictionary of Greenlandic, Danish, and English. ID values are numeric.
- DiACL lexeme ID (P11055) - Cross-linguistic database which may be helpful in tracing the etymologies of English lexemes. Limited to some of the most frequently used lexemes. Seems to be in the early stages of development; entries may not be particularly detailed. ID values are typically numerical strings. Offline as of 2023-12-01.
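Several of the descriptions above give a typical shape for ID values, which makes it possible to guess a candidate ID from a lemma. The sketch below does this for the hyphenated style described for Collins and Dictionary.com and for the Merriam-Webster section suffix. The function names are hypothetical, trimming leading and trailing hyphens is an assumption, and since the descriptions say "typically", any candidate should be checked against the dictionary site before it is added as a statement.

```python
import re
from typing import Optional


def hyphenated_id(lemma: str) -> str:
    """Lowercase the lemma and replace runs of spaces or punctuation with a hyphen."""
    slug = re.sub(r"[\W_]+", "-", lemma.lower())
    # Assumption: IDs do not begin or end with a hyphen.
    return slug.strip("-")


def merriam_webster_id(lemma: str, entry_number: Optional[int] = None) -> str:
    """Use the lemma as the ID, optionally pointing at one entry on a shared page."""
    if entry_number is None:
        return lemma
    return f"{lemma}#dictionary-entry-{entry_number}"


print(hyphenated_id("ice cream"))
print(merriam_webster_id("book", 2))
```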
Forms
Noun grammatical features
- Either singular (Q110786) or plural (Q146786)
- Possessive ('s) forms for nouns should also use English possessive (Q1861696)
Verb grammatical features
- One of simple present (Q3910936), simple past (Q1392475), present participle (Q10345583) or past participle in English (Q1230649)
- For inflections that differ by verb subject (generally only the 3rd person singular), also include items to indicate number and person
Adjective grammatical features
- One of positive (Q3482678), comparative (Q14169499) or superlative (Q1817208) for comparable adjectives; just use positive (Q3482678) for others.
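The feature conventions for nouns, verbs and adjectives can be expressed as a simple consistency check. This is a minimal sketch: the function name and category strings are assumptions, the QIDs are the ones listed above, and extra features such as the possessive item or person and number items are allowed without being validated.

```python
# QIDs copied from the feature lists above.
NUMBER = {"Q110786", "Q146786"}  # singular, plural
VERB_FORMS = {"Q3910936", "Q1392475", "Q10345583", "Q1230649"}
DEGREES = {"Q3482678", "Q14169499", "Q1817208"}  # positive, comparative, superlative

# Each category requires exactly one feature from its set.
REQUIRED_ONE_OF = {
    "noun": (NUMBER, "singular or plural"),
    "verb": (VERB_FORMS, "simple present, simple past, present participle or past participle"),
    "adjective": (DEGREES, "positive, comparative or superlative"),
}


def check_features(lexical_category: str, features: set) -> list:
    """Return a list of problems found; an empty list means the form looks consistent."""
    rule = REQUIRED_ONE_OF.get(lexical_category)
    if rule is None:
        return []  # no convention documented for this category
    allowed, description = rule
    matches = features & allowed
    if len(matches) != 1:
        return [f"{lexical_category} form needs exactly one of {description}, found {len(matches)}"]
    return []


# A plural possessive noun form: plural (Q146786) plus English possessive (Q1861696).
print(check_features("noun", {"Q146786", "Q1861696"}))
# An adjective form with no degree set is flagged.
print(check_features("adjective", set()))
```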
Senses
Example lexeme entries
Some example English lexeme entries are given below. These may be used as models to improve other entries.
- book (L536) (noun)
- book (L16168) (verb)
- like (L45034) (preposition)
- under (L3438) (preposition)