Wikidata:Lexicographical data/Documentation/Languages/te
This page is a documentation page for Telugu (Q8097) language lexemes in WikiProject Lexicographical data, for assisting contributions to Telugu lexeme content. Telugu is a Dravidian language spoken by more than 80 million people predominantly of the Indian states of Andhra Pradesh and Telangana.
Wikidata:Lexicographical data project aims to provide a CC0 licensed structured lexicographical data for everyone to use for different purposes, including for Wiktionary, the upcoming Abstract Wikipedia and external projects. It aims to store all words of all languages in a structured model under a CC0 (public domain dedication) license.
Layout
editEvery lexeme entry has the following layout:
- Lemma (dictionary form) of the lexeme as title or headword, along with lexeme ID. Language code and lexical category is also given.
- Senses - different meanings of the same word
- Statements for senses. This usually includes image, item for this sense, translation, synonym, antonym, usage example, and more (see list). Note that for translation, antonym, & synonym properties, the lexeme "sense ID" (
LXXXXX-S1
) of the target lexeme has to be copy pasted, not the lexeme ID.
- Statements for senses. This usually includes image, item for this sense, translation, synonym, antonym, usage example, and more (see list). Note that for translation, antonym, & synonym properties, the lexeme "sense ID" (
- Forms - different forms and cases of the lexeme
- Senses - different meanings of the same word
Structure and properties
editCommon properties to be added for lexeme entries are given below:
Lemma
editLemma is the dictionary form (base form) of the word/lexeme. Lemma is to be written in Telugu script.
Statements
edit- grammatical gender (P5185): (masculine (Q499327) / feminine (Q1775415))
- derived from lexeme (P5191)
- usage example (P5831)
- homograph lexeme (P5402)
- combines lexemes (P5238)
Senses
editForms
edit- Grammatical features
- Grammatical gender: masculine (Q499327) / feminine (Q1775415)
- Grammatical number: singular (Q110786) / plural (Q146786)
- pronunciation audio (P443)
- IPA transcription (P898)
- See Telugu grammar (Q7697890) for noun-cases, verb-tenses and other inflections.
From enwiki:
Cases (vibhakti) | Telugu | Usual Suffixes | Transliteration of Suffixes |
---|---|---|---|
nominative case (Q131105) | ప్రథమా విభక్తి (Prathamā Vibhakti) | డు, ము, వు, లు | ḍu, mu, vu, lu |
accusative case (Q146078) | ద్వితీయా విభక్తి (Dvitīyā Vibhakti) | నిన్, నున్, లన్, కూర్చి, గురించి | nin, nun, lan, kūrchi, gurinchi |
instrumental case (Q192997) | తృతీయా విభక్తి (Trutīyā Vibhakti) | చేతన్, చేన్, తోడన్, తోన్ | chētan, chēn, tōḍan, tōn |
dative case (Q145599) | చతుర్థి విభక్తి (Chaturthi Vibhakti) | కొఱకున్, కై | korakun, kai |
ablative case (Q156986) | పంచమీ విభక్తి (Panchamī Vibhakti) | వలనన్, కంటెన్, పట్టి | valanan, kaṇṭen, paṭṭi |
genitive case (Q146233) | షష్ఠీ విభక్తి (Shashthī Vibhakti) | కిన్, కున్, యొక్క, లోన్, లోపలన్ | kin, kun, yokka, lōn, lōpalan |
locative case (Q202142) | సప్తమీ విభక్తి (Saptamī Vibhakti) | అందున్, ఇందున్, నన్ | andun, indun, nan |
vocative case (Q185077) | సంబోధనా ప్రథమా విభక్తి (Sambodhanā Prathamā Vibhakti) | ఓ, ఓయీ, ఓరీ, ఓసీ | ō, ōī, ōrī, ōsī |
Maintenance
edit- Recent Changes to Telugu Lexemes
- Search lexemes:
To do
edit- Complete WD:Wikidata Lexeme Forms/Telugu
- Improve this page
- Add all Telugu words to Lexemes: namespace
Queries
edit- Main page: WD:Lexicographical data/Ideas of queries
- Telugu Q-id:
Q8097
Example queries:
1) Get all existing lexemes in Telugu: query result
The following query uses these:
- Items: Telugu (Q8097)
SELECT ?lexeme ?lemma WHERE { ?lexeme dct:language wd:Q8097; wikibase:lemma ?lemma. }
(press play button on the left side of query service page to execute a query to get its results. Keyboard shortcut ctrl+↵ Enter)
2) Telugu lexemes without senses: https://w.wiki/6gSM
3) Count of lexemes in Telugu belonging to different lexical categories: https://w.wiki/3$vk (query)
4) Query for all Telugu nouns missing a locative case: query
The following query uses these:
- Items: Telugu (Q8097) , noun (Q1084) , locative case (Q202142)
SSELECT DISTINCT ?l ?lemma WHERE { ?l a ontolex:LexicalEntry ; dct:language wd:Q8097; wikibase:lexicalCategory wd:Q1084; wikibase:lemma ?lemma ; ontolex:lexicalForm ?form . ?form ontolex:representation ?word ; minus { {?l a ontolex:LexicalEntry ; ontolex:lexicalForm/wikibase:grammaticalFeature wd:Q202142.} }. }
Resources
edit- Category:Telugu lemmas (Q31161018) (Note that Wiktionaries are cc-by-sa licensed while WD:Lexemes is cc0 licensed.)
- Telugu Wiktionary
Tools
edit