Wikidata:Lexicographical data/Documentation/Languages/tr
Subclass of | Western Oghuz |
---|---|
Native label | Türkçe |
Country | Bulgaria, Cyprus, Greece, Turkey, Northern Cyprus |
Based on | Old Anatolian Turkish |
Has tense | present tense, past tense, future tense |
Has grammatical mood | admirative |
Writing system | Turkish alphabet |
Uses capitalization for | proper noun |
Language regulatory body | Turkish Language Association |
UNESCO language status | 1 safe |
Ethnologue language status | 1 National |
Studied in | Turkology |
Described at URL | https://turkic.elegantlexicon.com/lx.php?lx=tur, https://afbo.info/languages/81 |
Related category | Category:Turkish pronunciation |
Entry in abbreviations table | тур., ਤੁ. |
Stack Exchange tag | https://stackoverflow.com/tags/turkish, https://linguistics.stackexchange.com/tags/turkish |
Wikimedia language code | tr |
Turkish (Q256) is the national language of Turkey (Q43) and Northern Cyprus (Q23681).
Turkish (Q256) is a member of the Oghuz (Q494600) of the Turkic (Q34090). Other members include Azerbaijani (Q9292), spoken in Azerbaijan (Q227) and north-west Iran (Q794), Gagauz (Q33457) of Gagauzia (Q164819), Qashqai (Q13192) of south Iran (Q794) and the Turkmen (Q9267) of Turkmenistan (Q874).
Lexical category
editExamples of the most common lexical categories applied to different types of lexemes are shown below:
- noun (Q1084): قادین/kadın (L302751), قاب/kap (L1210047), دولاب/dolap (L1209996)
- verb (Q24905): بولمق/bulmak (L1124172), قوللانمق/kullanmak (L30604), طاڭریلاشطرمق/tanrılaştırmak (L1219795)
- adjective (Q34698): بویوك/büyük (L1171763), فنانسال/finansal (L1217729), اوڭمسز/önemsiz (L1215396)
- adverb (Q380057): چارچابوق/çarçabuk (L1124521), محقق/muhakkak (L1217177), دها/daha (L1159028)
- proper noun (Q147276): غلطه/Galata (L1144401), الله/Allah (L1012756), اورفه/Urfa (L1144173)
Examples of less common lexical categories are shown below:
Lemma
editThe lemma for each lexeme should be the written representation used for it in a dictionary, both in the Latin script (tr) and in the Ottoman script (ota).
- For verbs, this should be the form ending with the infinitive suffix -mak/-mek: اولمق/olmak (L750029), ویرمك/vermek (L1151392)
- For most other parts of speech, this should be the base form of the word: كتاب/kitap (L50461), آتا/ata (L1122673), سرای/saray (L50463)
The spelling of the Ottoman script lemma should be consistent with the spelling used in Turkish sources, such as the İmlâ Lûgati and the Osmanlıca-Türkçe Ansiklopedik Lûgat. For words introduced after the institution of Latin script in 1928, some degree of consistency in terms of how sounds are represented between lexemes should still be ensured. As an example, اوزگورلك/özgürlük (L1132299) was introduced after 1928, but is composed of elements "öz", "gür", and "lük" each of which has consistent Ottoman script representations, so that the eventual Ottoman script form of this lexeme reflects those representations.
Statements
edit- The components of a Turkish lexeme, if it can be split into components with separate lexemes, can be added with combines lexemes (P5238) statements (with series ordinal (P1545) qualifiers to specify their order in the combined lexeme):
- e.g. دوریمجی/devrimci (L729698) → دوریم/devrim (L729694) (with series ordinal (P1545) "1")
- e.g. دوریمجی/devrimci (L729698) → جی/cı (L728512) (with series ordinal (P1545) "2")
- e.g. آیاق قابی/ayakkabı (L1122661) → آیاق/ayak (L1202409), قاب/kap (L1210047), and سی/sı (L1010944) (with series ordinal (P1545) values "1", "2", and "3" respectively)
- Since a large number of Turkish words are derived ultimately from other languages, a derived from lexeme (P5191) statement pointing to a lexeme in another language may be added.
- e.g. كتاب/kitap (L50461) → كِتاب (L2233) (Arabic)
- e.g. هفتە/hafta (L622587) → هفته/ҳафта (L622584) (Persian)
- e.g. آناختار/անախթար/anahtar (L228282) → ανοιχτήρι (L950230) (Modern Greek)
- e.g. آبونە/abone (L1122692) → abonner (L27372) (French)
- e.g. آبلوقە/abluka (L1122691) → blocco (L684558) (Italian)
- e.g. هامبورگر/hamburger (L1036084) → hamburger (L35020) (English)
Identifier properties
editIf a particular lexeme is described in one of the dictionaries below, then a link to its entry in that dictionary may be added with the corresponding properties:
Sourcing properties
edit- If a particular source is in the public domain (typically a dictionary), then senses may have a gloss quote (P8394) statement containing the definition given in that source corresponding to that sense.
- e.g. یارین/yarın (L1011036-S1), with the meaning 'tomorrow', is defined in A Turkish and English Lexicon (Q115959290) as 'To-morrow'.
- e.g. آباژور/abajur (L1122676-S1), with the meaning 'lampshade', is defined in Illustrated Ottoman Dictionary (Q123035545) as 'لمبه سپرى'.
- e.g. روسیە/Rusya (L1216821-S1), with the meaning 'Russia', is defined in Q123476879 as 'شرقی اور و پادە بر دولت پایتختی پتر سبورق'.
- If a particular source is not in the public domain, then a described by source (P1343) statement pointing to an item for that source can be added, with qualifiers to better specify where in that source it is found.
- e.g. آبدست آلمق/abdest almak (L1218476-S1) → Ötüken Türkçe Sözlük (Q6069467) (with qualifier page(s) (P304) "80")
- e.g. سەقتەر/sektör (L1217722) → Q123556294 (with qualifier page(s) (P304) "401")
- e.g. جنرال/general (L1211955) → Vowel and Consonant Disharmony in Turkish (Q123195485) (with qualifiers section, verse, paragraph, or clause (P958) "3.1 (12) b." and page(s) (P304) "222")
- An example of a lexeme being used in a sentence may be added using usage example (P5831). They are particularly useful when none of the identifier properties can be added to the lexeme, and when the lexeme cannot be sourced in an existing dictionary, but when there is nevertheless evidence of a word's use available. This example should always be sourced to an external text!
- e.g. تلفون قلیفی/telefon kılıfı (L1122538) provides as a usage example an article about Serenay Sarıkaya (Q6638863) using that phrase.
- e.g. طاڭریلاشطریلمق/tanrılaştırılmak (L1122550) provides as a usage example an opinion piece published in Yeniçağ (Q3572167).
- e.g. عقرب/akrep (L485516) provides as a usage example a line from Rumi's Mesnevi.
- (https://luthor.toolforge.org may be useful for adding examples found in old public-domain texts.)
Forms
editBecause Turkish is a highly regular agglutinating language, it is generally enough for a single form to be present on most lexemes (that is, without any inflectional suffixes whatsoever):
- For nouns, adjectives, and adverbs, this form should be the word itself: گڭل/genel (L1217570-F1), دها/daha (L1159028-F1), سرای/saray (L50463-F1)
- For verbs, this form should be the base of the word without the infinitive suffix -mak/-mek: اول/ol (L750029-F2), ویر/ver (L1151392-F1)
A relatively small number of commonly used adjectives have pekiştirme (“intensive”) forms with reduplicated prefixes which are not entirely predictable. These forms can be marked with the feature intensive (Q6029894) and the non-intensive forms can be distinguished as positive (Q3482678). Some adjectives have more than one pekiştirme form, as seen on یشیل/yeşil (L1122532) for example. A list of pekiştirme adjective forms can be found in On Intensities Done with m, p, r, s Consonants and Their Rules (Q123709158).
Where a derived lexeme requires a particular inflectional suffix, this can be added as a separate 'combines' statement using the appropriate affix lexeme as the value (see "Common affixes" below).
There are some cases for which additional forms should be used to record word stem alternations occuring before suffixes. These can be distinguished with the features surface representation (Q124100534), the form the word stem takes without suffixes, and lexical representation (Q124100536), the form the word stem takes with suffixes beginning with a vowel. The conditions which determine the form used can be more specific than that, and can be specified using appears before phonological feature (P11950) statements on the forms. The examples here are illustrative:
- A noun whose final consonant(s) 'p', 't', 'nk', 'ç' become respectively 'b', 'd', 'ng', 'c' when followed by a vowel (often, but not always, indicating that the word from which the noun is derived ends with a voiced consonant) should have a separate form reflecting this change: كتاب/kitap (L50461-F1) vs. كتاب/kitab (L50461-F2).
- Given that the 'Vk' → 'Vğ' change (for some vowel V) is regular in Turkish[1], separate forms are not needed in these cases. Exceptions occur where a final k is palatal before a vowel-initial suffix, in which case an additional form in which the preceding vowel is marked with a circumflex accent can be used.
- A noun whose final consonant is doubled when followed by a vowel (usually indicating that the word from which the noun is derived ends with a lengthened consonant) should have a separate form reflecting this change: صیرت/sırt (L1216881-F1) vs. صیرت/sırtt (L1216881-F2)
- A noun whose final high vowel disappears when the noun is followed by a vowel (usually indicating that this final high vowel was introduced to break up a final consonant cluster in the word from which the noun is derived) should have a separate form reflecting this change: بورون/burun (L1217101-F1) vs. بورن/burn (L1217101-F2)
Senses
editGlosses
edit- In general, the Turkish gloss can be as detailed as necessary, but the gloss in languages other than Turkish should be more simplified (with just enough extra information to disambiguate):
- e.g. for كلمە/kelime (L1072661-S1), it is enough to provide the English gloss as 'word', but also to provide more detail in Turkish to explain what a 'kelime' is.
- e.g. for بالق/balık (L1217117-S1), it is enough to provide the English gloss as 'fish', without needing to explain what a fish is in English.
- e.g. for فیل/fil (L1122742-S2), however, since 'bishop' itself has multiple possible meanings in English, the extra information '(chess piece)' is needed in the English gloss for disambiguation.
Statements
edit- If a sense on a noun (or sometimes an adjective) corresponds to the concept represented by a Wikidata item, a item for this sense (P5137) statement may be added to the sense:
- If a sense on a verb represents an action/occurrence/event represented by a Wikidata item, a predicate for (P9970) statement may be added to the sense:
- Sometimes a sense in Turkish may not correspond to any particular Wikidata item, but there may be another sense in another language that does correspond to one in Turkish. For this translation (P5972) can be used:
- If two senses in Turkish correspond with each other and no Wikidata item may be linked to them using the properties mentioned above, then synonym (P5973) can be used to connect each other:
- If two senses are opposites of each other, antonym (P5974) could be added to them:
- If one sense is a more specific version of another sense, hyperonym (P6593) could be added to the first sense pointing to the other:
- If a sense is used in a specific place or region, location of sense usage (P6084) could be added:
- If a particular sense describes a person/animal of a particular gender, semantic gender (P10339) could be added:
- e.g. لاچین/laçın (L1159045-S1) → female organism (Q43445) (the sense 'peregrine falcon' on the lexeme for 'doğan', however, would not have a statement like this)
- If the use of a particular sense is restricted to certain styles of speaking or writing, language style (P6191) may be used to specify those:
Common affixes
editThe following is a list of affixes that may frequently be needed when annotating multi-part lexemes:
Nominal affixes
edit- یی/եը/yı (L1011033) (definite (Q53997851) + accusative case (Q146078))
- ە/ա/a (L1011040) (dative case (Q145599))
- دە/տա/da (L1149451) (locative case (Q202142))
- دن/տան/dan (L1149452) (ablative case (Q156986))
- نڭ/նընկ/nın (L1011017) (genitive case (Q146233))
- لر/լար/lar (L941749) (plural (Q146786))
- ایلە/ile (L1133345) (instrumental case (Q192997))
- م/ım (L1217034) (first person singular possessive (Q56703567))
- ڭ/ın (L1217035) (second person singular possessive (Q56703570))
- سی/sı (L1010944) (third person singular possessive (Q56703572))
- مز/mız (L1217036) (first person plural possessive (Q56703579))
- ڭز/nız (L1217037) (second person plural possessive (Q56703580))
- لری/ları (L1217038) (third person plural possessive (Q56703584))
Verbal affixes
edit- مە/ma (L30601) (negation (Q1478451))
- ار/ar (L1151394) (geniş zaman (Q6093075) ~ aorist (Q216497))
- دی/di (L941731) (perfective)
- سه/sa (L1158886) (conditional)
- ە/a (L1232561) (optative)
- مش/mış (L941739) (perfective/evidential)
- در/dır (L1235210) (inferential copula)
situation | person | suffix to use | example with باقمق/bakmak (L1037217) |
example with ویرمك/vermek (L1151392) |
example with بولمق/bulmak (L1124172) |
example with گورمك/görmek (L1151536) |
---|---|---|---|---|---|---|
after دی/di (L941731), سه/sa (L1158886) |
بن/ben (L294421) | م/m (L1232527) | baktım baksam |
verdim versem |
buldum bulsam |
gördüm görsem |
سن/sen (L294422) | ڭ/n (L1232525) | baktın baksan |
verdin versen |
buldun bulsan |
gördün görsen | |
او/o (L294423) | (none) | baktı baksa |
verdi verse |
buldu bulsa |
gördü görse | |
بز/biz (L294424) | ك/k (L1232526) | baktık baksak |
verdik versek |
bulduk bulsak |
gördük görsek | |
سز/siz (L294425) | ڭز/nız (L1151717) | baktınız baksanız |
verdiniz verseniz |
buldunız bulsanız |
gördüniz görseniz | |
onlar | (none) | baktı baksa |
verdi verse |
buldu bulsa |
gördü görse | |
after other suffixes |
بن/ben (L294421) | م/ım (L1151562) | bakmışım bakar mıyım? |
vermişim verir miyim? |
bulmuşum bulur muyum? |
görmüşüm görür müyüm? |
سن/sen (L294422) | سین/sın (L1232522) | bakmışsın bakar mısın? |
vermişsin verir misin? |
bulmuşsun bulur musun? |
görmüşsün görür müsün? | |
او/o (L294423) | (none) | bakmış bakar mı? |
vermiş verir mi? |
bulmuş bulur mu? |
görmüş görür mü? | |
بز/biz (L294424) | ز/ız (L1232521) | bakmışız bakar mıyız? |
vermişiz verir miyiz? |
bulmuşuz bulur muyuz? |
görmüşüz görür müyüz? | |
سز/siz (L294425) | سڭز/sınız (L1232523) | bakmışsınız bakar mısınız? |
vermişsiniz verir misiniz? |
bulmuşsunuz bulur musunuz? |
görmüşsünüz görür müsünüz? | |
onlar | (none) | bakmış bakar mı? |
vermiş verir mi? |
bulmuş bulur mu? |
görmüş görür mü? | |
with respect to ە/a (L1232561) |
بن/ben (L294421) | یم/yım (L1202947) (after that suffix) |
bakayım | vereyim | bulayım | göreyim |
سن/sen (L294422) | سین/sın (L1232522) (after that suffix) |
bakasın | veresin | bulasın | göresin | |
او/o (L294423) | سین/sın (L1202954) (replacing that suffix) |
baksın | versin | bulsun | görsün | |
بز/biz (L294424) | لم/lım (L1232530) (after that suffix) |
bakalım | verelim | bulalım | görelim | |
سز/siz (L294425) | سڭز/sınız (L1232523) (after that suffix) |
bakasınız | veresiniz | bulasınız | göresiniz | |
onlar | سینلر/sınlar (L1232518) (replacing that suffix) |
baksınlar | versinler | bulsunlar | görsünler | |
in imperative situations |
سن/sen (L294422) | (none) | bak | ver | bul | gör |
او/o (L294423) | سین/sın (L1202954) | baksın | versin | bulsun | görsün | |
سز/siz (L294425) | ن/ın (L1232517) or ڭز/ınız (L1232516) |
bakın bakınız |
verin veriniz |
bulun bulunuz |
görün görünüz | |
onlar | سینلر/sınlar (L1232518) | baksınlar | versinler | bulsunlar | görsünler |
Queries
edit- Lexemes in Turkish without senses (make sure there are never any results returned by this query!!!)
- Lexemes in Turkish without Ottoman lemmata
Resources
editDictionaries (quotable)
editBecause most of these were written prior to the introduction of the Latin script, they are more useful as sources of Ottoman spellings than as sources of word meanings used in the present day.
- Illustrated Ottoman Dictionary (Q123035545)
- A Turkish and English Lexicon (Q115959290)
- قاموس تورکی(6053167)
- İmlâ Lûgati (Q122228534) (spellings only)
A number of entries from public domain Ottoman dictionaries are indexed at the site Lugat Osmanlıca. Not all of these dictionaries have items yet. Those which have been cited on Wikidata are listed below; if you create a new item for an additional source please add it to this list:
- لهجۀ عثمانی(Q123476879)
- یڭى توركجە لغات(Q123556294)
Dictionaries (citable)
editThese dictionaries are most likely not in the public domain, so that quoting from them frequently is not advisable. They may however be cited using described by source (P1343) statements or stated in (P248) in references as appropriate.
Grammars
editMost suffix lexemes have references to Turkish: A Comprehensive Grammar (Q116446388) on them, though if there is a grammar of Turkish written in Turkish that you find more useful, you may also add it as a reference.
Contact
editSee also
editReferences
edit- ↑ Turkish: A Comprehensive Grammar (Q116446388) page 14 (bullet (v))