Wikidata:Lexicographical data/Malayalam/Queries
This page is an adaptation of WD:Lexicographical data/Ideas of queries for the Malayalam language.
Because Wikidata stores structured data, it can be queried through the Wikidata:SPARQL query service. Below are examples of SPARQL queries related to Malayalam lexicographical data.
Interesting queries
- Words whose senses have images (Property:P18, sampled): result
- Longest Malayalam word added so far: adjectives, words, nouns
- Shortest Malayalam word added so far: adjectives, proverbs, nouns, verbs
- Words containing specified letters: result with ഴ and ങ. Modify query: query
- Number of words by first letter: result
- All words by lexical category in Malayalam: nouns, adjectives, verbs, adverbs, proverbs, proper nouns, letters
- Get all existing lexeme entries in Malayalam: query result
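As an illustration of the kind of query behind the "number of words by first letter" link, here is a minimal sketch. It is not the linked query itself. It assumes the prefixes predefined by the Wikidata Query Service (`dct:`, `wd:`, `wikibase:`), and the variable names are chosen for this example only.

```sparql
# Sketch: count Malayalam lexemes grouped by the first character of the lemma.
SELECT ?firstLetter (COUNT(DISTINCT ?l) AS ?count) WHERE {
  ?l dct:language wd:Q36236 ;      # language: Malayalam
     wikibase:lemma ?lemma .
  # SUBSTR is 1-based; take one character from the start of the lemma.
  BIND(SUBSTR(STR(?lemma), 1, 1) AS ?firstLetter)
}
GROUP BY ?firstLetter
ORDER BY DESC(?count)
```

Note that this takes a single character, not a full written syllable, so a lemma starting with a consonant plus vowel sign is counted under the consonant alone.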
LinguaLibre audio recording
You can use a Wikidata query URL that returns form IDs in the External Tools option of LinguaLibre. User:Lingua Libre Bot will then automatically add the pronunciation audio file to the respective form of the lexeme entry. An illustrated guide for Malayalam: Guidepage
- All lexeme forms of Malayalam language missing pronunciation audio: query
- All lemma forms (headwords) missing pronunciation: query
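A query of this kind might look like the sketch below. It is an illustration, not the linked query: it assumes the Wikidata Query Service predefined prefixes and uses pronunciation audio (P443) on the form as the test for an existing recording. The exact columns LinguaLibre expects are not specified here, so treat the selected variables as an assumption.

```sparql
# Sketch: Malayalam lexeme forms that have no pronunciation audio (P443).
SELECT ?l ?form ?representation WHERE {
  ?l dct:language wd:Q36236 ;              # language: Malayalam
     ontolex:lexicalForm ?form .
  ?form ontolex:representation ?representation .
  # Keep only forms without any pronunciation audio statement.
  FILTER NOT EXISTS { ?form wdt:P443 ?audio . }
}
ORDER BY ?representation
```

The `?form` URI ends in the form ID (of the shape `L…-F…`), which is the identifier the bot needs in order to attach a recording to the right form.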
Statistics
- Get the count of lexemes in Malayalam belonging to different lexical categories: query result
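A count per lexical category can be sketched as follows. This is an illustrative query rather than the linked one, and it assumes the Wikidata Query Service predefined prefixes and its label service for readable category names.

```sparql
# Sketch: number of Malayalam lexemes per lexical category.
SELECT ?category ?categoryLabel (COUNT(?l) AS ?count) WHERE {
  ?l dct:language wd:Q36236 ;              # language: Malayalam
     wikibase:lexicalCategory ?category .
  # Resolve the category item to an English label.
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
GROUP BY ?category ?categoryLabel
ORDER BY DESC(?count)
```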
Maintenance
For Malayalam
- All lexemes without any sense: result
- Nouns with senses missing the item for this sense (P5137) property: result
- All lexemes without any form: result
- Verb and noun lexemes with forms that have empty grammatical features: result
- Find all lexemes where the lemma does not occur in the form representations: result
- Find all lexemes with the same lemma, language and lexical category (candidates for merging or other action): result
- Find lexemes with forms that are missing grammatical features: result
- Find all lexemes with the same spelling: result
- Lexemes with described by source (P1343) on the lexeme itself instead of on the forms and senses the source describes: result
- Forms without pronunciation audio (P443) (affixes excluded): result
- Potential duplicate lexemes with the same title spelling, language and lexical category: result
- Number of forms missing grammatical features (times out): result
- Finding homographic forms of different non-homographic lemmas: result
- Verb lexemes which do not have a simple past tense form: result
- Verb lexemes which do not have simple past and simple future forms: result
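Most maintenance queries in this list follow the same pattern: select Malayalam lexemes, then filter out those that already have the element being checked. As a minimal sketch (not one of the linked queries, and assuming the Wikidata Query Service predefined prefixes), the "lexemes without any sense" check could be written as:

```sparql
# Sketch: Malayalam lexemes that have no sense at all.
SELECT ?l ?lemma WHERE {
  ?l dct:language wd:Q36236 ;      # language: Malayalam
     wikibase:lemma ?lemma .
  # ontolex:sense links a lexeme to each of its senses.
  FILTER NOT EXISTS { ?l ontolex:sense ?sense . }
}
ORDER BY ?lemma
```

Replacing `ontolex:sense ?sense` with `ontolex:lexicalForm ?form` gives the corresponding check for lexemes without any form.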
Query building
You can modify example queries or build new ones to suit your needs. See Wikidata:SPARQL query service/Query Helper for more information.
Malayalam Q-id: Q36236
Missing lexeme forms
Please fill in the missing forms in the lexeme entries.
- 1. Malayalam proper nouns missing an accusative case form: query
The following query uses these:
- Items: Malayalam (Q36236), proper noun (Q147276), accusative case (Q146078)
SELECT DISTINCT ?l ?lemma WHERE {
  ?l a ontolex:LexicalEntry ;
     dct:language wd:Q36236 ;                # Malayalam
     wikibase:lexicalCategory wd:Q147276 ;   # proper noun
     wikibase:lemma ?lemma ;
     ontolex:lexicalForm ?form .
  ?form ontolex:representation ?word .
  # Exclude lexemes that already have a form marked as accusative case.
  MINUS {
    ?l a ontolex:LexicalEntry ;
       ontolex:lexicalForm/wikibase:grammaticalFeature wd:Q146078 .
  }
}
- 2. Malayalam verbs missing a simple future form:
The following query uses these:
- Items: Malayalam (Q36236), verb (Q24905), simple future (Q1475560)
SELECT DISTINCT ?l ?lemma WHERE {
  ?l a ontolex:LexicalEntry ;
     dct:language wd:Q36236 ;                # Malayalam
     wikibase:lexicalCategory wd:Q24905 ;    # verb
     wikibase:lemma ?lemma ;
     ontolex:lexicalForm ?form .
  ?form ontolex:representation ?word .
  # Exclude lexemes that already have a form marked as simple future.
  MINUS {
    ?l a ontolex:LexicalEntry ;
       ontolex:lexicalForm/wikibase:grammaticalFeature wd:Q1475560 .
  }
}
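The two queries above differ only in the lexical category and the grammatical feature. One possible way to make that explicit is to move both into a `VALUES` row, so that adapting the query means editing a single line. The sketch below assumes the Wikidata Query Service predefined prefixes; the variable names `?category`, `?feature` and `?anyForm` are chosen for this example, and `FILTER NOT EXISTS` is used here in place of `MINUS`.

```sparql
# Sketch: Malayalam lexemes of a given category lacking a form with a given feature.
SELECT DISTINCT ?l ?lemma WHERE {
  # Edit this row to check another combination,
  # e.g. (wd:Q147276 wd:Q146078) for proper nouns missing the accusative case.
  VALUES (?category ?feature) { (wd:Q24905 wd:Q1475560) }   # verb, simple future
  ?l dct:language wd:Q36236 ;                # Malayalam
     wikibase:lexicalCategory ?category ;
     wikibase:lemma ?lemma ;
     ontolex:lexicalForm ?anyForm .          # lexeme has at least one form
  FILTER NOT EXISTS {
    ?l ontolex:lexicalForm/wikibase:grammaticalFeature ?feature .
  }
}
ORDER BY ?lemma
```

Like the original queries, this only returns lexemes that already have at least one form; lexemes with no forms at all are covered by the maintenance query for lexemes without any form.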