Wikidata talk:Lexicographical data/Archive/2019/04

This page is an archive. Please do not modify it. Use the current page, even to continue an old discussion.

Variant spellings

There is a discussion at Wikidata:Project chat#Variant spellings regarding the possibility of adding alternate labels for lemmas/forms/(senses?) with the exact same language code as other labels (that is, without any specific regional or other distinction marker á la "en-x-Q1068863" or something similar, and applying to the general case of a particular language). Mahir256 (talk) 19:00, 4 April 2019 (UTC)

derived from (P5191)

Can anybody build queries with this property that would be a workable alternative for Wikidata:Property_proposal/periphrasis ?

@EncycloPetey, Vive la Rosière: fyi: participants of earlier discussion. --- Jura 13:45, 10 March 2019 (UTC)

@Jura1: what do you want exactly? I dont see the link between derived from lexeme (P5191) and periphrasis... (but I must admit I don't really understand these proposals either).
Anyway, here a request of all lexemes using derived from lexeme (P5191) (with the corresponding lemmata) :
SELECT * WHERE {
  ?l a ontolex:LexicalEntry ; wikibase:lemma ?lemma ; dct:language ?language; wdt:P5191 ?derivedfrom .
  ?derivedfrom wikibase:lemma ?derivedfromlemma ; dct:language ?language.
}
Try it!
If you want something more specific, just tell me what you want (I feel this is a subset of this general query but I don't get what parameter are wished here, I already restricted to lexemes inside one and same language, what else is there to fit you idea?).
Cdlt, VIGNERON (talk) 15:15, 11 March 2019 (UTC)
I don't either, but if the opposing argument is valid, it should be possible. Can you say so in the property creation discussion? --- Jura 08:35, 16 March 2019 (UTC)
I don't understand this proposal (nor the example - but from what I get, this is what this query do, no? - and search engine give no meaningful result for "periphrastic definition") so no, I can't make a comment on something I don't understand (except saying that « I don't understand » which is not really constructive). Cheers, VIGNERON (talk) 15:00, 17 March 2019 (UTC)
@Jura1: I'm not very skilled with SPARQL queries, but given the examples given for the periphrasis property proposal, say for employer (L5512), we can state that it combines lexemes (P5238) employ (L5510) and ‎-er (L29845), and then qualify both with the correct senses using object sense (P5980). This basically accomplishes what the periphrasis property was trying to achieve. Liamjamesperritt (talk) 07:18, 23 April 2019 (UTC)

How to avoid duplicate lexeme?

I am from Tamil wiktionary and getting enthusiasm in this project. Today i created this lexeme first and then linked with the audio file. Then i tested to create another lexeme with the same word. I am surprised. It should not happen for the same word. How can i create lexemes without duplicates? --Info-farmer (talk) 14:31, 23 April 2019 (UTC)

@Info-farmer: Actually it is entirely acceptable for there to be multiple lexemes with the same lemma. Consider the two Russian words cognate with the English 'Venus' (Lexeme:L34836 and Lexeme:L34837): the first of these, referring to the planet, has a different set of inflections than the second of these, referring to the Roman goddess. Mahir256 (talk) 14:47, 23 April 2019 (UTC)
To check quickly what Lexemes with a specific lemma are existing in Wikidata, you can type the prefix L: in the Wikidata search box. For example L:taxi. Lea Lacroix (WMDE) (talk) 15:15, 23 April 2019 (UTC)
@Info-farmer: For English I've found the Wikidata:Wikidata Lexeme Forms tool very useful, it checks for duplicates before adding anything. You would have to work with Lucas to set up template for Tamil. ArthurPSmith (talk) 15:30, 23 April 2019 (UTC)
Return to the project page "Lexicographical data/Archive/2019/04".