Wikidata talk:Lexicographical data/Archive/2020/07

This page is an archive. Please do not modify it. Use the current page, even to continue an old discussion.

Addition of a new lexeme and translation

Please check the lexemes Lexeme:L1428 and Lexeme:L7. I would like to know whether the senses mentioned in 'cat' be added manually to 'പൂച്ച' also. I am a newbie to Lexemes. Secondly, when adding translations, why can't I see an existing lexeme? For eg: if I try to add 'cat' in 'പൂച്ച', it shows that 'No match was found. Create a new item...'. Adithyak1997 (talk) 17:15, 11 June 2020 (UTC)

@Adithyak1997: it looks mostly good.
For പൂച്ച (L1428), I would add the singular form and I would shorten the gloss (I already removed the unnecessary precision in parentheses but I would shorten it further). Except for that, it is ok.
For the new lexeme not showing, maybe it was just a delay, does it work now or is there still a problem?
Cdlt, VIGNERON (talk) 08:54, 2 July 2020 (UTC)
@VIGNERON: My actual problem is: whether I need to add English, French, Malay, etc. language senses to Lexeme:L1428 or not? Adithyak1997 (talk) 09:25, 2 July 2020 (UTC)
@Adithyak1997: I guess you mean "glosses" and not "senses". And on that point, there is no strong consensus. You could add them but it's not mandatory. My personal opinion is that this is not useful if there is already a item for this sense (P5137), then you can automatically retrieve the description as a good approximation of the gloss. Cheers, VIGNERON (talk) 11:40, 2 July 2020 (UTC)

Lexical masks now in JSON

We have released lexical masks as ShEx files before, schemata for lexicographic forms that can be used to validate whether the data is complete.

We saw that it was quite challenging to turn these ShEx files into forms for entering the data, such as Lucas Werkmeister’s Lexeme Forms. So we adapted our approach slightly to publish JSON files that keep the structures in an easier to parse and understand format, and to also provide a script that translates these JSON files into ShEx Entity Schemas.

Furthermore, we published more masks for more languages and parts of speech than before.

Full documentation can be found in Wikidata:Lexical Masks.

Background can be found in the paper.

Thanks Bruno, Saran, and Daniel for your great work! --Denny (talk) 20:59, 22 June 2020 (UTC)

Now also for Hebrew and Basque. --Denny (talk) 18:46, 1 July 2020 (UTC)
Return to the project page "Lexicographical data/Archive/2020/07".