Wikidata talk:Lexicographical data/Archive/2021/04

This page is an archive. Please do not modify it. Use the current page, even to continue an old discussion.

30 Lexic-o-days starting today

Hello all,

As a reminder, the month dedicated to Lexicographical Data is starting today! On this page, you can find plenty of sessions and discussions taking place in the next few weeks. There will be for example live editing and querying sessions, a presentation of Lingua Libre, a Q&A about Abstract Wikipedia, and plenty of open discussions. There's also a Phabricator board where we will track tasks, for example on improving the documentation.

The opening session will take place today at 15:00 GMT/UTC on Jitsi (the first part will be recorded).

We're looking forward to your participation! Lea Lacroix (WMDE) (talk) 09:48, 15 March 2021 (UTC)

Please remember next time to put an announcement on Wiktionaries as well, someone just mentioned this event on wikt:WT:BP and 14 days of sessions have already passed, many of them unrecorded. Thanks! – Jberkel (talk) 09:22, 1 April 2021 (UTC)

30 Lexic-o-days, next steps and outcomes

Hello all,

Thanks to everyone who participated in the first 3 weeks of 30 Lexic-o-days! We still have a few days ahead of us with plenty of exciting sessions, such as the Climate Lexeme Week, another Abstract Wikipedia Q&A, or a discussion about text corpora.

If you still have ideas of sessions or discussions, feel free to schedule them directly in the calendar! You can pick the day and time that work best for you, and feel free to contact me if you need any help with scheduling, preparing or running your session.

If you have been working on Lexeme-related things during the past few weeks, for example contributing to the content, improving documentation or tools, please add a quick summary on the outcomes page. This will help us a lot with evaluating the success of this event. You will also have the opportunity to present your work during the showcase on April 14th (more info coming soon).

Thanks a lot, Lea Lacroix (WMDE) (talk) 14:50, 1 April 2021 (UTC)

Involve Little Languages

Hello,

I think about how it is possible to involve little languages and the people who speak this languages in creating Lexemes in their language. As far as I understand Lexemes are a thing what is needed for abstract Wikipedia and at the moment there are little languages without lexemes or with a little amount of them. What do you think how can be people introduced into Lexemes in Wikidata. --Hogü-456 (talk) 20:59, 12 April 2021 (UTC)

@Hogü-456: I am editing lexemes for Slovak, which has 5M native speakers. IMO, editing lexemes directly in Wikidata is unproductive. Solid tools, likely utilizing machine learning, are needed for productive lexeme editing and later maintenance. Some tools exist, but the overall tooling is not there yet. I am making fairly fast progress only because I have developed one such tool myself. I took shortcuts by specializing it for Slovak language, so it's not yet useful for other languages. Another possible route is Wiktionary imports, but that also depends on tooling, mostly Wiktionary parsers. — Robert Važan (talk) 23:21, 13 April 2021 (UTC)
@Robert Važan: thank you for the answer. With Wikidata:Text corpora to lexicographical data there is a page what was created during the 30-lexico-days and there was a meeting about how words can be extracted out of texts. If you are interested you can participate at this project. Maybe there are possibilites to reuse some of the things you created so that it can help smaller languages. For the EU the laws are translated into the major languages of the member states. So especially for them it should be possible to create the lexemes. For other languages I dont know what for sources exist. --Hogü-456 (talk) 14:51, 14 April 2021 (UTC)

wbeditentity now supports editing statements on Senses

Hello all,

This announcement is relevant for people using the APIs to edit Lexemes, for example building tools on top of Wikidata.

We’re happy to let you know that a feature that was missing for a long time, the ability to edit statements on Senses from wbeditentity, has now been added. This means that adding or editing statements on Senses given a Lexeme ID or a Sense ID is possible, as well as creating new Senses with statements.

You can now use the 'claims' property within Sense objects in the JSON data passed to wbeditentity when editing or adding Senses, just like you would use it for other Entity types.

We hope that this improvement will allow the creation of more editing tools for Lexemes, as well as the support of Lexemes in existing tools.

If you encounter any issue or have questions, feel free to contact me or to add a comment on the Phabricator task. Cheers, Lea Lacroix (WMDE) (talk) 14:14, 26 April 2021 (UTC)

Return to the project page "Lexicographical data/Archive/2021/04".