Wikidata:Events/Lexicodays 2024

Welcome!: Lexicodays 2024 was an online event designed to offer the Wikidata community a space to discuss Lexicographical Data. Over 3 days of presentations, discussions and workshops, we explored how words are described and modelled in Wikidata, discovered useful tools for editing lexicographical data, and improved modelling in various languages, with a focus on the languages of Indonesia. The event took place online across different time zones.

Participation: If you attended the live event, feel free to add your username to the Participants page.

Code of conduct: Please note that the entire initiative, including the presentations and discussions on various channels, is covered by the Code of Conduct for technical spaces. Please help make the discussion spaces safe and welcoming for everyone.

Feedback form: In order to learn about your experience during the event and improve our future events, we need your feedback! Please fill out the participants survey in English or in Indonesian before July 7th.

Documentation: The event is mostly documented through videos and slides. You will find the pre-recorded videos and replays of most sessions in this Commons category (with collaborative captions). The speakers' slides will be added to this category. Please be patient, as the files will be uploaded during the days following the event.

Discussions and help: If you have questions or suggestions about the event, feel free to use the talk page. Announcements about the event are taking place on the same talk page, and on the Lexicographical Data Telegram group. If you want to reach out to the organizers directly, feel free to contact Lea Lacroix (WMDE) (lea.lacroix_extwikimedia.de or @Auregann on Telegram).

Sessions

Day 1 - Introduction to Lexicographical Data

Sessions are taking place on Zoom or on Jitsi. Find the link next to each presentation in the program below.


Session details	Title, description and speakers
June 28th at 12:00 UTC 25 minutes 🌐 English and Indonesian 🔗 Zoom	Opening session Opening Lexicodays 2024 (bilingual EN and ID) - Slides Presentation of the Lexicodays 2024 program and how to participate. id: Paparan mengenai acara Lexicodays 2024 dan cara bergabung. Speakers: Lea Lacroix (WMDE), Raisha (WSC)
June 28th at 12:30 UTC 50 minutes 🌐 Indonesian 🔗 Zoom	Introduction to Lexicographical data (Indonesian) Introduction to Lexicographical data (Indonesian) - Slides Introduction to Lexicographical Data (Indonesian) (Lexicodays 2024) During this workshop, we will learn everything we need to know about Lexicographical Data, how we describe words on Wikidata, and how we can contribute. This entry-level session is open to everyone with a basic understanding of Wikidata. id: Dalam lokakarya ini, kita akan mempelajari hal-hal yang perlu kita ketahui tentang Data Leksikografis, bagaimana cara mendeskripsikan kata-kata di Wikidata, serta cara berkontribusi. Sesi ini terbuka untuk semua orang yang memiliki pemahaman dasar tentang Wikidata. Speakers: Raisha (WSC)
June 28th at 13:30 UTC 25 minutes 🌐 Indonesian 🔗 Zoom	How to start with modelling words on Wikidata? How to start modelling words (Indonesian) - Slides How to Start Modelling Words (Indonesian) (Lexicodays 2024) During this workshop, we will discover how to start modelling words in Indonesian languages. This entry-level session is open to everyone with a basic understanding of Wikidata. id: Dalam lokakarya ini, kita akan mempelajari bagaimana cara memodelkan kata-kata dalam bahasa Indonesia. Sesi pemula ini terbuka untuk semua orang yang memiliki pemahaman dasar tentang Wikidata. Speakers: Raisha (WSC)
June 28th at 14:00 UTC 50 minutes 🌐 Indonesian 🔗 Zoom	Editathon Editathon (Indonesian) (Lexicodays 2024) Following the presentations of Lexicographical Data and how to model words, we will apply our new knowledge by editing together. We will create and improve words in Indonesian. id: Sebagai lanjutan dari paparan sebelumnya tentang Data Leksikografis dan cara untuk memodelkan kata, sesi ini adalah tempat untuk menerapkan apa yang sudah kita pelajari dengan menyunting bersama secara langsung. Di dalam sesi ini, kita akan membuat dan meningkatkan kualitas entri kata-kata dalam bahasa Indonesia. Facilitators: Raisha (WSC)
June 28th at 15:00 UTC 50 minutes	Break
June 28th at 16:00 UTC 50 minutes 🌐 English 🔗 Zoom	Opening session and Introduction to Lexicographical data (English) Introduction to Lexicographical data (English) - Slides Introduction to Lexicographical Data (English) (Lexicodays 2024) After a quick presentation of the event program and how to participate, we will discover how to start modelling words in Indonesian languages. This entry-level session is open to everyone with a basic understanding of Wikidata. Speakers: Lydia Pintscher (WMDE), Mohammed Abdulai (WMDE)
June 28th at 17:00 UTC 50 minutes 🌐 English 🔗 Zoom	How to start with modelling words on Wikidata? How to start modelling words (English) - Slides How to start modelling words (English) (Lexicodays 2024) During this workshop, we will discover how to start modelling words in English and other languages. This entry-level session is open to everyone with a basic understanding of Wikidata. Speakers: Lydia Pintscher (WMDE), Mohammed Abdulai (WMDE)
June 28th at 18:00 UTC 25 minutes 🌐 English 🔗 Zoom	Editathon Following the presentations of Lexicographical Data and how to model words, we will apply our new knowledge by editing together. We will create and improve words in English and other languages. Speakers: Lydia Pintscher (WMDE), Mohammed Abdulai (WMDE)
June 28th at 18:30 UTC 30 minutes	Break
June 28th at 19:00 UTC 50 minutes 🌐 English 🔗 Jitsi	How many lexemes does a language really need? This session will discuss the question of how many lexemes a language really needs to have in order to be sufficient for natural language generation purposes. (Some spoilers: 1) it depends on how the language works, and 2) depending on what other languages can bring to the table, this number may actually be smaller than you think!) Slides from this presentation may be found here. id: Dalam sesi ini, kita akan mendiskusikan mengenai pertanyaan berapa banyak Leksem yang sebenarnya perlu dimiliki sebuah bahasa untuk dapat menghasilkan bahasa alami. (Beberapa beberan: 1) ini tergantung bagaimana suatu bahasa bekerja, dan 2) tergantung apa yang dapat dibawa oleh bahasa lain, bahkan bisa jadi lebih sedikit dari yang Anda pikirkan!) Speaker: Mahir256 This session takes place on Jitsi.
June 28th at 20:00 UTC 50 minutes 🌐 English 🔗 Jitsi	Modeling proverbs and sayings as lexemes This session will discuss how sentences (specifically proverbs and sayings) in different languages can be modeled on Wikidata, using information on their "combines lexeme" statements to capture syntactic and semantic information. Slides from this presentation may be found here. id: Dalam sesi ini, kita akan mendiskusikan bagaimana sebuah kalimat (terutama pepatah dan peribahasa) dalam bahasa yang berbeda dapat dimodelkan di Wikidata berdasarkan informasi dari pernyataan "gabungkan leksem" untuk menangkap informasi sintaksis dan semantiknya Speaker: Mahir256 This session takes place on Jitsi.
June 28th at 21:00 UTC 50 minutes 🌐 English 🔗 Jitsi	Modeling compound word lexemes (and helping other languages in the process) This session will discuss how lexemes for compound words (particularly nouns composed of other nouns) can be set up so that other languages can take advantage of the information in them. Slides from this presentation may be found here. id: Dalam sesi ini, kita akan mendiskusikan bagaimana Leksem untuk kata majemuk (terutama kata benda yang terdiri dari kata benda lainnya) dapat disiapkan sehingga bahasa lain dapat memanfaatkan informasi yang ada di dalamnya. Speaker: Mahir256 This session takes place on Jitsi.

Day 2 - Modelling and tools

Sessions are taking place on Zoom or on Jitsi. Find the link next to each presentation in the program below.


Session details	Title, description and speakers
June 29th at 02:00 UTC 25 minutes 🌐 English with Indonesian captions 🔗 Zoom	Introduction to Abstract Wikipedia Abstract Wikipedia and Lexemes - Cory Massaro - Lexicodays 2024 Lexeme and Abstract Wikipedia (Lexicodays 2024) During this presentation, you will learn about the newest Wikimedia projects, Wikifunctions and Abstract Wikipedia. This entry-level session is open to everyone with a basic understanding of the Wikimedia projects. id: Melalui presentasi ini, kita akan mempelajari dua proyek Wikimedia terbaru, yakni WikiFunctions dan Abstract Wikipedia. Sesi pemula ini terbuka untuk semua orang yang memiliki pemahaman dasar tentang proyek Wikimedia. Speaker: CMassaro (WMF)
June 29th at 02:30 UTC 25 minutes 🌐 English and Indonesian 🔗 Zoom	Lexemes & Abstract Wikipedia: how will they work together? Wikifunctions - Cory Massaro - Lexicodays 2024 During this discussion, we will learn how Abstract Wikipedia will use content from Wikidata, and Lexemes in particular, and why it is especially important to improve Lexemes in our languages on Wikidata. Understanding the content of this discussion will be easier if you attended the previous sessions "Introduction to Lexicographical data" and "Introduction to Abstract Wikipedia". id: Dalam diskusi ini, kita akan mempelajari bagaimana Abstract Wikipedia menggunakan konten-konten yang ada pada Wikidata, khususnya Leksem, serta mengapa meningkatkan Leksem di Wikidata untuk bahasa yang kita tuturkan merupakan hal yang penting. Anda akan lebih mudah memahami diskusi ini apabila Anda menghadiri dua sesi sebelumnya, yakni "Pengantar Data Leksikografis" dan "Pengantar Abstract Wikipedia". Speakers: Denny, Mahir256. Interpretation: Raisha (WSC).
June 29th at 03:00 UTC 50 minutes 🌐 Indonesian 🔗 Zoom	Modelling practices and editathon (Banjarese) Modelling practices and editathon (Banjar) Modelling practices and editathon for Banjarese (Lexicodays 2024) During this session, we will explore the words that Wikidata contains in the Banjarese language, discuss what should be improved, how to best model words in this language, and the possible issues that editors may encounter when adding words in the Banjarese language. We will then continue with adding and improving Lexemes together. id: Dalam sesi ini, kita akan menelusuri kata-kata dalam bahasa Banjar yang ada di Wikidata, mendiskusikan hal-hal yang perlu ditingkatkan, membahas cara terbaik untuk memodelkan kata-kata dalam bahasa tersebut, serta masalah-masalah yang mungkin ditemui oleh penyunting saat akan menambahkan kata-kata dalam bahasa Banjar. Sebagai lanjutan, kita akan menambahkan dan meningkatkan kualitas Leksem bersama-sama. Speakers: Arcuscloud, Volstand.

From June 29th at 04:00 UTC to 12:00 UTC: break, no session happening at that time.


Session details	Title, description and speakers
June 29th at 12:00 UTC 50 minutes 🌐 English with Indonesian captions 🔗 Zoom	Useful tools for Lexemes Introduction to Wikidata Lexeme Forms - Lucas Werkmeister - Lexicodays 2024 a demonstration of the tool Luthor Scribe presentation - Lexicodays 2024 - speaker: Andrew McAllister During this session, we will discover several tools that make editing and using Lexemes easier: Wikidata Lexeme Forms, a tool to create a new lexeme with a standard set of forms, by Lucas Werkmeister (see also the documentation page); Luthor, a tool that searches Wikisource for usage examples you can add to lexeme senses that don't have them yet, by Ijon; Scribe, an open-source iOS app that uses Wikidata lexicographical data as a basis for keyboards to help language learners, by the Scribe community. id: Dalam sesi ini, kita akan membahas peralatan yang akan membuat kegiatan menyunting dan menggunakan Leksem menjadi lebih mudah: Wikidata Lexeme Forms, peralatan untuk membuat Leksem berdasarkan formulir yang terstandar, dibuat oleh Lucas Werkmeister (lihat juga halaman dokumentasinya); Luthor, peralatan yang menjelajahi halaman Wikisource untuk mencari contoh penggunaan yang dapat ditambahkan ke dalam Leksem, dibuat oleh Ijon; Scribe, aplikasi iOS sumber terbuka yang menggunakan data leksikografis di Wikidata sebagai basis pembuatan papan ketik yang dapat membantu pembelajar bahasa, dibuat oleh komunitas Scribe. See also: videos with collaborative captions
June 29th at 13:00 UTC 25 minutes 🌐 Indonesian 🔗 Zoom	Adding audio pronunciation from Lingua Libre to Wikidata Adding audio pronunciation from Lingua Libre to Wikidata Adding audio pronunciation from Lingua Libre to Wikidata (Lexicodays 2024) During this session, we will discover how to record audio pronunciation on Lingua Libre and how to add the files to Lexemes on Wikidata. id: Melalui sesi ini, kita akan mempelajari cara merekam pelafalan melalui Lingua Libre dan cara menautkan berkas audio ke dalam Leksem di Wikidata. Speaker: Raisha (WSC)
June 29th at 13:30 UTC 50 minutes 🌐 Minangkabau/Indonesian 🔗 Zoom	Modelling practices and editathon (Minangkabau) Modelling practices and editathon (Minangkabau) - Slides Modelling practices and editathon for Minangkabau (Lexicodays 2024) During this session, we will explore the words that Wikidata contains in Minangkabau, discuss what should be improved, how to best model words in this language, and the possible issues that editors may encounter when adding words in this language. We will then continue with adding and improving Lexemes together. id: Dalam sesi ini, kita akan menelusuri kata-kata dalam bahasa Minangkabau yang ada di Wikidata, mendiskusikan hal-hal yang perlu ditingkatkan, membahas cara terbaik untuk memodelkan kata-kata dalam bahasa Minangkabau, serta masalah-masalah yang mungkin ditemui oleh penyunting saat akan menambahkan kata-kata dalam bahasa Minangkabau. Sebagai lanjutan, kita akan menambahkan dan meningkatkan kualitas Leksem bersama-sama. Speaker: Kartika (WSC)
June 29th at 14:30 UTC 50 minutes 🌐 English and Indonesian 🔗 Zoom	Bug Triage Hour: Wikidata Lexeme interface Bug Triage Hour (Lexeme Interface) (Lexicodays 2024) During this session, we will discuss issues and improvements with the Wikidata Lexeme interface. Come with your suggestions and existing Phabricator tickets! Together, we will improve the existing tickets and create some new ones. id: Dalam sesi Jam Triase Kekutu ini, kita akan membahas mengenai permasalahan dan perbaikan yang dapat dilakukan terhadap antarmuka Leksem di Wikidata. Silakan membagikan saran maupun tiket Phabricator yang telah Anda buat! Dalam sesi ini, kita akan bersama-sama membahas mengenai tiket yang telah ada dan membuat beberapa tiket baru. Facilitator: Lydia Pintscher (WMDE)
June 29th at 15:30 UTC 30 minutes	Break
June 29th at 16:00 UTC 50 minutes 🌐 English 🔗 Jitsi	Useful tools for Lexemes Introduction to Wikidata Lexeme Forms - Lucas Werkmeister - Lexicodays 2024 a demonstration of the tool Luthor Scribe presentation - Lexicodays 2024 - speaker: Andrew McAllister During this session, we will discover several tools that make editing and using Lexemes easier: Wikidata Lexeme Forms, a tool to create a new lexeme with a standard set of forms, by Lucas Werkmeister (see also the documentation page); Luthor, a tool that searches Wikisource for usage examples you can add to lexeme senses that don't have them yet, by Ijon; Scribe, an open-source iOS app that uses Wikidata lexicographical data as a basis for keyboards to help language learners, by the Scribe community. See also: videos with collaborative captions
June 29th at 17:00 UTC 50 minutes 🌐 English 🔗 Jitsi	Modeling predicate lexemes on Wikidata In this session, we will look at some senses for verbs or other verb phrases in some under-resourced languages and then determine what information would be necessary to model a proposition item for them. id: Dalam sesi ini, kita akan membahas mengenai makna untuk kata kerja atau frasa kerja lainnya dalam bahasa yang masih kekurangan sumber daya, serta untuk menentukan informasi apa yang perlu ditambahkan dalam memodelkan butir proposisi untuknya. Speakers: Mahir256 and عُثمان This session takes place on Jitsi.
June 29th at 18:00 UTC 60 minutes 🌐 English 🔗 Watch + chat on Owncast, YouTube or Twitch	Wikidata live editing - sport lexemes Ainali and Abbe98 do some live editing, and discuss the thought process behind what they are doing and why they do it. In this session we will query for, and edit, lexemes related to sport and show ways to find lexeme content in any topic area. id: Dalam sesi ini, Ainali dan Abbe98 akan menayangkan penyuntingan langsung, serta mendiskusikan proses berpikir mereka tentang apa yang mereka lakukan dan bagaimana cara melakukannya. Dalam sesi ini, kita akan melakukan kueri dan menyunting leksem yang bertopik olahraga, serta menunjukkan cara menemukan konten Leksem dalam berbagai topik. Speakers: Ainali and Abbe98
June 29th at 19:00 UTC 110 minutes - open ended 🌐 English, with some text in Norwegian and Turkish (and possibly other languages?) 🔗 Jitsi	Fun generating sentences with lexemes! This is a live coding session in which Ninai/Udiron, a natural language generation system using Wikidata lexemes and items, will be worked on. Anything could be accomplished here: the functionality for some grammatical constructs might be fixed; some new abstract content types might be introduced; even a new language might be added! Feel free to hop on or hop off the session when you like; I will greatly appreciate your live inputs and feedback! id: Ini adalah sesi pengkodean langsung di mana Ninai/Udiron, sebuah sistem penghasil bahasa alami yang menggunakan Leksem dan Butir Wikidata, akan dikerjakan. Beberapa hal yang dapat dilakukan dalam sesi ini antara lain: perbaikan fungsionalitas untuk beberapa konstruksi gramatikal; pengenalan beberapa jenis konten abstrak; bahkan bahasa baru dapat ditambahkan! Anda dipersilakan untuk bergabung dengan sesi ini kapan pun Anda berkenan. Kami akan sangat mengapresiasi masukan dan saran untuk sesi ini. Speaker: Mahir256 This session takes place on Jitsi.

Day 3 - More tools and conclusion

Sessions are taking place on Zoom or on Jitsi. Find the link next to each presentation in the program below.


Session details	Title, description and speakers
June 30th at 12:00 UTC 50 minutes 🌐 English 🔗 Zoom	Modelling practices and editathon (Malay) Modelling practices and editathon (Malay) Modelling practices and editathon for Malay (Lexicodays 2024) During this session, we will explore the words that Wikidata contains in the Malay language, discuss what should be improved, how to best model words in this language, and the possible issues that editors may encounter when adding words in Malay. We will then continue with adding and improving Lexemes together. id: Dalam sesi ini, kita akan menelusuri kata-kata dalam bahasa Melayu yang ada di Wikidata, mendiskusikan hal-hal yang perlu ditingkatkan, membahas cara terbaik untuk memodelkan kata-kata dalam bahasa tersebut, serta masalah-masalah yang mungkin ditemui oleh penyunting saat akan menambahkan kata-kata dalam bahasa Melayu. Sebagai lanjutan, kita akan menambahkan dan meningkatkan kualitas Leksem bersama-sama. Speaker: Tofeiku
June 30th at 13:00 UTC 50 minutes 🌐 Indonesian 🔗 Zoom	Querying Lexemes on Wikidata Querying Lexemes on Wikidata Querying Lexemes in Wikidata (Lexicodays 2024) During this session, we will discover how to build lists and queries using the Wikidata Query Service, and we will build interesting lists related to Lexemes. Attending this session will be easier if you are already familiar with Lexicographical Data on Wikidata. id: Dalam sesi ini, kita akan mempelajari bagaimana cara melakukan kueri menggunakan layanan Wikidata Query Service, dan kita akan belajar caranya membuat daftar yang berkaitan dengan Leksem. Speaker: Raisha (WSC)
June 30th at 14:00 UTC 50 minutes 🌐 English and Indonesian 🔗 Zoom	Conclusion and next steps Closing slides During this closing session, we will learn more about the projects developed by the Wikidata Software Collaboration Team in Indonesia. We will wrap up Lexicodays 2024 and ask the participants for feedback. id: Dalam sesi terakhir ini, kita akan mempelajari lebih lanjut mengenai proyek yang dikembangkan oleh tim Wikidata Software Collaboration di Indonesia. Setelah sesi tersebut, tibalah penutupan Lexicodays 2024. Kami akan meminta masukan dari seluruh peserta. Speakers: Raisha (WSC), Lea Lacroix (WMDE)
June 30th at 15:00 UTC 50 minutes 🌐 English 🔗 Zoom / Jitsi	Social time and games To conclude the event, let's play some Wikidata-based games and chat!
