Wikidata:Requests for permissions/Bot/AroundTheBot

AroundTheBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Hardwigg (talk • contribs • logs) & BrigidGit (talk • contribs • logs)

Task/s: Automated import of Albanian nouns with IPA from Wiktionary, with the long-term goal of using this data for pronunciation-based comparison of words and their evolution across languages.

Code: This notebook performs the initial analysis and cleanup of the kaikki dataset. This notebook (run inside PAWS) coerces the cleaned-up data into Wikidata format and performs the actual import.

Function details: We worked with the kaikki dataset, a structured parsing of Wiktionary, to find relevant Albanian nouns with IPA pronunciation, remove any noisy entries, coerce the words into the lexeme format used by Wikidata, and then import them into Wikidata. --Hardwigg (talk) 12:22, 18 July 2024 (UTC) & @BrigidGit
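The filter-and-coerce steps described above can be sketched roughly as follows. This is only an illustration and not the operators' notebook code: the function names, the de-duplication rule, the sample records (including the placeholder transcription), and the item IDs `Q8748` (Albanian) and `Q1084` (noun) are assumptions that should be checked before use. The field names `word`, `pos`, `lang_code` and `sounds`/`ipa` are assumed to follow the kaikki JSON-lines export. How the IPA is attached to the lexeme (for example as a statement on a form) is not stated in the request, so it is carried alongside the payload rather than embedded in it.

```python
import json

# Assumed identifiers (not stated in the request); verify before any import.
ALBANIAN_QID = "Q8748"  # assumed item for the Albanian language
NOUN_QID = "Q1084"      # assumed item for the lexical category "noun"
LANG_CODE = "sq"        # language code for Albanian

# Invented sample records in a kaikki-like JSON-lines shape (illustrative only;
# the transcription is a placeholder, not sourced from the dataset).
SAMPLE = [
    '{"word": "ujë", "pos": "noun", "lang_code": "sq", "sounds": [{"ipa": "/ˈujə/"}]}',
    '{"word": "ujë", "pos": "noun", "lang_code": "sq", "sounds": [{"ipa": "/ˈujə/"}]}',
    '{"word": "bukë", "pos": "noun", "lang_code": "sq", "sounds": []}',
    '{"word": "water", "pos": "noun", "lang_code": "en", "sounds": [{"ipa": "/x/"}]}',
]


def first_ipa(entry):
    """Return the first non-empty IPA string in an entry, or None."""
    for sound in entry.get("sounds", []):
        ipa = sound.get("ipa")
        if ipa:
            return ipa
    return None


def to_lexeme(word):
    """Build a minimal lexeme payload (lemma, language, lexical category)."""
    return {
        "type": "lexeme",
        "language": ALBANIAN_QID,
        "lexicalCategory": NOUN_QID,
        "lemmas": {LANG_CODE: {"language": LANG_CODE, "value": word}},
    }


def build_records(lines):
    """Keep Albanian nouns that have IPA, drop duplicates, and pair each
    lexeme payload with its transcription."""
    seen = set()
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        if entry.get("lang_code") != LANG_CODE or entry.get("pos") != "noun":
            continue
        word, ipa = entry.get("word"), first_ipa(entry)
        if not word or not ipa or (word, ipa) in seen:
            continue  # treated here as a "noisy" entry: missing data or a repeat
        seen.add((word, ipa))
        records.append({"lexeme": to_lexeme(word), "ipa": ipa})
    return records
```

Calling `build_records(SAMPLE)` keeps only the entries that pass every check; the resulting payloads would still need to be submitted through whatever import mechanism the bot uses, which is outside this sketch.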

Please make some test edits. Ymblanter (talk) 20:22, 28 July 2024 (UTC)
Awesome! We will get that done this week. Hardwigg (talk) 23:59, 2 August 2024 (UTC)
@Ymblanter OK, we're still working through a few fixes to the script, but we should be ready to do a 50-edit test set by next week. You can see the first few automated edits we've been making here: Special:Contributions/AroundTheBot Hardwigg (talk) 10:27, 15 August 2024 (UTC)