Function details: The codes analyses dumps of Wikidata and can create an auto-transliterating system for any given pair of languages based on that. I started with Persian and Hebrew (some edits for test )
--Amir (talk) 18:14, 7 April 2015 (UTC)Reply[reply]
Comment, please let me know when you try your system for some cyrillic language. I'd like to see it myself. --Infovarius (talk) 14:10, 8 April 2015 (UTC)Reply[reply]
@Infovarius: I work in pair of languages like fa and he (which the bot adds Persian transliteration based on Hebrew and vice versa) which pair of language do you suggest? en and ru? Amir (talk) 11:54, 9 April 2015 (UTC)Reply[reply]
Probably you should have stated this in your request. Your phrase "I started with" has encouraged me :) No, I don't suggest Russian as I understand the complexity of the task. --Infovarius (talk) 13:16, 10 April 2015 (UTC)Reply[reply]
@Infovarius: I don't think Russian is too complicated to abandon. I took care of lots of different issues including country of citizenship, etc. so It's not hard for this bot. I asked you what language do think is the best pair for Russian *to start with* Amir (talk) 21:11, 10 April 2015 (UTC)Reply[reply]
Just a caveat when when dealing with Chinese languages: Chinese to Latin script (and vice versa) transliterations are rarely standardized. For example, Alan Turing's given name might be transliterated into 艾伦 or 阿兰 (as in the case of Alan Moore (Q205739)) or 亚伦 (as in the case of Alan Arkin (Q108283)). These Chinese characters are roughly resembles "Alan" when pronounced, but due to regional differences (i.e. mainland China, Taiwan, Hong Kong, etc), they result in different transliterations. Even when two people's names are transliterated by the same region, they can be different. There is simply no standardization on this matter. —Wylve (talk) 14:53, 23 April 2015 (UTC)Reply[reply]
It's not wrong, but it might not be the only way people call Alan Turing in Chinese. The lead sentence of Turing's article on zhwiki mentions that "Alan" is also transliterated as 阿兰. —Wylve (talk) 20:48, 25 April 2015 (UTC)Reply[reply]
@Wylve: I made 50 auto-transliterations , please check and say if anything is wrong or unusual. Thanks Amir (talk) 20:05, 16 May 2015 (UTC)Reply[reply]
I can't verify every name, since some of those people aren't mentioned in Chinese news sources. My standard of what is "wrong" or "unusual" is whether the transliterations you've produced are used predominantly in reliable and reputable sources. It is hard to judge sometimes, as there is a variety of transliterations used. For instance:
Jonathan Ross is transliterated as 强纳·森罗斯 and also 喬納森·羅斯
Leonard B. Jordan is also transliterated as 萊昂納德·B·喬丹
Jimmy Bennett is also transliterated as 吉米·本内特, 吉米班奈, 吉米班奈特.
Jason Lee is also named 杰森·李.
"Scott" from A. O. Scoot is also transliterated as 史考特.
All of your edits should be fine if read in Chinese, as they all sound like their English name. Also, I have found this page (), which documents Xinhua News Agency (Q204839)'s official transliterations of names. These transliterations are considered official only in Mainland China. —Wylve (talk) 21:58, 16 May 2015 (UTC)Reply[reply]
┌────────────────────────────────────────────────────────────────────────────────────────────────────┘ @Ladsgroup, Wylve: Does this look okay for an approval, or is there something we're missing? I don't speak (or read, for that matter) Chinese HazardSJ 05:40, 28 December 2015 (UTC)Reply[reply]
Well, last time people talked in this page was a year and half ago. I need to search to find the script and check. I'll do it soon Amir (talk) 19:12, 5 January 2017 (UTC)Reply[reply]
@Ladsgroup: Only human names? How about geographical objects (populated places, rivers, etc.)? Right now I'm thinking to transliterate manually some batches of names of Ukrainian localities and to harvest them in WD; should I leave this task for your bot?:) --XXN, 14:49, 12 May 2017 (UTC)Reply[reply]
I don't think the AI would be good enough to do that for now, I'm planning to use w:LSTM in near future and in that case we might do some experiments soon. Amir (talk) 14:56, 12 May 2017 (UTC)Reply[reply]
@PinkAmpersand: I love to, this is one of the most exciting things that can happen in labeling in Wikidata but unfortunately I don't have time for this. I hereby withdraw from this request Amir (talk) 23:57, 6 March 2018 (UTC)Reply[reply]