Wikidata talk:WikiProject Labels and descriptions

Tranches

A fantastic idea. The only suggestion I have at this stage is that while tranches of 1000 for labels seem doable, tranches of 100 for descriptions might be a bit more manageable for the casual contributor. --WFC 01:20, 3 November 2012 (UTC)

The headers are only there to make it easy to see where something has already been done and to prevent edit conflicts. So under the header Q4001 - Q5000 you can also add that you have done * English: Q4301 - Q4400 - {{done}} - ~~~~. Romaine (talk) 01:25, 3 November 2012 (UTC)
And the Italian language? Raoli (talk) 22:22, 3 November 2012 (UTC)

Thinking ahead

There should soon be millions of Wikidata entries, and they should all have labels and descriptions in as many languages as possible, even those that do not have a corresponding Wikipedia article. To achieve this, we will surely need a bit of planning and quite a bit of bot development. I think it could roughly follow these steps:

  • collect ideas for format and internationalization
  • run bots to do the bulk of the work
  • manually refine
  • go back to step one.

One thing that may be useful would be to first collect many translations of phrases that could be used in many items, like "a disambiguation page" or "a village in India". That way a bot could add them, and users could then tweak them to make them more precise. Zolo (talk) 17:48, 4 November 2012 (UTC)

Good point, even if a policy for labels/descriptions would be language-specific. However, Wikidata is organized by numbers, and we will need some tables showing the gaps in different languages (by item, actually). These tables are going to be HUGE (items multiplied by languages), so it is better to find a good enough structure for them. Zanka (talk) 18:42, 4 November 2012 (UTC)

Special page

Would it be possible to have a special page listing all the items without a label or description in a certain language? --Cruccone (talk) 11:02, 6 November 2012 (UTC)

Please follow bugzilla:41597 for that. --Lydia Pintscher (WMDE) (talk) 11:18, 6 November 2012 (UTC)
Code to do this is now committed; do you know when it will be released to production? Jdforrester (talk) 06:22, 10 November 2012 (UTC)
It'll be in the next deployment, which will hopefully happen soon. Unfortunately I have no date for it, though. --Lydia Pintscher (WMDE) (talk) 11:49, 10 November 2012 (UTC)
As long as there is no special page, I could list all items that are missing a label or description. (I already did it for part of the fr and de labels.) --Sk!d (talk) 22:33, 10 November 2012 (UTC)

I just submitted a bug to request a special page for missing descriptions, too: bugzilla:42102 --Waldir (talk) 15:00, 14 November 2012 (UTC)

It would be great to have special pages for that. Romaine (talk) 15:28, 14 November 2012 (UTC)

Splitting Pages into Languages

I am thinking about splitting this page into subpages, e.g. Wikidata:Labels and Descriptions Task Force/en, one for every language, because this page will get bigger and bigger and not all languages progress equally fast. Another aspect is that most people don't care about other languages, so if they had one page in their own language they might be more active, and the introduction/discussion would be in their first language. A further aspect is that speakers of other languages, such as Russian, may also be willing to start a task force. What do you think? --Sk!d (talk) 13:02, 17 November 2012 (UTC)

Comparison is motivating, but of course you're right that the page is getting too long. BTW: The names of the categories are not consistent, see Wikidata:Project chat#Category:Task force. --Kolja21 (talk) 19:07, 17 November 2012 (UTC)
The page is getting too long, but it will become much, much longer in the near future (something like 10 million once all relevant Wikipedia articles are added). So I am not sure that this way of dealing with descriptions and labels is viable. I guess thematic task forces like Wikidata:Country subdivision task force will be better able to handle a high number of pages in a relatively clean way (and for items that are not part of any task force, we should soon get special pages for items without a label/description). --Zolo (talk) 09:04, 18 November 2012 (UTC)
In the long run we need tracking categories like de:Kategorie:Wikipedia:Normdaten-Wartung. They would have names like "Label missing" and subcategories like "Label missing/en". --Kolja21 (talk) 11:23, 18 November 2012 (UTC)
This won't be necessary: with the next Wikidata update there should be a special page where you can search for items with missing labels/descriptions. --Sk!d (talk) 16:38, 18 November 2012 (UTC)
What a bright future ;) --Kolja21 (talk) 16:51, 18 November 2012 (UTC)

Q1161

It seems that this item links to articles about the Limbourg region but also to disambiguation pages in different languages. I don't know which ones should be kept. B25es (talk) 16:05, 20 November 2012 (UTC)

I've changed the description from "the concept of things called 'Limburg'" to "disambiguation page", see Help:Description. --Kolja21 (talk) 16:35, 20 November 2012 (UTC)

Difficult labels

I don't know what to put as label:

Anyone? Romaine (talk) 11:44, 24 November 2012 (UTC)

Cheers --Kolja21 (talk) 12:05, 24 November 2012 (UTC)

I've added the labels and descriptions. It's Ivanivka, because it's in Ukraine. Cheers. --Robot Monk (talk) 15:45, 24 November 2012 (UTC)
Thanks! I had only looked at the Russian translation, but of course you are right. --Kolja21 (talk) 18:13, 24 November 2012 (UTC)

Language worklist subpages

As you may have seen, I added subpages that are updated by my bot. Please feel free to add any comments here. Is there something I can improve in content or layout? Do you have any questions? If there is a major community for an additional language, I could create a new report, too. If Toolserver is OK (currently it is not), reports are updated twice a day. Merlissimo (talk) 13:19, 26 November 2012 (UTC)

The list Wikidata:Labels and descriptions task force/de is a huge help. Could you write a short introductory text explaining how the subdivision is to be understood? The first two sections have none; the third apparently lists various German dialects, including the "other" section. Regards --Kolja21 (talk) 16:05, 26 November 2012 (UTC)
Section 1: a de description exists, but no de label.
Section 2: a de label exists, but no de description.
Section 3: neither a de label nor a de description exists. This list is, however, further subdivided according to the existing sitelinks. Items in each subsection have a sitelink to the wiki named in the h4 heading. All wikis where de is in the fallback path of the language "de" are listed individually. All those where de does not appear in the fallback path can be found under "other". The "other" section will be the hardest to work through, because it also contains items that link, for example, only to the Korean and Arabic wikis, so real work has to be invested in the translation.
Section 4: label and description are identical. Items should be distinguishable, however, so the description should be expanded.
Merlissimo (talk) 16:19, 26 November 2012 (UTC)
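
As an illustration of the four sections described above, here is a minimal Python sketch. It is not the bot's actual code: the item layout (a dict with "labels", "descriptions" and "sitelinks"), the function names, and the list of wikis that get their own subsection are all assumptions made for the example. In particular, how the real report picks a subsection when an item has several matching sitelinks is not stated here; the sketch simply takes the first match.

```python
from typing import Optional

# Hypothetical list of wikis that get their own subsection in section 3.
# The real report derives this from language fallback data.
ILLUSTRATIVE_OWN_WIKIS = ("dewiki", "barwiki", "alswiki")


def worklist_section(item: dict, lang: str = "de") -> Optional[int]:
    """Return the worklist section (1-4) for an item, or None if it needs no work."""
    label = item.get("labels", {}).get(lang)
    description = item.get("descriptions", {}).get(lang)
    if description and not label:
        return 1  # description exists, label missing
    if label and not description:
        return 2  # label exists, description missing
    if not label and not description:
        return 3  # both missing
    if label == description:
        return 4  # both exist but are identical
    return None  # both exist and differ


def section3_subsection(item: dict, own_wikis=ILLUSTRATIVE_OWN_WIKIS) -> str:
    """Pick the section 3 subsection: the first listed wiki the item links to, else 'other'."""
    sitelinks = item.get("sitelinks", ())
    for wiki in own_wikis:
        if wiki in sitelinks:
            return wiki
    return "other"
```

Because the checks are mutually exclusive, every item lands in at most one section.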

Bot setting automatic disambiguation description

My bot is currently setting automatic disambiguation descriptions for en, de and it (like this). To do so, it first checks whether a description already exists and then whether the page is a disambiguation page. If it is, the bot sets the description (if there is no interwiki link at all but e.g. the English page is a disambiguation page, it also sets the description). My bot also removes the bracketed part of the label, e.g. "(disambiguation)" in en. I would like to do this work for many other languages too, but for that the rules must be clear, so every language community should discuss whether they want my bot to do the same for their language. If so, they should tell me what description my bot should add and which part of the label it should remove. It is also possible to add a description only if there is an interwiki link (to a disambiguation page). --Sk!d (talk) 01:11, 5 December 2012 (UTC)

+1 looks good! --Kolja21 (talk) 15:19, 5 December 2012 (UTC)
Bot should now add all three descriptions above. --Sk!d (talk) 23:26, 6 December 2012 (UTC)
  Done, bot is also adding ru: "страница значений" now. --Sk!d (talk) 15:09, 13 December 2012 (UTC)
  Done --Sk!d (talk) 15:37, 13 December 2012 (UTC)
  Done --Sk!d (talk) 13:10, 28 December 2012 (UTC)
  Done --Sk!d (talk) 03:14, 10 January 2013 (UTC)
Already added this. --Sk!d (talk) 22:28, 21 February 2013 (UTC)
  Done --Sk!d (talk) 18:31, 20 April 2013 (UTC)
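
For readers who want to see the rule from the top of this thread spelled out, here is a rough Python sketch of the decision the bot is described as making. It is only an illustration under assumptions, not Sk!d's code: the function names and the RULES table are hypothetical, the English description text is a placeholder, and only the Russian text and the English "(disambiguation)" suffix come from the thread itself.

```python
import re
from typing import Optional

# Hypothetical per-language rules: (description to add, bracketed suffix to strip).
# Each language community would have to agree on its own entry.
RULES = {
    "en": ("disambiguation page", "disambiguation"),  # description text is a placeholder
    "ru": ("страница значений", None),  # no suffix rule is given in the thread
}


def strip_bracket_suffix(label: str, suffix: Optional[str]) -> str:
    """Remove a trailing '(suffix)' from a label, if a suffix rule exists."""
    if not suffix:
        return label
    pattern = r"\s*\(" + re.escape(suffix) + r"\)\s*$"
    return re.sub(pattern, "", label)


def propose_edit(label: str, description: Optional[str],
                 is_disambiguation: bool, lang: str) -> Optional[dict]:
    """Return the proposed label and description, or None if the bot should skip the item."""
    if description:  # never overwrite an existing description
        return None
    if not is_disambiguation:
        return None
    if lang not in RULES:  # no agreed rule for this language yet
        return None
    new_description, suffix = RULES[lang]
    return {
        "label": strip_bracket_suffix(label, suffix),
        "description": new_description,
    }
```

Keeping the rules in one table means that adding a language only requires agreeing on its two values, which is what the request to each language community amounts to.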

Proposal for a new page design

This page will get more and more confusing with increasing content. I would like to change it to something like this:


Labels

Labels completed till...


Labels missing inbetween

English: Q123, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456
German: Q123, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456
French: Q123, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456
Spanish: Q123, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456
Italian: Q123, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456, Q123, Q456


What do you think? --Sixsi6ma (talk) 02:29, 9 December 2012 (UTC)

Is it necessary to keep titles and descriptions separate? Obviously, when a page is done, both are filled in, and I don't see the point of having to include them in two lists. --LadyInGrey (talk) 02:48, 9 December 2012 (UTC) If someone could translate this, I would be very grateful.
I assume the question is whether we would have the same thing for the descriptions? Yes, the same goes for those accordingly. I am not sure what the rest means: either you don't like the separation between the progress border and the table of missing ones, or the separation between labels and descriptions. The labels are mostly set by bot and are therefore further ahead, so I would not recommend combining them with the descriptions. I separated the ones missing in between because there will be quite a lot of them and I don't want them to obstruct the progress border. --Sixsi6ma (talk) 05:29, 9 December 2012 (UTC)
Following up on LadyInGrey's point, whenever we fill in the title field we also fill in the description field. Sometimes the opposite is also true: we modify bot-generated titles when we describe. Therefore, having different listings for titles and descriptions doesn't help. Bot titles are not perfect. For instance, the provincia de Valencia is to be titled Valencia and described as provincia de España. The bot would just title it Provincia de Valencia. Not terribly wrong, but it could lead to error if you search for Valencia, as you wouldn't find that, in addition to being a city, it is also a province. I support having just one list of Qs with missing info, either title and/or description. B25es (talk) 07:57, 9 December 2012 (UTC)
Are you talking about the main page layout or the subpages created by my bot? The subpages also have different lists, but every item is a member of exactly one list. Merlissimo (talk) 12:18, 14 December 2012 (UTC)

Duplicate items

I have a problem with these two items: Q3298 and Q3695. The second seems to be a duplicate of the first, both in English and in German. Does anyone have any ideas? Thanks in advance! --Paperoastro (talk) 23:21, 14 December 2012 (UTC)

They don't seem to be duplicates to me; the German articles even mention each other. --Sixsi6ma (talk) 02:11, 15 December 2012 (UTC)
I got it. Thanks for the explanation! :) --Paperoastro (talk) 13:39, 16 December 2012 (UTC)

List of items without labels and/or descriptions

I have created some pages with lists of items without labels and/or descriptions: English, Deutsch, Français, Español, Italiano. If it's useful, I can keep them updated. --ValterVB (talk) 21:26, 19 January 2013 (UTC)

Thanks. I've added a link to Wikidata:Labels and descriptions task force/de. --Kolja21 (talk) 01:35, 20 January 2013 (UTC)
For now, these are still the best pages for adding labels and descriptions. --Giftzwerg 88 (talk) 08:07, 16 August 2013 (UTC)

style guide

Is there a style guide for labels and descriptions yet, especially for new contributors? Or do we need to write one? -- Phoebe (talk) 18:05, 5 March 2013 (UTC)

I guess to answer my own question there is Help:Description and . A combined how-to-do-it style guide might be nice though. -- Phoebe (talk) 18:14, 5 March 2013 (UTC)
How about putting the help pages Help:Label, Help:Description and Help:Aliases into one page? Every beginner needs to read all three help pages; they are closely connected, so it would be easier to turn them into a how-to-do-it style guide. --Giftzwerg 88 (talk) 07:02, 16 August 2013 (UTC)

2 pages in English, 1 page in Spanish

Q293074 links to Holy League in English and Liga Santa in Spanish. Q226091 links to Catholic League in English and Liga Santa in Spanish. So when I try to add the Spanish label to Q293074, it is already in use on the other item. Do I just change one of the descriptions to "Wikipedia disambiguation"? Is there already a recommendation for this situation? Thanks! --Tbennert (talk) 02:46, 8 March 2013 (UTC)

Default label

There are a lot of items in Wikidata, and a lot of labels are missing in various languages. Would it be possible to introduce something like a default label, to show the item name in a more descriptive format than Q#####? The name might be a copy of the label in the item's language of origin where possible, i.e. for a German person it could be a copy of the German label, for a village in France the label in French, etc. The default label might be shown whenever no label is provided in some language. Another solution for default labeling might be to select one preferred label from the list of available labels in the various languages. Additionally, a corresponding mw.wikibase.defaultLabel() could be provided for use in Lua scripts to access the label. Paweł Ziemian (talk) 20:16, 5 July 2013 (UTC)
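
The fallback order proposed above could be sketched as follows. This is only an illustration in Python, not an existing Wikibase function; the name display_label, its parameters, and the sample item ID and label are all made up for the example, and the proposed mw.wikibase.defaultLabel() is a suggestion in this thread rather than something the sketch calls.

```python
from typing import Mapping, Optional, Sequence


def display_label(item_id: str,
                  labels: Mapping[str, str],
                  lang: str,
                  origin_lang: Optional[str] = None,
                  preferred: Sequence[str] = ("en",)) -> str:
    """Pick a label to show, falling back step by step to the bare item ID."""
    # 1. The label in the requested language, if it exists.
    if labels.get(lang):
        return labels[lang]
    # 2. The label in the item's language of origin (e.g. French for a village in France).
    if origin_lang and labels.get(origin_lang):
        return labels[origin_lang]
    # 3. The first available label from a preferred list of languages.
    for code in preferred:
        if labels.get(code):
            return labels[code]
    # 4. Nothing usable: show the Q number as today.
    return item_id


# Hypothetical item with only a French label.
print(display_label("Q999999999", {"fr": "Exempleville"}, lang="pl", origin_lang="fr"))
```

The origin-language step needs some way to know the item's language of origin, which the proposal leaves open; here it is simply passed in by the caller.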

Howto, please?

I think I may be able to help with the Spanish labels, but I'm not sure what this task force is about. Could someone provide a brief step-by-step intro on how the task is issued to participants, and what to do at each linked property page? I'm interested in learning the ropes of Wikidata, and this one looks like a reasonable hands-on entry point. Diego Moya (talk) 18:14, 11 August 2013 (UTC)

Hi Diego Moya,

On User:ValterVBot/Labels_and_descriptions/es you will find a daily updated list of items that don't have a label and/or description. You are free to add those in Spanish. Before you start adding labels and descriptions, please have a look at the following pages: Help:Label/es and Help:Description/es --Pasleim (talk) 18:03, 14 August 2013 (UTC)

Items with only Asian sitelinks and labels

National Army (Q26953) has only a very vague English description, so I can't add a German label / description. --Giftzwerg 88 (talk) 11:52, 15 August 2013 (UTC)

As it's a disambiguation, the label should probably be "國軍" in all languages. --  Docu  at 11:56, 15 August 2013 (UTC)
But it needs transcription into the Latin alphabet. --Giftzwerg 88 (talk) 12:12, 15 August 2013 (UTC)

Interim balance

For now, this task force is progressing at very different speeds in the different languages. I'm considering just the "big six" languages en, fr, de, es, it, ru.

Diagnosis:

  • the first 1000 items are done in all six languages
  • the work for ru has stalled at Q1500
  • the "big five" languages (the big six without ru) continue up to Q9000
  • continuous work for fr reaches roughly up to Q9000
  • continuous work for it reaches roughly up to Q10000
  • continuous work for de reaches roughly up to Q14000
  • continuous work for en reaches roughly up to Q30000

Conclusions and suggestions:

  • progress is slow
  • the task force apparently has no Russian-speaking members at present. Some more Russian-speaking users should be motivated to continue the work
  • the coloured table on this page could be updated
  • we have no idea about the progress of any languages besides the big six (perhaps it would be a good idea to compile statistics for all languages, updated roughly every 3 months)
  • Compared to the 14,000,000 items, the 14,000 labels and descriptions for de are just 1/1,000 of all items; en has 2/1,000 completed and ru 1/10,000.
  • No matter how many bots are adding labels and descriptions, and how skillful they are, there will be gaps that can only be filled by hand by human users.
  • It is desirable for fr, it and de to hit the interim goal of Q20000 asap
  • the work of this task force needs more support from the community
  • put links to the task force pages on the Help:Label and Help:Description pages
  • A list from ValterVBot for all other languages would be helpful to get an overview; however, it is not necessary to update these on a daily basis. But it can be an indicator of the rate of change from unlabeled to labeled items, so the update rate can be adjusted from every 3 months to every month, to every 14 days, up to daily updates, depending on the activity rate.
  • More languages could be included in this task force.

--Giftzwerg 88 (talk) 09:28, 18 August 2013 (UTC)
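
As a quick check of the fractions quoted in the comparison above, here is a small Python sketch. It uses only the figures given in this post (14,000,000 items in total and the Q numbers up to which continuous work reaches); the variable and function names are made up for the example, and the figures count sequential coverage only, not all labels that exist.

```python
# Figures taken from the post above; names are illustrative.
TOTAL_ITEMS = 14_000_000
CONTINUOUS_UP_TO = {"de": 14_000, "en": 30_000, "ru": 1_500}


def share_per_thousand(done: int, total: int = TOTAL_ITEMS) -> float:
    """Items covered by continuous work, expressed per 1,000 items in total."""
    return done / total * 1000


for lang, done in CONTINUOUS_UP_TO.items():
    print(f"{lang}: {share_per_thousand(done):.2f} per 1,000 items")
```

The results come out close to the rounded fractions in the post: about 1 per 1,000 for de, a little over 2 per 1,000 for en, and about 1 per 10,000 for ru.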

  • Statistics for all languages can be found at User:Byrial/Language_statistics_for_items.
  • There are not just 14,000 German labels and descriptions; there are more than one million.
  • Editing items in ascending order of Q numbers is a systematic approach, but I think there are better ways to do it. Terminator is a list of items which have the same label but no description. There would be much more benefit if some of us worked on that list.
--Pasleim (talk) 10:32, 18 August 2013 (UTC)
Frequency of update: we can use the database dump, available every 2 weeks (more or less); with the dump we can make more detailed lists. --ValterVB (talk) 10:38, 18 August 2013 (UTC)
Thanks for your input, I'll occasionally work on some of this list. I am aware that there are other ways to edit items and that many more items have been edited. In the beginning I also sometimes searched for common names such as "Alexander" or "Ludwig". I found numerous incomplete items and then added labels and descriptions. Whenever I do a search, I add labels and descriptions based on the results if they are missing. However, there always remain gaps that cannot be reached this way, so the continuous work is necessary and covers the rest; it should be done and updated at least occasionally. The gap between the first item on Valter's list and the last somehow represents the amount of work that needs to be done. If all the items are within a range of 5000, it is clear that almost nothing is completed; if the range is 20,000, it means the gaps are small and can be closed much more easily. The aim is to have no gaps in some ranges. --Giftzwerg 88 (talk) 11:40, 18 August 2013 (UTC)

Add description (need a little help)

Here, I need a little help adding descriptions to Category, Disambiguation and Template items. --ValterVB (talk) 12:43, 1 November 2013 (UTC)

Launch of WikiProject Wikidata for research

Hi, this is to let you know that we've launched WikiProject Wikidata for research in order to stimulate a closer interaction between Wikidata and research, both on a technical and a community level. As a first activity, we are drafting a research proposal on the matter (cf. blog post). It would be great if you saw room for interaction! Thanks, --Daniel Mietchen (talk) 01:45, 9 December 2014 (UTC)

Automatic label for music albums and singles

Please participate in this discussion, which is related to the scope of this project :) Thank you. --Superchilum(talk to me!) 08:21, 4 November 2015 (UTC)

Descriptions for Wikinews articles

Should all items about Wikinews articles (e.g. d:Q17889377) have the description "Wikinews article"? --Superchilum(talk to me!) 13:55, 6 November 2015 (UTC)

That wouldn't be very helpful. We could use, say:

  • Wikinews article about foo
  • Wikinews article dated 1 November 2015
  • Wikinews article about foo, dated 1 November 2015

-- Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 20:15, 6 November 2015 (UTC)

Wikimania 2016

Only this week left for comments: Wikidata:Wikimania 2016 (Thank you for translating this message). --Tobias1984 (talk) 11:56, 25 November 2015 (UTC)

Outdated?

Hi,

Jura1 marked this project as outdated. True, most of the job has been taken over by bots, but I think there is still a lot to be done by humans (at least monitoring the bots' work).

Per suggestion of Tubezlob, I propose to create Wikidata:WikiProject Labels and descriptions/French to focus on the French language.

Regards, VIGNERON (talk) 17:47, 24 March 2017 (UTC)

Descriptions - Mass duplication, automation, translation

Hi. There are some notes in two locations:

Can we talk about this? Does anyone have a good collection of other previous discussions, and/or is willing to summarize the current state of affairs and future options available to us? I made an attempt to summarize, but I'm not a power-editor here, and hopefully someone else can do a better job.

Thanks. :-) Quiddity (talk) 17:42, 4 September 2017 (UTC)

I was reminded of Wikidata:Automating descriptions, too. Quiddity (talk) 19:55, 15 September 2017 (UTC)

Wrong capital letters

Hi all! In my watchlist I often see new users speaking different languages adding labels, descriptions and/or aliases with a wrong capital letter or ending with punctuation (tag). I have sent a lot of messages to inform them about our policy regarding capital letters and punctuation. How about creating a standard notice template to inform new users about this issue? I think it would be really useful. Thank you all, --Epìdosis 14:19, 15 August 2018 (UTC)

I've found it (for uppercase letters): Template:Uw-uppercase. --Epìdosis 12:21, 27 August 2018 (UTC)