User talk:Legobot/properties.js

Latest comment: 9 years ago by Nemo bis in topic Italian person data

U.S. Roads

edit
Took a look at it and it should be fine. --Rschen7754 20:46, 19 February 2013 (UTC)Reply

The main type of item (P107) for roads is place (= geographical feature, Q618123). --Kolja21 (talk) 20:42, 19 February 2013 (UTC)Reply

We've been running just these three types, but I'm planning to do a larger request with main types soon. --Rschen7754 20:46, 19 February 2013 (UTC)Reply
Since these aren't very big requests I don't mind re-running them later on to add more properties. Legoktm (talk) 12:26, 20 February 2013 (UTC)Reply
Looks good to me (and thanks for getting the next few started - it's been a busy few days). --Rschen7754 21:45, 26 February 2013 (UTC)Reply
All good except the last one - a bit too much collateral to only tag one article, w:en:U.S. Route 11 in Mississippi. --Rschen7754 07:25, 28 February 2013 (UTC)Reply

Rivers in Africa

edit

None currently

Gender categories from Swedish Wikipedia

edit
The Swedish Wikipedia is great, because they have two flat categories, one for each gender. This will be a huge job – 28285 woman articles, 117281 man articles, but it is all safe.

These categories have been   Checked, but are   Waiting for m:Steward_requests/Global_permissions#Global_API_High_Limit_Requestor_for_Legobot to be resolved. Also a few optimizations in the code. Legoktm (talk) 14:12, 17 February 2013 (UTC)Reply

Global permission received. One more code optimization to make and we'll be ready to go. Legoktm (talk) 07:16, 19 February 2013 (UTC)Reply
Can you please also add the main type of item (Property:P107): person (Q215627)? Thanks --Kolja21 (talk) 15:13, 19 February 2013 (UTC)Reply
That is already being done! I modified the template so if you look at the actual code, you will see |pid2=P107|qid2=Q215627, but the template doesn't display it yet. Legoktm (talk) 15:14, 19 February 2013 (UTC)Reply
Note that as of a while ago, the first category is running. Legoktm (talk) 15:18, 19 February 2013 (UTC)Reply
Wow, I'm impressed! --Kolja21 (talk) 15:21, 19 February 2013 (UTC)Reply
Wheee. :D Jon Harald Søby (talk) 22:28, 19 February 2013 (UTC)Reply
First is   Done, second is   In progress. Legoktm (talk) 12:18, 28 February 2013 (UTC)Reply

  Done! Managed to create Q6000000 in the process :) Legoktm (talk) 08:06, 7 March 2013 (UTC)Reply

Gender categories from German Wikipedia

edit
Same as the one above. The categories have the same criteria for inclusion, namely that the contents are only real people, so here are ~470000 more articles. :-D
Same status as above. Legoktm (talk) 07:16, 19 February 2013 (UTC)Reply
  Done first, second is queued. Legoktm (talk) 09:12, 7 March 2013 (UTC)Reply
Second is   In progress, using a much much more optimized script. Legoktm (talk) 08:34, 23 March 2013 (UTC)Reply

Further investigation

edit

Categories that will be useful, may just need formatting or investigation whether they are accurate enough.

Italian person data

edit

Italian Wikipedia person data

edit
General discussion: Wikidata:Project chat#Italian person data (now archived -> consensus found); original location User_talk:Legobot/properties.js#Italian_person_data
  • Category: w:it:Categoria:BioBot (200+ thousands people, cf. [1])
  • Properties: w:it:Template:Bio#Tabella_completa, to be fetched from template usage (not everything translated to categories)
    • Examples: name and gender (mandatory), surname, place/date/year of birth/death, one out of 552 defined jobs
    • More details will follow, are you also interested in a mapping for the jobs? There's a lot of them and I doubt any other wiki has them in a structured format. --Nemo 19:27, 17 February 2013 (UTC)Reply
    • I will work on implementing template parsing hopefully by this weekend.
    • Sesso: P21 -> 6581097 if M, 6581072 if F (note that this is used only for grammatical purposes so "intersex" is not used; in non-trivial cases, it may reflect the policy here on Wikidata or not)
        Done
    • LuogoNascita (but LuogoNascitaLink should prevail if available): P19 -> the entry for the item corresponding to the page with that title
        Done
    • LuogoNascitaAlt: same as above, for complex cases with alternatives; maybe a secondary statement for P19? no other property is available
    • NoteNascita: pull sources for the Nascita statements from the ref tags in here.
    • LuogoMorte, LuogoMorteLink and LuogoMorteAlt, NoteMorte: same as above but for P20
        Done
    • Nazionalità: P27 -> linked country
    • NazionalitàNaturalizzato: additional statement to P27
        Done for countries instances of a subclass of state (Q7275) except a few, see list of articles not imported yet and breakdown by their value.
        Info See the map from adjectives to countries. The local information is based on current sources. Except 4 entities to sync, all the values used are compatible with this property. See further discussion.
    • PostNazionalità: this field may contain sources for any of the previous statements (more general ones could also be right after the end of the template or in FineIncipit).
    • FineIncipit: replaces standard occupation etc., maybe add to item description?
    • Immagine: P18 -> image with this name (check if it's on Commons; over 35k usages)
    • For each statement: add as reference the Property:P143 with value Q11920, example cat (update: as discussed at project chat).
    • First name (Nome): P735
        Done where it equals an it.wiki article and hence entity.
    • Last name (Cognome): P734
        Done (same);   Info below on disambiguation pages, transliteration
    • Day and month of birth (GiornoMeseNascita) + Year of birth (AnnoNascita): P569
    • Day and month of death (GiornoMeseMorte) + Year of death (AnnoMorte): P570
      Do not add a date in contrast with a Integrated Authority File (Q36578) statement if available.
        Done in part by Dexbot, dates after 1920.   Info ViscoBot had started but stopped long ago.
        Question I also wrote the code to import dates of birth and death but I'm not running it yet because there is one important question: What is the colander model you use as date of birth and death? in some places Gregorian wasn't common until 1912 so I can't add these dates before 1912 because the bot can't be sure about calender model of these dates  – The preceding unsigned comment was added by Ladsgroup (talk • contribs).
      We're verifying, I'll let you know the final outcome. Past discussions seem to have all agreed on forcing gregorian calendar in the template, with the option to indicate giulian calendar next to it with a warning. --Nemo 13:27, 27 April 2014 (UTC)Reply
    • Title to be used before name, or after it in some languages other than Italian (Titolo): P511 (about 3k usages)
    • Missing properties:
      • Unrecognized citizenship (peoples without state), e.g. Kurds (Cittadinanza)
      • Free text notes on dates of birth/death (NoteNascita, NoteMorte): some sources could be extracted from here. Example content is very varied but in 55 % of cases contains an URL, that could be imported as source.
    This should be it. --Nemo 08:49, 22 February 2013 (UTC)Reply
A proposal on sourcing for Wikidata was moved to Wikidata:Project chat#Proposal: preventive control of imported data correctness
As far as edit summaries go, the bot actually does send proper edit summaries, in the format of Bot: Setting [[Property:{pid}|{pid}]] to [[{target_qid}]]; using [[:{lang}:{source}]]; requested by [[User talk:{user}|{user}]], it's just that the software doesn't support them yet. It may be worth putting this run on hold until the software does support custom summaries.
I do believe that at this point, we may need to look how to properly source these claims, since they are no longer "obvious". Maybe that should be a discussion on Project chat? I believe there are legitimate concerns before this request can go forward, as well as code that I need to work on. Legoktm (talk) 01:23, 23 February 2013 (UTC)Reply
If edit summaries are a problem, we could just use a different username for the bot, like "Italian Wikipedia person data import bot".
What fields are no longer obvious, specifically? Surely place of birth is more "obvious" and less controversial than gender, for instance. I think it makes sense to start only with the "obvious" ones: it seems to me that most worries are about nationalist controversies, so probably those are the only fields to exclude in the first run? Otherwise, sources exist of course, you could pull them at the same time if people feel it can't be done later. --Nemo 08:50, 23 February 2013 (UTC)Reply
Ping. I have updated the data above, it seems to me that we no longer have anything to wait for? Were the easy parts like gender done already? --Nemo 08:58, 23 August 2013 (UTC)Reply

Constellations

edit

None at the moment

Astronomic object

edit

See here ;) --Paperoastro (talk) 12:03, 23 February 2013 (UTC)Reply

Pre-approved

edit

These have been approved to run, and will be sent over once Legoktm fixes Legobot Sven Manguard Wha? 17:10, 9 November 2013 (UTC)Reply

Requests

edit

Please add your requests here:

  Comment: sport (P641) seems a bit odd to me, why not "occupation: hand-ball player

no such occupation exists. --Danrok (talk) 16:50, 15 October 2013 (UTC)Reply
You mean no item ? Then let's create one, association football player (Q937857) is already in use in 1000s of items, and I do not think it should be any different here. --Zolo (talk) 20:46, 15 October 2013 (UTC)Reply
Since when is sport always an occupation? Many modern footballers are employed by a club and paid a salary, that is why footballer is their occupation. This situation does not exist for every sport, and has not existed since the beginning of sport. There are many sports persons who are notable and compete internationally, but are not employed in sport, it is not their occupation, they have a day job. We can't simply mass-claim a sport as an occupation. --Danrok (talk) 21:04, 15 October 2013 (UTC)Reply
I would say it is ok to use "occupation" for unpaid or part-time activities. That might be "hermit" or "slave" too. If someone is known for being a handball player or a writer, I think it makes sense to use "occupation", even if he has a day-job. If it does not, we may need a specific property to express that. I am concerned that properties like "sport" do not have very clear semantics (we don't know for sure whether it means handball player, handball coach or even handball fan), and that properties like this would cause some scattering of information. --Zolo (talk) 07:19, 16 October 2013 (UTC)Reply
The sport property is much the same as genre. Genre doesn't tell us enough on its own, either. But, all becomes clear via other properties and qualifiers. For example, sport = tennis, occupation = coach (Q41583). --Danrok (talk) 17:22, 16 October 2013 (UTC)Reply

Thank you for copypaste.py

edit

Thank you for the very useful tool copypaste.py. Lionel Allorge (talk) 13:37, 18 March 2013 (UTC)Reply

Could you remove two requests from User:Legobot/properties.js please?

edit

Hello, could you remove the two folowing lines from User:Legobot/properties.js please? Somebody deleted the original items and replace them with another which makes no real sense...

I've asked for the undeletion of this items and will make a new request soon. Thanks, Ayack (talk) 09:52, 19 April 2013 (UTC)Reply

  Done. Legoktm (talk) 09:53, 19 April 2013 (UTC)Reply
Wow, it was fast. Thanks! Ayack (talk) 09:59, 19 April 2013 (UTC)Reply
You got lucky actually :P The way it's set up right now, the only way to stop the bot from processing a job after it's been added to the page is by a) blocking it, or b) logging into the Toolserver and killing it (what I did). I think it's time to come up with an alternative solution... Legoktm (talk) 11:28, 19 April 2013 (UTC)Reply
The original items have just been undeleted. Could you add the following lines wich have another qid2?
Thanks again and sorry for the trouble! Ayack (talk) 15:06, 19 April 2013 (UTC)Reply
Added to User talk:Legobot/properties.js/requests. Ayack (talk) 18:28, 24 April 2013 (UTC)Reply

Could you remove request from User:Legobot/properties.js please?

edit

Thanks --Maxx (talk) 20:01, 9 May 2013 (UTC)Reply

there is no such request --Akkakk 21:12, 9 May 2013 (UTC)
Return to the user page of "Legobot/properties.js".