Topic on User talk:Ladsgroup/Flow

Epìdosis (talkcontribs)

Hi! I remember you run a very efficient bot and in the past I asked you some fixes which were very efficient. Now I mostly do fixes through QuickStatements, which is a very good tool, but isn't still able to fix references leaving the statements unchanged. I sometimes notice big groups of items (thousands and tens of thousands) having references which are imprecise or wrong and I don't know who to ask for correction. Could I slowly report you some notable cases of references to be fixed, so that we can slowly deal with them through your bot? I think it is crucial for our data quality having references which are exactly correct, whilst at the moment this fact often doesn't happen. Thank you very much in advance!

Ladsgroup (talkcontribs)

Hey sure. I try to write something but I want to know the exact framework so I don't need write similar code every time, so I would write something general and use that every time.

Can you give me a couple of examples?

Epìdosis (talkcontribs)

OK, great! So, here is a detailed panoramic of the situation. I see three main types of errors to be corrected:

For whichever question, ask me! When you have the bot ready, please start with some test-edits, so that I can have a look. Thank you very much in advance!

Ladsgroup (talkcontribs)

Thanks. I try to tackle it next weekend. This weekend I'm drowning in something personal.

Epìdosis (talkcontribs)

Hi! Any updates? Obviously no urgence, as I said - just a little message in order not to forget myself the issue :)

Ladsgroup (talkcontribs)

Hey, sorry. I have been doing a million things and have been drowning in work but will get to it ASAP. I took some vacation for volunteer work :)

Ladsgroup (talkcontribs)

But it's on my radar, always has been. Don't worry.

Ladsgroup (talkcontribs)

Again. I have not forgotten about this. One day I will get it done. It's just there are so many things to do :(

Ladsgroup (talkcontribs)

Okay, one part is done: The bot now takes a SPARQL query and removes references that are exact duplicates. here's an example. I will write more in next weekends.

Epìdosis (talkcontribs)

Very good, thanks!

Ladsgroup (talkcontribs)

And the second type Let me know if we want to clean up more. First type is very similar to the second one. So consider that done as well. Let's do this then.

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

I'm doing them one by one because there's so many of them and for example the P902 took a day to finish. The P863 is underway

Epìdosis (talkcontribs)

Ok P863!

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

Right now I'm cleaning up the third part of type two () but I will get to others soon.

Epìdosis (talkcontribs)

Very good P4459!

Ladsgroup (talkcontribs)

Done now, Gosh it took days :))) Let me fix type one now.

Ladsgroup (talkcontribs)

Can you give me a SPARQL query for the first type? I'm not good at queries involving refs :(

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

Started

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

Fixing

The third type is not that hard. I thought it's done. Let me double check and clean the mess.

Ladsgroup (talkcontribs)

Re-reading what you wrote for the third type a couple of times and now I get what you want but it's pretty complex. I'll try to see what I can do about it next weekend.

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

Hey, Sure. Just give me a week or two.

Ladsgroup (talkcontribs)

Wrote something that can cleanup duplicates and subsets (e.g. if the reference is fully covered in another reference and more). I already started the bot and it's cleaning. Will continue but I don't think I can clean up more than that as it gets really really complicated.

Epìdosis (talkcontribs)

Perfect! When it finishes, could you schedule it as periodic maintenance (e.g. once a month)? This would assure us the stability of the quality.

Ladsgroup (talkcontribs)

It works based on SPARQL queries. Which queries you want me to run regularly?

Epìdosis (talkcontribs)

Maybe after the cleanup Dexbot is doing now it won't be necessary anymore; I think that these redundant references have been inserted due to an error by Reinheitsgebot, so maybe the error has been solved and the cases won't surge again. Maybe, however, I will give you other queries (of third type) in the future if I find similar problems with different properties.

Epìdosis (talkcontribs)
Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

Hi, you mean the Czech part? I just fixed it and running it again. Everything else has been for really long time now.

Epìdosis (talkcontribs)
Epìdosis (talkcontribs)
Epìdosis (talkcontribs)

Let's sum up the missing ones:

  1. Wikidata:Bot requests#Accademia delle Scienze di Torino multiple references (updated)
  2. https://w.wiki/4HP4: move qualifier statement is subject of (P805) to qualifier field of work (P101).  Done
  3. Wikidata:Bot requests#request to delete wrong references (2021-10-27)  Done in October by MisterSynergy
  4. a little sequel: change stated in (P248) Southern Africa Association for the Advancement of Science (Q7569570) into stated in (P248) Biographical Database of Southern African Science (Q24276683) for these references  Done

Thanks!

Ladsgroup (talkcontribs)

Hi, I started the last one. Will get to the rest slowly.

Epìdosis (talkcontribs)
Ladsgroup (talkcontribs)

The fifth one is running now. I'll check the rest a bit later.

Ladsgroup (talkcontribs)
Reply to "Correct references through bot"