Wikidata:Requests for permissions/Bot/NMBot
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Not done @Notme1560: This request seems to be abandoned, please reopen it if that is not the case. Thanks. Mike Peel (talk) 20:25, 21 July 2020 (UTC)[reply]
NMBot edit
NMBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Notme1560 (talk • contribs • logs)
Task/s: Remove brackets indicating translation around English scholarly article labels titles
Code: GitLab repository has scripts and queries to be copied into pywikibot installation.
Function details:
Selects scholarly article (Q13442814) with a PubMed ID (P698) claim and an English label starting with [
and ending with ].
The English label is converted from [XXX].
to XXX
.
If no title claim exists, it currently exits (uses the content of the old title to build the new title) but this can be refactored later. If multiple title claims exist, it also exits (doesn't handle deprecating multiple previous claims) but this can also be refactored later. Otherwise, the existing title claim is set to deprecated and a new claim with the correct format is added. --Notme1560 (talk) 20:33, 23 March 2019 (UTC)[reply]
- How do you want to reflect that this a) a translation and b) of which language? What about title (P1476)? --Succu (talk) 20:52, 23 March 2019 (UTC)[reply]
- All these articles are from PubMed, but I'm not sure who imported them and when they were imported. The titles with brackets in PubMed are supposed to indicate that the displayed title has been translated to English, but this bot doesn't have access to the untranslated/original titles since it doesn't pull from PubMed's API (only editing existing items). The original language/title information should be on the PubMed page accessible through the PubMed ID (P698) claim which links to the site: (ex)
[Article in Portugese]
source on the site so it can be shown there. Other than that, I'm not sure. --Notme1560 (talk) 21:11, 23 March 2019 (UTC)[reply]
- All these articles are from PubMed, but I'm not sure who imported them and when they were imported. The titles with brackets in PubMed are supposed to indicate that the displayed title has been translated to English, but this bot doesn't have access to the untranslated/original titles since it doesn't pull from PubMed's API (only editing existing items). The original language/title information should be on the PubMed page accessible through the PubMed ID (P698) claim which links to the site: (ex)
- I know. Hence Oppose --Succu (talk) 21:58, 23 March 2019 (UTC)[reply]
- The original title and language can be retrieved from the XML version. In this case <Language>por</Language> <VernacularTitle>Estudo caso-controle com resposta multinomial: uma proposta de análise.</VernacularTitle>. Emijrp (talk) 09:22, 24 March 2019 (UTC)[reply]
- Thanks Emijrp, I guess I will have to integrate the PubMed API now and I guess I can pull other missing article data now as well. I guess this can be closed and I can create a new request with the new tasks and details later. --Notme1560 (talk) 19:09, 24 March 2019 (UTC) (sig added hours later, forgot to sign)[reply]
- I don't see anything wrong with fixing these English labels that are clearly wrong. There's no assertion anywhere that the label is the actual original title of the paper, we have other properties to state that sort of thing. That said, it would be nice to get title in the original language as well. It would also be nice if somebody could fix the rather substantial number of these which have been added with NO label in any language! I'm not sure how they even did that... ArthurPSmith (talk) 17:36, 25 March 2019 (UTC)[reply]
- Sometimes CrossRef provides no title information... Is enWP preferring translated titles as labels? My question above remains unanswered. ([Case-control studies with multinomial responses: a proposal for analysis]. (Q27687073)). --Succu (talk) 20:21, 25 March 2019 (UTC)[reply]
- I don't think it's Crossref that's the problem - here are examples with only a Pubmed ID: Q58595485 and Q61049189. SourceMD must be doing some over-filtering and then somehow creating items with no label at all!? ArthurPSmith (talk) 12:17, 26 March 2019 (UTC)[reply]
- Somehow I missed this request. Thanks for doing it! --- Jura 19:43, 12 May 2019 (UTC)[reply]